학술논문

GPU-Accelerated Scalable Solver for Large Linear Systems over Finite Fields
Document Type
Conference
Source
2018 Fifth International Conference on Parallel, Distributed and Grid Computing (PDGC) Parallel, Distributed and Grid Computing (PDGC), 2018 Fifth International Conference on. :324-329 Dec, 2018
Subject
Communication, Networking and Broadcast Technologies
Computing and Processing
Graphics processing units
Linear systems
Kernel
Cryptography
Hardware
Computer architecture
Parallel processing
cryptography
GF(2)
GPGPU computing
Language
ISSN
2573-3079
Abstract
Solving large and dense linear systems over finite fields (GF(2)) forms the basis for several crypt-analytic techniques. Many popular cryptographic algorithms like Number Field Sieve for Integer Factorization, Discrete Log Problem, Cryptanalysis of Ciphers and Algebraic attack requires solving dense systems of linear equations. Gaussian Elimination is the natural and popular approach for solving such systems. However, its cubic time complexity makes it very slow and hence, parallelization is made mandatory. In this paper, we propose a GPU-accelerated linear solver over GF(2), based on Gaussian Elimination. Our parallel solver is implemented using NVIDIA CUDA and MPI to utilize the multi-level parallelism on multi-node, multi-GPU systems, which are becoming increasingly common. CUDA-aware MPI is used to leverage GPUDirect P2P and GPUDirect RDMA for optimized intra- and inter-node communication. Our experimental results show that the proposed solver is able to effectively utilize the memory bandwidth on a single Tesla P100 GPU and shows a parallel efficiency of 95% on a 4 X Tesla P100 multi-GPU node. We see 89% and 94% parallel efficiency on DGX systems with 8, Tesla P100 and Tesla V100 GPUs respectively.