학술논문

Performance analysis of GPU accelerated meshfree q-LSKUM solvers in Fortran, C, Python, and Julia
Document Type
Conference
Source
2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics (HiPC) HIPC High Performance Computing, Data, and Analytics (HiPC), 2022 IEEE 29th International Conference on. :156-165 Dec, 2022
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Measurement
Codes
Graphics processing units
Benchmark testing
Entropy
Performance analysis
Kinetic theory
Fortran
C
Python
Julia
GPUs
CUDA
Mesh-free methods
LSKUM
Code optimisation
Language
ISSN
2640-0316
Abstract
This paper presents a comprehensive analysis of the performance of Fortran, C, Python, and Julia based GPU accelerated meshfree solvers for compressible flows. The programming model CUDA is used to develop the GPU codes. The meshfree solver is based on the least squares kinetic upwind method with entropy variables (q-LSKUM). To measure the performance of baseline codes, benchmark calculations are performed. The codes are then profiled to investigate the differences in their performance. Analysing various performance metrics for the computationally expensive flux residual kernel helped identify various bottlenecks in the codes. To resolve the bottlenecks, several optimisation techniques are employed. Post optimisation, the performance metrics have improved significantly, with the C GPU code exhibiting the best performance.