학술논문

Speeding up Madgraph5 aMC@NLO through CPU vectorization and GPU offloading: towards a first alpha release
Document Type
Working Paper
Source
Subject
Physics - Computational Physics
High Energy Physics - Experiment
High Energy Physics - Phenomenology
65C05, 81T18, 81V05
C.1.2
D.1.3
G.3
I.6.8
J.2
Language
Abstract
The matrix element (ME) calculation in any Monte Carlo physics event generator is an ideal fit for implementing data parallelism with lockstep processing on GPUs and vector CPUs. For complex physics processes where the ME calculation is the computational bottleneck of event generation workflows, this can lead to large overall speedups by efficiently exploiting these hardware architectures, which are now largely underutilized in HEP. In this paper, we present the status of our work on the reengineering of the Madgraph5_aMC@NLO event generator at the time of the ACAT2022 conference. The progress achieved since our previous publication in the ICHEP2022 proceedings is discussed, for our implementations of the ME calculations in vectorized C++, in CUDA and in the SYCL framework, as well as in their integration into the existing MadEvent framework. The outlook towards a first alpha release of the software supporting QCD LO processes usable by the LHC experiments is also discussed.
Comment: 8 pages, 4 figures, 4 tables; submitted to ACAT 2022 proceedings in IOP; version 2 includes additional references and figure caption fixes as suggested by journal referee