학술논문

Fast Training Data Generation for Machine Learning Analysis of Cosmic Ray Showers
Document Type
Periodical
Source
IEEE Access Access, IEEE. 11:7410-7419 2023
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Detectors
Cosmic rays
Mesons
Computational modeling
Atmospheric modeling
Training data
Machine learning
Data processing
Cosmic ray shower
simulation
data generation
detectors
machine learning
Language
ISSN
2169-3536
Abstract
Applying Machine Learning (ML) methods for the analysis of muon lateral distributions in Extensive Air Showers detected by citizen science projects, while taking into account the spatial distribution of detectors requires enormous training data sets. Therefore, generating these data sets with typical Monte Carlo (MC) generators like CORSIKA is computationally prohibitive. Here we present a method which by the application of special augmentation procedures produces the training dataset that is compatible in all essential aspects to the data produced with regular MC computations while avoiding their time overhead. We utilize the Nakamura-Kamata-Greisen (NKG) distribution which was proven to be an attractive alternative to full-fledged simulations. The simulation of $10^{4}$ muons at the ground level takes just a few seconds using our implementation of the NKG approach. For $10^{6}$ muons this figure is still around 1 minute. For comparison, CORSIKA based simulation performed on Prometheus supercomputer at CYFRONET computing center an ensemble of $\sim 100$ showers initiated by a particle of $10^{16} eV$ resulted in $\sim 10^{4}$ muons and $\sim 10^{5}$ electrons required computation time of the order of a few days.