28–29 May 2026
HUN-REN Centre
Europe/Budapest timezone

Towards Cost-Effective HEP Simulations Using GAN-Based Data Augmentation

28 May 2026, 17:10
20m
HUN-REN Centre

HUN-REN Centre

1054 Budapest Alkotmány utca 29.
Lecture Session IV

Speaker

Anisa Khatun

Description

Modern high-energy physics analyses rely heavily on large-scale Monte Carlo (MC)
simulations for machine-learning training, efficiency corrections, and systematic
studies. For rare-signal workflows, obtaining sufficiently large reconstructed-level
signal samples often require computationally expensive MC campaigns with large
CPU and storage demands.
This work explores the use of Generative Adversarial Networks (GANs) for
reconstructed-level data augmentation in the ALICE experiment at CERN. The
proposed approach learns the multi-dimensional distribution of reconstructed observables directly from MC and generates statistically consistent synthetic signal
samples for downstream analysis workflows.
The framework is validated through feature-distribution comparisons, correlation
studies, Machine-learning-based classification, and signal extraction tests. The generated samples show good agreement with standard MC while significantly reducing
the marginal cost of producing large reconstructed-level datasets. The method provides a complementary generative layer within the simulation-to-analysis workflow
and demonstrates the potential of AI-driven augmentation for scalable MC-statistics
production in future rare-signal analyses.

Author

Presentation materials

There are no materials yet.