Academic Paper

Towards Universal Adversarial Examples and Defenses

Document Type

Conference

Author

Rakin, Adnan Siraj; Wang, Ye; Aeron, Shuchin; Koike-Akino, Toshiaki; Moulin, Pierre; Parsons, Kieran

Source

2021 IEEE Information Theory Workshop (ITW), pp. 1-6, Oct. 2021

Subject

Communication, Networking and Broadcast Technologies
Training
Costs
Computational modeling
Conferences
Neural networks
Rate-distortion
Inference algorithms

Abstract

Adversarial examples have recently exposed the severe vulnerability of neural network models. However, most of the existing attacks require some form of target model information (i.e., weights/model inquiry/architecture) to improve the efficacy of the attack. We leverage the information-theoretic connections between robust learning and generalized rate-distortion theory to formulate a universal adversarial example (UAE) generation algorithm. Our algorithm trains an offline adversarial generator to minimize the mutual information between the label and perturbed data. At the inference phase, our UAE method can efficiently generate effective adversarial examples without high computation cost. These adversarial examples in turn allow for developing universal defenses through adversarial training. Our experiments demonstrate promising gains in improving the training efficiency of conventional adversarial training.
