Academic Paper

Metamorphic Detection of Adversarial Examples in Deep Learning Models with Affine Transformations
Document Type
Conference
Source
2019 IEEE/ACM 4th International Workshop on Metamorphic Testing (MET), pp. 55-62, May 2019
Subject
Computing and Processing
Testing
Perturbation methods
Deep learning
Neural networks
Marine vehicles
Computational modeling
neural networks
machine learning models
adversarial attacks
adversarial detection
metamorphic testing
Language
English
Abstract
Adversarial attacks are small, carefully crafted perturbations, imperceptible to the naked eye, that when added to an image cause deep learning models to misclassify it, with potentially detrimental outcomes. With the rise of artificial intelligence models in safety- and security-critical industries such as self-driving cars, camera surveillance, and face recognition, there is a growing need to guard against adversarial attacks. In this paper, we present an approach that uses metamorphic testing principles to automatically detect such adversarial attacks. The approach can detect image manipulations so small that they are impossible for a human to detect through visual inspection. By applying metamorphic relations based on distance-ratio-preserving affine image transformations, which compare the behavior of the model on the original and the transformed image, we show that our proposed approach can determine whether or not an input image is adversarial with a high degree of accuracy.
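A minimal sketch of the detection idea the abstract describes, assuming a hypothetical classifier object `model` whose `predict` method takes a batch of images and returns class probabilities; the rotation angles, agreement threshold, and function name are illustrative assumptions, not the authors' implementation.

```python
import numpy as np
from scipy.ndimage import rotate  # rotations are distance-ratio-preserving affine transforms


def is_adversarial(model, image, angles=(5, -5, 10, -10), agree_threshold=0.75):
    """Flag an input image as adversarial when the model's prediction on the
    original disagrees too often with its predictions on affine-transformed copies.

    `model.predict` is assumed to accept a batch of images (N, H, W, C) and
    return class probabilities (hypothetical interface).
    """
    original_label = np.argmax(model.predict(image[np.newaxis])[0])

    agreements = 0
    for angle in angles:
        # Metamorphic relation: a clean image's predicted label is expected to be
        # stable under small distance-ratio-preserving transformations such as rotation.
        transformed = rotate(image, angle, axes=(0, 1), reshape=False, mode="nearest")
        transformed_label = np.argmax(model.predict(transformed[np.newaxis])[0])
        agreements += int(transformed_label == original_label)

    # If too few transformed copies agree with the original prediction,
    # the metamorphic relation is violated and the input is flagged as adversarial.
    return (agreements / len(angles)) < agree_threshold
```

The design choice here is that adversarial perturbations are typically brittle: the same affine transformation that leaves a clean image's classification unchanged tends to break the attack, so disagreement across transformed copies serves as the detection signal.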