Academic Article

Property Analysis of Adversarially Robust Representation / 敵対的サンプルに頑健な特徴表現の性質の分析
Document Type
Journal Article
Source
精密工学会誌 / Journal of the Japan Society for Precision Engineering. 2021, 87(1):83
Subject
adversarial examples
adversarial robustness
interpretability
robust representation
trade-off between accuracy and robustness
Language
Japanese
ISSN
0912-0289
1882-675X
Abstract
In this paper, we address the open question: “What do adversarially robust models look at?” Many recent works report a trade-off between standard accuracy and adversarial robustness. Prior work attributes this trade-off to the possibility that adversarially robust models and models with high standard accuracy depend on very different sets of features. However, the nature of this difference has not been well studied. We analyze this difference visually and quantitatively through various experiments. The experimental results show that adversarially robust models look at things at a larger scale than standard models and pay less attention to fine textures. Furthermore, although adversarially robust features have been claimed to be incompatible with standard accuracy, using them as pre-trained models can even have a positive effect, particularly on low-resolution datasets.