학술논문

WordDeceiver: Black Box Attack on Chinese Text Classification
Document Type
Article
Source
International Journal of Network Security. Vol. 25 Issue 3, p414-424. 11 p.
Subject
Adversarial Attack
Black Box
Chinese Text Classification
Deep Neural Network
Language
英文
ISSN
1816-353X
Abstract
Aiming at the problem that DNNs-based text classification systems are vulnerable to adversarial example attacks, a black-box adversarial attack method of adversarial example generation for Chinese text classification, WordDeceiver, is proposed. In this method, we use the glyph and phonetic features of Chinese characters to construct adversarial search space, determine the replacement order by word saliency and classification probability, generate adversarial examples using word substitution strategy, and design a new method to improve the semantic similarity between the adversarial examples and the original samples. The effectiveness and transferability are verified on different classification datasets using two mainstream models. The experimental results show that WordDeceiver can preserve the original semantics and grammatical correctness to some extent and can be effectively transferred to other models and cloud platforms.

Online Access