1. Adversarial robustness via attention transfer.
- Author
-
Li, Zhuorong, Feng, Chao, Wu, Minghui, Yu, Hongchuan, Zheng, Jianwei, and Zhu, Fanwei
- Subjects
- *
MACHINE learning , *KNOWLEDGE transfer - Abstract
• An adversarial training technique using visual attention is proposed. • Transfer learning is applied to enforce the consistency of attention. • Universal ?rst-order adversary is employed for a broad security guarantee. • The proposed technique is applicable to varied network architectures. • Results show the success of attention transfer to improve adversarial robustness. Deep neural networks are known to be vulnerable to adversarial attacks. The empirical analysis in our study suggests that attacks tend to induce diverse network architectures to shift the attention to irrelevant regions. Motivated by this observation, we propose a regularization technique which enforces the attentions to be well aligned via the knowledge transfer mechanism, thereby encouraging the robustness. Resultant model exhibits unprecedented robustness, securing 63.81 % adversarial accuracy where the prior art is 51.59 % on CIFAR-10 dataset under PGD attacks. In addition, we go beyond performance to analytically investigate the proposed method as an effective defense. Significantly flattened loss landscape can be observed, demonstrating the promise of the proposed method for improving robustness and thus the deployment in security-sensitive settings. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF