Li, Yi and Angelov, Plamen and Suri, Neeraj (2024) Self-Supervised Representation Learning for Adversarial Attack Detection. In: Computer Vision – ECCV 2024: 18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part I. Lecture Notes in Computer Science. Springer, Cham, pp. 236-252. ISBN 9783031730269
ECCV_2024_Paper_Template_Camera_Ready_Version_Copy_.pdf - Accepted Version
Available under License Creative Commons Attribution.
Abstract
Supervised learning-based adversarial attack detection methods rely on large amounts of labeled data and suffer significant performance degradation when the trained model is applied to new domains. In this paper, we propose a self-supervised representation learning framework for adversarial attack detection that addresses this drawback. First, we map the pixels of augmented input images into an embedding space. Then, we employ a prototype-wise contrastive estimation loss to cluster prototypes as latent variables. Additionally, drawing inspiration from the concept of memory banks, we introduce a discrimination bank that distinguishes and learns a representation for each individual instance sharing the same or a similar prototype, establishing a connection between instances and their associated prototypes. Experimental results show that, compared with various benchmark self-supervised vision learning models and supervised adversarial attack detection methods, the proposed model achieves state-of-the-art performance on the adversarial attack detection task across a wide range of images.
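
The abstract does not give the loss formulas, so the following is only a minimal NumPy sketch of the general idea, not the authors' implementation. It assumes a softmax-style (InfoNCE-like) contrast between an embedding and a set of prototypes, and a second contrast between an embedding and a bank of stored per-instance vectors that stands in for the discrimination bank. All function names, the temperature value, and the toy data are hypothetical.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def log_softmax(logits, axis=-1):
    """Numerically stable log-softmax."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def prototype_contrastive_loss(embeddings, prototypes, assignments, temperature=0.1):
    """Pull each embedding toward its assigned prototype and away from the others.

    embeddings:  (N, D) array of image embeddings
    prototypes:  (K, D) array of cluster prototypes
    assignments: (N,) integer array, prototype index assigned to each embedding
    """
    z = l2_normalize(embeddings)
    c = l2_normalize(prototypes)
    logits = z @ c.T / temperature  # (N, K) scaled cosine similarities
    log_probs = log_softmax(logits, axis=1)
    return -log_probs[np.arange(len(z)), assignments].mean()


def instance_bank_loss(embeddings, bank, indices, temperature=0.1):
    """Instance-level contrast against a bank of stored per-instance vectors.

    Assumed stand-in for the discrimination bank: each embedding should match
    its own bank entry (row indices[i]) rather than any other instance's entry.
    """
    z = l2_normalize(embeddings)
    b = l2_normalize(bank)
    logits = z @ b.T / temperature  # (N, M) scaled cosine similarities
    log_probs = log_softmax(logits, axis=1)
    return -log_probs[np.arange(len(z)), indices].mean()


# Toy data: three orthogonal prototypes in an 8-dimensional embedding space,
# with two slightly perturbed instances per prototype.
rng = np.random.default_rng(0)
prototypes = np.eye(3, 8)
assignments = np.array([0, 0, 1, 1, 2, 2])
embeddings = prototypes[assignments] + 0.01 * rng.standard_normal((6, 8))

# Hypothetical bank: one slowly varying copy of each instance's embedding.
bank = embeddings + 0.01 * rng.standard_normal((6, 8))
instance_ids = np.arange(6)

loss_correct = prototype_contrastive_loss(embeddings, prototypes, assignments)
loss_shuffled = prototype_contrastive_loss(embeddings, prototypes, (assignments + 1) % 3)
loss_instance = instance_bank_loss(embeddings, bank, instance_ids)
```

In this toy setup the prototype loss is close to zero when each embedding is paired with its own prototype and grows when the assignments are deliberately shifted. The instance-level term stays noticeably larger than zero, because two instances that share a prototype are nearly identical in cosine similarity; separating such instances is the role the abstract gives to the discrimination bank.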