Abstract: Deep Neural Networks (DNNs) are vulnerable to specially crafted adversarial examples and are prone to deception. Although current detection techniques can identify some malicious inputs, their protective capabilities remain insufficient against sophisticated attacks. This paper proposes a novel unsupervised adversarial example detection method based on unlabeled data. The core idea is to recast adversarial example detection as an anomaly detection problem through feature construction and fusion. To this end, five core components are designed: image transformation, a neural network classifier, heatmap generation, distance calculation, and an anomaly detector. First, the original image is transformed, and both the original and transformed images are fed into the neural network classifier; the prediction probability arrays and convolutional layer features are extracted to generate heatmaps. The detector is thereby extended from focusing solely on the model's output layer to input-layer features as well, enhancing its ability to model and measure the disparities between adversarial and normal samples. Next, the KL divergence between the probability arrays and the shift distance of the heatmap focus points before and after transformation are computed, and these distance features are fed into the anomaly detector to determine whether the example is adversarial. Experiments on the large-scale, high-quality ImageNet dataset show that our detector achieves an average AUC (Area Under the ROC Curve) of 0.77 against five different types of attacks, demonstrating robust detection performance. Compared with other state-of-the-art unsupervised adversarial example detectors, our detector attains a substantially higher TPR (True Positive Rate) while maintaining a comparable false-alarm rate, indicating a significant advantage in detection capability.
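The detection pipeline described above can be sketched as follows. This is a minimal illustration, not the paper's exact implementation: the function names, the use of scikit-learn's IsolationForest as the anomaly detector, and the synthetic feature values are all assumptions made for demonstration.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

def kl_divergence(p, q, eps=1e-12):
    # KL divergence between the prediction probability arrays of the
    # original and transformed images (clipped to avoid log(0)).
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.sum(p * np.log(p / q)))

def focus_shift(heatmap_a, heatmap_b):
    # Euclidean distance between the focus points (here: peak activations)
    # of the heatmaps before and after the image transformation.
    ya, xa = np.unravel_index(np.argmax(heatmap_a), heatmap_a.shape)
    yb, xb = np.unravel_index(np.argmax(heatmap_b), heatmap_b.shape)
    return float(np.hypot(ya - yb, xa - xb))

# Fit the anomaly detector on distance features computed from (assumed)
# normal images; each row is [KL divergence, focus-point shift].
rng = np.random.default_rng(0)
benign_features = rng.normal(loc=[0.1, 2.0], scale=[0.05, 1.0], size=(200, 2))
detector = IsolationForest(random_state=0).fit(benign_features)

# An input whose probabilities and heatmap focus change sharply under the
# transformation yields anomalous features and is flagged as adversarial.
candidate = np.array([[3.5, 40.0]])
print(detector.predict(candidate))  # -1 marks an anomaly (adversarial)
```

The intuition this sketch captures is the paper's core idea: adversarial perturbations are fragile under image transformation, so their output distributions and attention heatmaps shift far more than those of normal samples, making the distance features separable by a one-class anomaly detector.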