Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification Classes

Ozbulak, Utku; Pintor, Maria; Van Messem, Arnout; De Neve, Wesley

Paper published on a website (Scientific congresses and symposiums)

Ozbulak, Utku; Pintor, Maria; Van Messem, Arnout et al.

2021 • 35th Conference on Neural Information Processing Systems (NeurIPS 2021): Workshop on ImageNet: Past, Present, and Future

Peer reviewed

Permalink
https://hdl.handle.net/2268/289453

Files (2)Send to Details Statistics Bibliography Similar publications

Files

Full Text

21_08_NIPSW_Accept_ImageNetClasses.pdf

Author postprint (388.68 kB)

Download

Annexes

21_08_NIPSW_Accept_ImageNetClasses Supp.pdf

(527.06 kB)

Download

All documents in ORBi are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Abstract :

[en] Although ImageNet was initially proposed as a dataset for performance benchmarking in the domain of computer vision, it also enabled a variety of other research efforts. Adversarial machine learning is one such research effort, employing deceptive inputs to fool models in making wrong predictions. To evaluate attacks and defenses in the field of adversarial machine learning, ImageNet remains one of the most frequently used datasets. However, a topic that is yet to be investigated is the nature of the classes into which adversarial examples are misclassified. In this paper, we perform a detailed analysis of these misclassification classes, leveraging the ImageNet class hierarchy and measuring the relative positions of the aforementioned type of classes in the unperturbed origins of the adversarial examples. We find that 71% of the adversarial examples that achieve model-to-model adversarial transferability are misclassified into one of the top-5 classes predicted for the underlying source images. We also find that a large subset of untargeted misclassifications are, in fact, misclassifications into semantically similar classes. Based on these findings, we discuss the need to take into account the ImageNet class hierarchy when evaluating untargeted adversarial successes. Furthermore, we advocate for future research efforts to incorporate categorical information.

Disciplines :

Computer science

Author, co-author :

Ozbulak, Utku

Pintor, Maria; University of Cagliari

Van Messem, Arnout ; Université de Liège - ULiège > Département de mathématique > Statistique appliquée aux sciences

De Neve, Wesley; Ghent University Global Campus

Language :

English

Title :

Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification Classes

Publication date :

2021

Event name :

35th Conference on Neural Information Processing Systems (NeurIPS 2021): Workshop on ImageNet: Past, Present, and Future

Event date :

December 6–14, 2021

Audience :

International

Peer reviewed :

Peer reviewed

Source :

Openreview

Additional URL :

https://openreview.net/pdf?id=oWk2dULs1x

Available on ORBi :

since 06 April 2022

Statistics

Number of views

44 (6 by ULiège)

Number of downloads

12 (0 by ULiège)

More statistics

Bibliography

Similar publications

Sorry the service is unavailable at the moment. Please try again later.

Name	Provider / Domaine	Expiration	Description
JSESSIONID	Oracle Corporation www.uliege.be	Session	General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.
CookieScriptConsent	CookieScript .uliege.be	1 year	This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name	Provider / Domaine	Expiration	Description
_pk_id	InnoCraft Ltd .uliege.be	1 year	Used to store a few details about the user such as the unique visitor ID
_pk_ses	InnoCraft Ltd .uliege.be	30 minutes	Short lived cookies used to temporarily store data for the visit
_pk_ref	InnoCraft Ltd .uliege.be	6 months	Used to store the attribution information, the referrer initially used to visit the website