Article (Scientific journals)
Token-Based Detection of Spurious Correlations in Vision Transformers
Kang, Solha; Anzaku, Esla Timothy; De Neve, Wesley et al.
2026In Transactions on Machine Learning Research
Peer Reviewed verified by ORBi
 

Files


Full Text
_6339_Token_Based_Detection_of_.pdf
Publisher postprint (19.22 MB) Creative Commons License - Attribution
Download

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] Due to their powerful feature association capabilities, neural network-based computer vision models have the ability to detect and exploit unintended patterns within the data, potentially leading to correct predictions based on incorrect or unintended but statistically relevant signals. These clues may vary from simple color aberrations to small pieces of text within the image. In situations where these unintended signals align with the predictive task, models can mistakenly link these features with the task and rely on them for making predictions. This phenomenon is referred to as spurious correlations, where patterns appear to be associated with the task but are actually coincidental. As a result, detection and mitigation of spurious correlations have become crucial tasks for building trustworthy, reliable, and generalizable machine learning models. In this work, we present a token-based diagnostic pipeline that applies leave-one-out token removal to detect spurious correlations in vision transformers. The proposed approach quantifies a model's reliance on non-core visual cues through complementary measures that capture both aggregate and localized spurious effects at the token level. Using both supervised and self-supervised trained models, we present large-scale experiments on the ImageNet dataset demonstrating the ability of the proposed method to identify spurious correlations. We also find that, even if the same architecture is used, the training methodology has a substantial impact on the model's reliance on spurious correlations. Furthermore, we show that for certain ImageNet classes, many images exhibit Published in Transactions on Machine Learning Research (04/2026
Disciplines :
Computer science
Author, co-author :
Kang, Solha;  Ghent University Ghent University Global Campus
Anzaku, Esla Timothy;  Ghent University Ghent University Global Campus
De Neve, Wesley;  Ghent University Ghent University Global Campus
Van Messem, Arnout  ;  Université de Liège - ULiège > Mathematics
Vankerschaver, Joris;  Ghent University Ghent University Global Campus
Rameau, Francois;  State University of New
Ozbulak, Utku;  Ghent University Ghent University Global Campus George Mason University, Korea
Language :
English
Title :
Token-Based Detection of Spurious Correlations in Vision Transformers
Publication date :
May 2026
Journal title :
Transactions on Machine Learning Research
eISSN :
2835-8856
Publisher :
OpenReview, Amherst, United States - Massachusetts
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi :
since 18 May 2026

Statistics


Number of views
17 (0 by ULiège)
Number of downloads
28 (0 by ULiège)

Scopus citations®
 
0
Scopus citations®
without self-citations
0

Bibliography


Similar publications



Contact ORBi