Eprint already available on another site (E-prints, working papers and research blog)
What Is the Optimal Ranking Score Between Precision and Recall? We Can Always Find It and It Is Rarely F1
Pierard, Sébastien; Deliège, Adrien; Van Droogenbroeck, Marc
2025
 

Files


Full Text
Pierard2025What.pdf
Author preprint (10.5 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Performance; Ranking; Detection; F1; Precision; Recall; Tile; ROC
Abstract :
[en] Ranking methods or models based on their performance is of prime importance but is tricky because performance is fundamentally multidimensional. In the case of classification, precision and recall are scores with probabilistic interpretations that are both important to consider and complementary. The rankings induced by these two scores are often in partial contradiction. In practice, therefore, it is extremely useful to establish a compromise between the two views to obtain a single, global ranking. Over the last fifty years or so,it has been proposed to take a weighted harmonic mean, known as the F-score, F-measure, or $F_β$. Generally speaking, by averaging basic scores, we obtain a score that is intermediate in terms of values. However, there is no guarantee that these scores lead to meaningful rankings and no guarantee that the rankings are good tradeoffs between these base scores. Given the ubiquity of $F_β$ scores in the literature, some clarification is in order. Concretely: (1) We establish that $F_β$-induced rankings are meaningful and define a shortest path between precision- and recall-induced rankings. (2) We frame the problem of finding a tradeoff between two scores as an optimization problem expressed with Kendall rank correlations. We show that $F_1$ and its skew-insensitive version are far from being optimal in that regard. (3) We provide theoretical tools and a closed-form expression to find the optimal value for $β$ for any distribution or set of performances, and we illustrate their use on six case studies.
Research Center/Unit :
Montefiore Institute - Montefiore Institute of Electrical Engineering and Computer Science - ULiège
VIULab
TELIM
Disciplines :
Electrical & electronics engineering
Author, co-author :
Pierard, Sébastien  ;  Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Deliège, Adrien  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Télécommunications
Van Droogenbroeck, Marc  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Télécommunications
Language :
English
Title :
What Is the Optimal Ranking Score Between Precision and Recall? We Can Always Find It and It Is Rarely F1
Publication date :
27 November 2025
Number of pages :
83
Source :
Name of the research project :
ReconnAIssance
ARIAC
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique
SPW EER - Service Public de Wallonie. Economie, Emploi, Recherche
Funding number :
8573; 2010235
Funding text :
S. Piérard is funded by grants 8573 (ReconnAIssance project) and 2010235 (ARIAC by DIGITALWALLONIA4.AI) of the SPW EER, Wallonia, Belgium; A. Deliègeis a F.R.S.-FNRS postdoc researcher.
Available on ORBi :
since 11 December 2025

Statistics


Number of views
16 (0 by ULiège)
Number of downloads
27 (0 by ULiège)

Bibliography


Similar publications



Contact ORBi