Article (Scientific journals)
On protocols and measures for the validation of supervised methods for the inference of biological networks
Schrynemackers, Marie; Kuffner, Robert; Geurts, Pierre
2013In Frontiers in Genetics, 4 (262)
Peer Reviewed verified by ORBi
 

Files


Full Text
schrynemackers13-frontiers.pdf
Publisher postprint (2.83 MB)
Download
Annexes
schrynemackers13-supplementary.pdf
Publisher postprint (117.13 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
biological network inference; supervised learning; cross-validation; evaluation protocols; ROC curves; precision-recall curves
Abstract :
[en] Networks provide a natural representation of molecular biology knowledge, in particular to model relationships between biological entities such as genes, proteins, drugs, or diseases. Because of the effort, the cost, or the lack of the experiments necessary for the elucidation of these networks, computational approaches for network inference have been frequently investigated in the literature. In this paper, we examine the assessment of supervised network inference. Supervised inference is based on machine learning techniques that infer the network from a training sample of known interacting and possibly non-interacting entities and additional measurement data. While these methods are very effective, their reliable validation in silico poses a challenge, since both prediction and validation need to be performed on the basis of the same partially known network. Cross-validation techniques need to be specifically adapted to classification problems on pairs of objects. We perform a critical review and assessment of protocols and measures proposed in the literature and derive specific guidelines how to best exploit and evaluate machine learning techniques for network inference. Through theoretical considerations and in silico experiments, we analyze in depth how important factors influence the outcome of performance estimation. These factors include the amount of information available for the interacting entities, the sparsity and topology of biological networks, and the lack of experimentally verified non-interacting pairs.
Disciplines :
Computer science
Author, co-author :
Schrynemackers, Marie ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Kuffner, Robert;  Ludwig-Maximilians-University, Munich, Germany > Institute for Practical Informatics and Bioinformatics
Geurts, Pierre  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
On protocols and measures for the validation of supervised methods for the inference of biological networks
Publication date :
03 December 2013
Journal title :
Frontiers in Genetics
eISSN :
1664-8021
Publisher :
Frontiers Media S.A., Switzerland
Volume :
4
Issue :
262
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi :
since 10 December 2013

Statistics


Number of views
115 (25 by ULiège)
Number of downloads
243 (13 by ULiège)

Scopus citations®
 
55
Scopus citations®
without self-citations
53
OpenCitations
 
44
OpenAlex citations
 
72

Bibliography


Similar publications



Contact ORBi