Article (Scientific journals)
On the Encoding of Proteins for Disordered Regions Prediction
Becker, Julien; Maes, Francis; Wehenkel, Louis
2013In PLoS ONE
Peer Reviewed verified by ORBi
 

Files


Full Text
journal.pone.0082252.pdf
Publisher postprint (600.29 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Disordered regions; extremely randomized trees; feature selection
Abstract :
[en] Disordered regions, i.e., regions of proteins that do not adopt a stable three-dimensional structure, have been shown to play various and critical roles in many biological processes. Predicting and understanding their formation is therefore a key sub-problem of protein structure and function inference. A wide range of machine learning approaches have been developed to automatically predict disordered regions of proteins. One key factor of the success of these methods is the way in which protein information is encoded into features. Recently, we have proposed a systematic methodology to study the relevance of various feature encodings in the context of disulfide connectivity pattern prediction. In the present paper, we adapt this methodology to the problem of predicting disordered regions and assess it on proteins from the 10th CASP competition, as well as on a very large subset of proteins extracted from PDB. Our results, obtained with ensembles of extremely randomized trees, highlight a novel feature function encoding the proximity of residues according to their accessibility to the solvent, which is playing the second most important role in the prediction of disordered regions, just after evolutionary information. Furthermore, even though our approach treats each residue independently, our results are very competitive in terms of accuracy with respect to the state-of-the-art. A web-application is available at http://m24.giga.ulg.ac.be:81/x3Disorder.
Research center :
GIGA-Bioinformatics
Disciplines :
Life sciences: Multidisciplinary, general & others
Author, co-author :
Becker, Julien ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Maes, Francis
Wehenkel, Louis  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
On the Encoding of Proteins for Disordered Regions Prediction
Publication date :
16 December 2013
Journal title :
PLoS ONE
eISSN :
1932-6203
Publisher :
Public Library of Science, San Franscisco, United States - California
Peer reviewed :
Peer Reviewed verified by ORBi
Funders :
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
Available on ORBi :
since 18 December 2013

Statistics


Number of views
45 (4 by ULiège)
Number of downloads
117 (2 by ULiège)

Scopus citations®
 
7
Scopus citations®
without self-citations
7
OpenCitations
 
9

Bibliography


Similar publications



Contact ORBi