Article (Scientific journals)
A screening methodology based on Random Forests to improve the detection of gene-gene interactions
De Lobel, L.; Geurts, Pierre; Baele, G. et al.
2010In European Journal of Human Genetics, 18 (1127), p. 1132
Peer Reviewed verified by ORBi
 

Files


Full Text
De Lobel_2010_Random forest.pdf
Publisher postprint (385.5 kB)
Request a copy

All documents in ORBi are protected by a user license.

Send to



Details



Abstract :
[en] The search for susceptibility loci in gene-gene interactions imposes a methodological and computational challenge for statisticians because of the large dimensionality inherent to the modelling of gene-gene interactions or epistasis. In an era in which genome-wide scans have become relatively common, new powerful methods are required to handle the huge amount of feasible gene-gene interactions and to weed out false positives and negatives from these results. One solution to the dimensionality problem is to reduce data by preliminary screening of markers to select the best candidates for further analysis. Ideally, this screening step is statistically independent of the testing phase. Initially developed for small numbers of markers, the Multifactor Dimensionality Reduction (MDR) method is a nonparametric, model-free data reduction technique to associate sets of markers with optimal predictive properties to disease. In this study, we examine the power of MDR in larger data sets and compare it with other approaches that are able to identify gene-gene interactions. Under various interaction models (purely and not purely epistatic), we use a Random Forest (RF)-based prescreening method, before executing MDR, to improve its performance. We find that the power of MDR increases when noisy SNPs are first removed, by creating a collection of candidate markers with RFs. We validate our technique by extensive simulation studies and by application to asthma data from the European Committee of Respiratory Health Study II.European Journal of Human Genetics advance online publication, 12 May 2010; doi:10.1038/ejhg.2010.48.
Disciplines :
Genetics & genetic processes
Author, co-author :
De Lobel, L.
Geurts, Pierre ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Baele, G.
Castro-Giner, F.
Kogevinas, M.
Van Steen, Kristel  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Bioinformatique
Language :
English
Title :
A screening methodology based on Random Forests to improve the detection of gene-gene interactions
Publication date :
2010
Journal title :
European Journal of Human Genetics
ISSN :
1018-4813
eISSN :
1476-5438
Publisher :
Natue Publishing Group, United Kingdom
Volume :
18
Issue :
1127
Pages :
1132
Peer reviewed :
Peer Reviewed verified by ORBi
Commentary :
2010/05/13
Available on ORBi :
since 22 May 2010

Statistics


Number of views
95 (12 by ULiège)
Number of downloads
7 (6 by ULiège)

Scopus citations®
 
42
Scopus citations®
without self-citations
39
OpenCitations
 
39

Bibliography


Similar publications



Contact ORBi