[en] Trees are classical data structures allowing effectively classifying and predicting responses. Due to versatility and high performance in classification and prediction, there exist plenty of tree-based methods including popular Conditional Inference Tree (CIT) and Forests (CIF), Random Forests (RF), Randomized Trees (RT), randomized C4.5, etc. In this work we assessed the performance of CIT and CIF methods in correct gene regulatory network (GRN) prediction from expression data by using reference golden standard built from real transcriptional regulatory network of E. coli. The synthetic microarray expression data was obtained from DREAM4 challenge. The performance of each network inference method was assessed via Area Under Receiver Operating Characteristic (AUROC) and Area Under Precision Recall (AUPR) metrics.
Our preliminary results show that CIT and CIF successfully predict directed GRNs at acceptable performance rates although not optimal (the best AUROC at 0.68 and AUPR at 0.13 for CIF and the best AUROC at 0.58 and AUPR at 0.18 for CIT). Surprisingly by using the current aggregation scheme of feature importance that prefers features with the highest number of observations, a single CIT was a better performer compared to CIFs in all 5 networks. Nevertheless, the CIFs showed an overall 10% improvement in AUROC. A single CIT has 24% and CIFs have 27% lower overall performance compared to the best performer of DREAM4 Challenge based on cumulative areas of PR and ROC curves.
We plan to test other feature importance aggregation techniques in a single tree and in tree ensembles in order to outperform the top DREAM4 algorithms. In addition the effects of expression data standardization to unit variance will be presented. In future, the developed CIF framework will be used to perform data integration analysis of multi-omics datasets.
Disciplines :
Biochemistry, biophysics & molecular biology
Author, co-author :
Bessonov, Kyrylo ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Bioinformatique
Language :
English
Title :
Gene Regulatory Network Inference via Conditional Inference Trees and Forests
Publication date :
28 August 2014
Event name :
23rd Annual Conference of International Genetic Epidemiology society (IGES2014)
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.