One of the pressing open problems in computational systems biology is elucidating the topology of gene regulatory networks (GRNs). Systems genetics addresses this problem by exploiting the natural variations between the DNA sequences of related individuals, which can serve as the randomized, multifactorial perturbations needed to recover GRNs.
In this chapter, we present two new methods, GENIE3-SG-joint and GENIE3-SG-sep, for inferring GRNs from systems genetics data. Experiments on the artificial data of the StatSeq benchmark and of the DREAM5 Systems Genetics challenge show that jointly exploiting expression and genetic data is very helpful for recovering GRNs, and one of our methods outperforms the official best-performing method of the DREAM5 challenge by a large margin.