Abegaz F, Van Lishout F, Mahachie John JM, Chaichoompu K, Bhardwaj A, Gusareva E, Wei Z, Hakonarsson H, Van Steen K (2018) Epistasis detection using model-based multifactor dimensionality reduction in structured populations. bioRxiv 541946
Alanis-Lobato G, Cannistraci CV, Eriksson A, Manica A, Ravasi T (2015) Highlighting nonlinear patterns in population genetics datasets. Sci Rep 5:8140
Bateson W (1907) Facts limiting the theory of heredity. Science 26:649–660
Belsley DA, Kuh E, Welsch RE (1980) Regression diagnostics: identifying influential data and sources of collinearity. Wiley, New York
Benjamini Y, Yekutieli D (2001) The control of the false discovery rate in multiple testing under dependency. Ann Stat 29:1165–1188
Bessonov K, Gusareva ES, Van Steen K (2015) A cautionary note on the impact of protocol changes for genome-wide association SNP x SNP interaction studies: an example on ankylosing spondylitis. Hum Genet 134:761–773
Box GEP, Cox DR (1964) An analysis of transformations. J R Stat Soc Ser B 26:211–252
Bradley JV (1978) Robustness? Br J Math Stat Psychol 31:144–152
Cattaert T, Calle ML, Dudek SM, Mahachie John JM, Van Lishout F, Urrea V, Ritchie MD, Van Steen K (2011) A detailed view on model-based multifactor dimensionality reduction for detecting gene-gene interactions in case-control data in the presence of noise. Ann Hum Genet 75:78–89
Collins RL, Hu T, Wejse C, Sirugo G, Williams SM, Moore JH (2013) Multifactor dimensionality reduction reveals a three-locus epistatic interaction associated with susceptibility to pulmonary tuberculosis. BioData Min 6:4
Cordell HJ (2002) Epistasis: what it means, what it does not mean, and statistical methods to detect it in humans. Hum Mol Genet 11:2463–2468
Cordell HJ (2009) Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet 10:392–404
Culverhouse R, Suarez BK, Lin J, Reich T (2002) A perspective on epistasis: limits of models displaying no main effect. Am J Hum Genet 70:461–471
de Leeuw CA, Neale BM, Heskes T, Posthuma D (2016) The statistical properties of gene-set analysis. Nat Rev Genet 17:353–364
Dering C, Konig IR, Ramsey LB, Relling MV, Yang W, Ziegler A (2014) A comprehensive evaluation of collapsing methods using simulated and real data: excellent annotation of functionality and large sample sizes required. Front Genet 5:323
Dohoo IR, Ducrot C, Fourichon C, Donald A, Hurnik D (1996) An overview of techniques for dealing with large numbers of independent variables in epidemiologic studies. Prev Vet Med 29:221–239
Duraisingh MT, Refour P (2005) Multiple drug resistance genes in malaria— from epistasis to epidemiology. Mol Microbiol 57:874–877
Ebbert MTW, Ridge PG, Kauwe JSK (2015) Bridging the gap between statistical and biological epistasis in Alzheimer’s disease. BioMed Res Int 2015:870123
Evans DM, Spencer CC, Pointon JJ, Su Z, Harvey D, Kochan G, Oppermann U, Dilthey A, Pirinen M, Stone MA et al (2011) Interaction between ERAP1 and HLA-B27 in ankylosing spondylitis implicates peptide handling in the mechanism for HLA-B27 in disease susceptibility. Nat Genet 43:761–767
Fisher RA (1918) The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edin 52:399–433
Fouladi R, Bessonov K, Van Lishout F, Van Steen K (2015) Model-based multifactor dimensionality reduction for rare variant association analysis. Hum Hered 79:157–167
Frånberg M, Gertow K, Hamsten A, Consortium PROCARDIS, Lagergren J, Sennblad B (2015) Discovering genetic interactions in large-scale association studies by stage-wise likelihood ratio tests. PLoS Genet 11:e1005502
Friedman J (1991) Multivariate adaptive regression splines. Ann Stat 19:1–67
Glantz SA, Slinker BK (1990) Primer of applied regression and analysis of variance. McGraw-Hill, New York
Goeman JJ, Solari A (2014) Multiple hypothesis testing in genomics. Stat Med 33:1946–1978
Gola D, König IR (2016) Identification of interactions using model-based multifactor dimensionality reduction. BMC Proc 10:135–139
Gonzalez-Dominguez J, Wienbrandt L, Kassens JC, Ellinghaus D, Schimmler M, Schmidt B (2015) Parallelizing epistasis detection in GWAS on FPGA and GPU-accelerated computing systems. IEEE/ACM Trans Comput Biol Bioinform 12:982–994
Greene CS, Penrod NM, Kiralis J, Moore JH (2009a) Spatially uniform relieff (SURF) for computationally-efficient filtering of gene-gene interactions. BioData Min 2:5
Greene CS, Penrod NM, Williams SM, Moore JH (2009b) Failure to replicate a genetic association may provide important clues about genetic architecture. PLoS One 4:e5639
Greene CS, Sinnott-Armstrong NA, Himmelstein DS, Park PJ, Moore JH, Harris BT (2010) Multifactor dimensionality reduction for graphics processing units enables genome-wide testing of epistasis in sporadic ALS. Bioinformatics 26:694–695
Greene CS, Himmelstein DS, Nelson HH, Kelsey KT, Williams SM, Andrew AS, Karagas MR, Moore JH: Enabling personal genomics with an explicit test of epistasis. Pac Symp Biocomput 2010:327–336
Greenland S, Rothman KJ (1998) Concepts of interaction., 2nd edn. Lippincott-Raven, Philadelphia
Guerrero VM, Johnson RA (1982) Use of the Box-Cox transformation with binary response models. Biometrika 69:309–314
Gundlach S, Kässens JC, Wienbrandt L (2016) Genome-wide association interaction studies with MB-MDR and maxT multiple testing correction on FPGAs. Procedia Comput Sci 80:639–649
Gusareva ES, Van Steen K (2014) Practical aspects of genome-wide association interaction analysis. Hum Genet 133:1343–1358
Gusareva ES, Carrasquillo MM, Bellenguez C, Cuyvers E, Colon S, Graff-Radford NR, Petersen RC, Dickson DW, Mahachie John JM, Bessonov K et al (2014) Genome-wide association interaction analysis for Alzheimer’s disease. Neurobiol Aging 35:2436–2443
Gusareva E, Twizere JC, Sleegers K, Dourlen P, Abisambra JF, Meier S, Cloyd R, Weiss B, Dermaut B, Bessonov K et al (2018) Male-specific epistasis between WWC1 and TLN2 genes is associated with Alzheimer’s disease. Neurobiol Aging 72:188.e3–188.e12. 10.1016/j.neurobiolaging.2018.08.001
Guyon I, Elisseeff A (2003) An introduction to variable selection and feature selection. J Mach Learn Res 3:1157–1182
Hu X, Liu Q, Zhang Z, Li Z, Wang S, He L, Shi Y (2010) SHEsisEpi, a GPU-enhanced genome-wide SNP-SNP interaction scanning algorithm, efficiently reveals the risk genetic epistasis in bipolar disorder. Cell Res 20:854–857
Hu T, Chen Y, Kiralis JW, Moore JH (2013) ViSEN: methodology and software for visualization of statistical epistasis networks. Genet Epidemiol 37:283–285
Hu T, Andrew AS, Karagas MR, Moore JH: Statistical epistasis networks reduce the computational complexity of searching three-locus genetic models. Pac Symp Biocomput 2013:397–408
Jorgenson E, Witte JS (2006) A gene-centric approach to genome-wide association studies. Nat Rev Genet 7:885–891
Kam-Thong T, Czamara D, Tsuda K, Borgwardt K, Lewis CM, Erhardt-Lehmann A, Hemmer B, Rieckmann P, Daake M, Weber F et al (2011) EPIBLASTER-fast exhaustive two-locus epistasis detection strategy using graphical processing units. Eur J Hum Genet 19:465–471
Kam-Thong T, Azencott CA, Cayton L, Putz B, Altmann A, Karbalai N, Samann PG, Scholkopf B, Muller-Myhsok B, Borgwardt KM (2012) GLIDE: GPU-based linear regression for detection of epistasis. Hum Hered 73:220–236
Kim MK, Moore JH, Kim JK, Cho KH, Cho YW, Kim YS, Lee MC, Kim YO, Shin MH (2011) Evidence for epistatic interactions in antiepileptic drug resistance. J Hum Genet 56:71–76
Knol MJ, VanderWeele TJ (2012) Recommendations for presenting analyses of effect modification and interaction. Int J Epidemiol 41:514–520
Lareau CA, McKinney BA (2015) Network theory for data-driven epistasis networks. Methods Mol Biol 1253:285–300
Lehner B (2011) Molecular mechanisms of epistasis within and between genes. Trends Genet 27:323–331
Li W, Reich J (2000) A complete enumeration and classification of two-locus disease models. Hum Hered 50:334–349
Li J, Malley JD, Andrew AS, Karagas MR, Moore JH (2016) Detecting gene–gene interactions using a permutation-based random forest method. BioData Min 9:14
Liekens AM, De Knijf J, Daelemans W, Goethals B, De Rijk P, Del-Favero J (2011) BioGraph: unsupervised biomedical knowledge discovery via automated hypothesis generation. Genome Biol 12:R57
Liu Y, Koyuturk M, Barnholtz-Sloan JS, Chance MR (2012) Gene interaction enrichment and network analysis to identify dysregulated pathways and their interactions in complex diseases. BMC Syst Biol 6:65
Mahachie John JM, Van Lishout F, Van Steen K (2011a) Model-based multifactor dimensionality reduction to detect epistasis for quantitative traits in the presence of error-free and noisy data. Eur J Hum Genet 19:696–703
Mahachie John JM, Cattaert T, De Lobel L, Van Lishout F, Empain A, Van Steen K (2011b) Comparison of genetic association strategies in the presence of rare alleles. BMC Proc 5(Suppl 9):S32
Martin ER, Tunc I, Liu Z, Slifer SH, Beecham AH, Beecham GW (2018) Properties of global- and local-ancestry adjustments in genetic association tests in admixed populations. Genet Epidemiol 42:214–229
Montojo J, Zuberi K, Rodriguez H, Bader GD, Morris Q (2014) GeneMANIA: fast gene network construction and function prediction for cytoscape. F1000Res 3:153
Moore JH (2015) Epistasis analysis using ReliefF. Methods Mol Biol 1253:315–325
Moore JH, Hu T (2015) Epistasis analysis using information theory. Methods Mol Biol 1253:257–268
Moore JH, Williams SM (2005) Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis. Bioessays 27:637–646
Moore JH, Williams SM (2009) Epistasis and its implications for personal genetics. Am J Hum Genet 85:309–320
Moore JH, Gilbert JC, Tsai CT, Chiang FT, Holden T, Barney N, White BC (2006) A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. J Theor Biol 241:252–261
Moore JH, Amos R, Kiralis J, Andrews PC (2015) Heuristic identification of biological architectures for simulating complex hierarchical genetic interactions. Genet Epidemiol 39:25–34
Moskvina V, Schmidt KM (2008) On multiple-testing correction in genome-wide association studies. Genet Epidemiol 32:567–573
Nicodemus KK, Liu W, Chase GA, Tsai YY, Fallin MD (2005) Comparison of type I error for multiple test corrections in large single-nucleotide polymorphism studies using principal components versus haplotype blocking algorithms. BMC Genet 6(Suppl 1):S78
North BV, Curtis D, Sham PC (2005) Application of logistic regression to case-control association studies involving two causative loci. Hum Hered 59:79–87
Norton B, Pearson ES (1976) A note on the background to and refereeing of R.A. Fisher’s 1918 paper ‘The correlation between relatives on the supposition of Mendelian inheritance’. Notes Rec R Soc Lond 31:151–162
Nyholt DR (2004) A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. Am J Hum Genet 74:765–769
Park MY, Hastie T (2008) Penalized logistic regression for detecting gene interactions. Biostatistics 9:30–50
Phillips PC (1998) The language of gene interaction. Genetics 149:1167–1171
Phillips PC (2008) Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet 9:855–867
Piette ER, Moore JH (2017) Improving the reproducibility of genetic assoication results using genotype resampling methods. Lect Notes Comput Sci 10199:69–108
Piriyapongsa J, Ngamphiw C, Intarapanich A, Kulawonganunchai S, Assawamakin A, Bootchai C, Shaw PJ, Tongsima S (2012) iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies. BMC Genom 13(Suppl 7):S2
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575
Putz B, Kam-Thong T, Karbalai N, Altmann A, Muller-Myhsok B (2013) Cost-effective GPU-grid for genome-wide epistasis calculations. Methods Inf Med 52:91–95
Ritchie MD (2011) Using biological knowledge to uncover the mystery in the search for epistasis in genome-wide association studies. Ann Hum Genet 75:172–182
Ritchie MD, Van Steen K (2018) The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation. Ann Transl Med 6:157
Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF, Moore JH (2001) Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet 69:138–147
Sackton TB, Hartl DL (2016) Genotypic context and epistasis in individuals and populations. Cell 166:279–287
Satagopan JM, Elston RC (2013) Evaluation of removable statistical interaction for binary traits. Stat Med 32:1164–1190
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13:2498–2504
Slim L, Chatelain C, Azencott CA, Vert J-P (2018) Novel methods for epistasis detection in genome-wide association studies. bioRxiv 442749
Sun W, Cai TT (2009) Large-scale multiple testing under dependence. J R Statist Soc B (2009) 71:393–424
Sun X, Lu Q, Mukherjee S, Crane PK, Elston R, Ritchie MD (2014) Analysis pipeline for the epistasis search-statistical versus biological filtering. Front Genet 5:106
Ueki M, Cordell HJ (2012) Improved statistics for genome-wide interaction analysis. PLoS Genet 8:e1002625
Urbanowicz RJ, Kiralis J, Fisher JM, Moore JH (2012a) Predicting the difficulty of pure, strict, epistatic models: metrics for simulated model selection. BioData Min 5:15
Urbanowicz RJ, Kiralis J, Sinnott-Armstrong NA, Heberling T, Fisher JM, Moore JH (2012b) GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures. BioData Min 5:16
Van Steen K (2012) Travelling the world of gene-gene interactions. Brief Bioinform 13:1–19
Van Steen K, Malats N (2015) Perspectives on data integration in human complex disease analysis. In: Big data analytics in bioinformatics and healthcare. IGI GLobal, pp 284–322. https://www.igi-global.com/chapter/perspectives-on-data-integration-in-human-complex-disease-analysis/121463
Van Steen K, Molenberghs G (2012) Multicollinearity. In: Chow S-C (ed) Encyclopedia of biopharmaceutical statistics, 3rd edn
Van Steen K, McQueen MB, Herbert A, Raby B, Lyon H, Demeo DL, Murphy A, Su J, Datta S, Rosenow C et al (2005) Genomic screening and replication using the same data set in family-based association testing. Nat Genet 37:683–691
Van Lishout F, Mahachie John JM, Gusareva ES, Urrea V, Cleynen I, Theatre E, Charloteaux B, Calle ML, Wehenkel L, Van Steen K (2013) An efficient algorithm to perform multiple testing in epistasis screening. BMC Bioinformatics 14:138
Van Lishout F, Gadaleta F, Moore JH, Wehenkel L, Van Steen K (2015) gammaMAXT: a fast multiple-testing correction algorithm. BioData Min 8:36
VanderWeele TJ (2009) On the distinction between interaction and effect modification. Epidemiology 20:863–871
VanderWeele TJ, Laird NM (2011) Tests for compositional epistasis under single interaction-parameter models. Ann Hum Genet 75:146–156
Vansteelandt S, Vanderweele TJ, Robins JM (2008) Multiply robust inference for statistical interactions. J Am Stat Assoc 103:1693–1704
Vansteelandt S, Bekaert M, Claeskens G (2012) On model selection and model misspecification in causal inference. Stat Methods Med Res 21:7–30
Vermeulen SH, Den Heijer M, Sham P, Knight J (2007) Application of multi-locus analytical methods to identify interacting loci in case-control studies. Ann Hum Genet 71:689–700
Wan X, Yang C, Yang Q, Xue H, Fan X, Tang NL, Yu W (2010) BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies. Am J Hum Genet 87:325–340
Wang X, Elston RC, Zhu X (2010) The meaning of interaction. Hum Hered 70:269–277
Wang Z, Wang Y, Tan KL, Wong L, Agrawal D (2011) eCEO: an efficient cloud epistasis computing model in genome-wide association study. Bioinformatics 27:1045–1051
Wang MH, Sun R, Guo J, Weng H, Lee J, Hu I, Sham PC, Zee BC (2016) A fast and powerful W-test for pairwise epistasis testing. Nucleic Acids Res 44:e115
Wei WH, Hemani G, Haley CS (2014) Detecting epistasis in human complex traits. Nat Rev Genet 15:722–733
Westfall PH, Young SS (1993) Resampling-base multiple testing. Wiley, New York
Wilson BA, Garud NR, Feder AF, Assaf ZJ, Pennings PS (2016) The population genetics of drug resistance evolution in natural populations of viral, bacterial and eukaryotic pathogens. Mol Ecol 25:42–66
Winham SJ, Colby CL, Freimuth RR, Wang X, de Andrade M, Huebner M, Biernacka JM (2012) SNP interaction detection with random forests in high-dimensional genetic data. BMC Bioinform 13:164
Wong AK, Krishnan A, Yao V, Tadych A, Troyanskaya OG (2015) IMP 2.0: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks. Nucleic Acids Res 43:W128–W133
Zhang Q (2015) Associating rare genetic variants with human diseases. Front Genet 6:133
Zhang F, Boerwinkle E, Xiong M (2014) Epistasis analysis for quantitative traits by functional regression model. Genome Res 24:989–998