Algorithms; Cerebral Cortex/physiology/physiopathology; Humans; Image Processing, Computer-Assisted/methods; Magnetic Resonance Imaging/methods; Neuroimaging/methods; Pain Measurement/methods; Statistics, Nonparametric; conjunctions; general linear model; multiple testing; non-parametric combination; permutation tests
Abstract :
In this work, we show how permutation methods can be applied to combination analyses such as those that include multiple imaging modalities, multiple data acquisitions of the same modality, or simply multiple hypotheses on the same data. Using the well-known definition of union-intersection tests and closed testing procedures, we use synchronized permutations to correct for such multiplicity of tests, allowing flexibility to integrate imaging data with different spatial resolutions, surface and/or volume-based representations of the brain, including non-imaging data. For the problem of joint inference, we propose and evaluate a modification of the recently introduced non-parametric combination (NPC) methodology, such that instead of a two-phase algorithm and large data storage requirements, the inference can be performed in a single phase, with reasonable computational demands. The method compares favorably to classical multivariate tests (such as MANCOVA), even when the latter is assessed using permutations. We also evaluate, in the context of permutation tests, various combining methods that have been proposed in the past decades, and identify those that provide the best control over error rate and power across a range of situations. We show that one of these, the method of Tippett, provides a link between correction for the multiplicity of tests and their combination. Finally, we discuss how the correction can solve certain problems of multiple comparisons in one-way ANOVA designs, and how the combination is distinguished from conjunctions, even though both can be assessed using permutation tests. We also provide a common algorithm that accommodates combination and correction.
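The link the abstract draws between Tippett's method and multiplicity correction can be illustrated with a minimal sketch. This is not the authors' algorithm or software. It assumes a one-sided two-group comparison, a pooled-variance t statistic, and partial tests that share the same null distribution, so that taking the largest statistic stands in for taking the smallest p-value. The function names `t_stat` and `tippett_by_permutation` and the simulated data are hypothetical.

```python
import random
from statistics import mean, variance


def t_stat(values, labels):
    """Pooled-variance two-sample t statistic (group 1 minus group 0)."""
    a = [v for v, g in zip(values, labels) if g == 1]
    b = [v for v, g in zip(values, labels) if g == 0]
    na, nb = len(a), len(b)
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / (pooled * (1 / na + 1 / nb)) ** 0.5


def tippett_by_permutation(data, labels, n_perm=500, seed=0):
    """data: K lists (one per modality), each holding one value per subject.

    Assumption: every modality yields a statistic with the same null
    distribution, so the maximum statistic plays the role of the minimum
    p-value in Tippett's combining function.
    """
    rng = random.Random(seed)
    observed = [t_stat(mod, labels) for mod in data]
    # The unpermuted labelling counts as the first permutation.
    max_null = [max(observed)]
    shuffled = list(labels)
    for _ in range(n_perm - 1):
        # Synchronized permutation: one shuffle reused for all modalities.
        rng.shuffle(shuffled)
        max_null.append(max(t_stat(mod, shuffled) for mod in data))

    def pval(t):
        return sum(m >= t for m in max_null) / len(max_null)

    corrected = [pval(t) for t in observed]  # corrected p-value per modality
    combined = pval(max(observed))           # joint p-value across modalities
    return observed, corrected, combined


# Simulated example: 3 modalities, 10 subjects per group, a shift in group 1.
demo_rng = random.Random(1)
labels = [0] * 10 + [1] * 10
data = [[demo_rng.gauss(0.8 * g, 1.0) for g in labels] for _ in range(3)]
observed, corrected, combined = tippett_by_permutation(data, labels)
print(corrected, combined)
```

In this sketch the same permutation distribution of the maximum statistic gives both the corrected p-value of each modality and the combined p-value, and the combined p-value equals the smallest corrected one. Other combining functions would need partial p-values rather than raw statistics, which this sketch does not cover.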
Disciplines :
Physical, chemical, mathematical & earth Sciences: Multidisciplinary, general & others
Author, co-author :
Winkler, Anderson ; Université de Liège - ULiège > Form. doc. sc. bioméd. & pharma.
Webster, Matthew A.
Brooks, Jonathan C.
Tracey, Irene
Smith, Stephen M.
Nichols, Thomas E.
Language :
English
Title :
Non-parametric combination and related permutation tests for neuroimaging.
Publication date :
2016
Journal title :
Human Brain Mapping
ISSN :
1065-9471
eISSN :
1097-0193
Publisher :
John Wiley & Sons, Hoboken, United States - New Jersey
Volume :
37
Issue :
4
Pages :
1486-1511
Peer reviewed :
Peer Reviewed verified by ORBi
Commentary :
© 2016 The Authors. Human Brain Mapping published by Wiley Periodicals, Inc.