학술논문

Variability in the analysis of a single neuroimaging dataset by many teams
Document Type
article
Author
Botvinik-Nezer, RotemHolzmeister, FelixCamerer, Colin FDreber, AnnaHuber, JuergenJohannesson, MagnusKirchler, MichaelIwanir, RoniMumford, Jeanette AAdcock, R AlisonAvesani, PaoloBaczkowski, Blazej MBajracharya, AahanaBakst, LeahBall, SherylBarilari, MarcoBault, NadègeBeaton, DerekBeitner, JuliaBenoit, Roland GBerkers, Ruud MWJBhanji, Jamil PBiswal, Bharat BBobadilla-Suarez, SebastianBortolini, TiagoBottenhorn, Katherine LBowring, AlexanderBraem, SenneBrooks, Hayley RBrudner, Emily GCalderon, Cristian BCamilleri, Julia ACastrellon, Jaime JCecchetti, LucaCieslik, Edna CCole, Zachary JCollignon, OlivierCox, Robert WCunningham, William ACzoschke, StefanDadi, KamalakerDavis, Charles PLuca, Alberto DeDelgado, Mauricio RDemetriou, LysiaDennison, Jeffrey BDi, XinDickie, Erin WDobryakova, EkaterinaDonnat, Claire LDukart, JuergenDuncan, Niall WDurnez, JokeEed, AmrEickhoff, Simon BErhart, AndrewFontanesi, LauraFricke, G MatthewFu, ShiguangGalván, AdrianaGau, RemiGenon, SarahGlatard, TristanGlerean, EnricoGoeman, Jelle JGolowin, Sergej AEGonzález-García, CarlosGorgolewski, Krzysztof JGrady, Cheryl LGreen, Mikella AGuassi Moreira, João FGuest, OliviaHakimi, ShabnamHamilton, J PaulHancock, RoelandHandjaras, GiacomoHarry, Bronson BHawco, ColinHerholz, PeerHerman, GabrielleHeunis, StephanHoffstaedter, FelixHogeveen, JeremyHolmes, SusanHu, Chuan-PengHuettel, Scott AHughes, Matthew EIacovella, VittorioIordan, Alexandru DIsager, Peder MIsik, Ayse IJahn, AndrewJohnson, Matthew RJohnstone, TomJoseph, Michael JEJuliano, Anthony CKable, Joseph WKassinopoulos, MichalisKoba, CemalKong, Xiang-Zhen
Source
Nature. 582(7810)
Subject
Generic health relevance
Brain
Data Analysis
Data Science
Datasets as Topic
Female
Functional Neuroimaging
Humans
Logistic Models
Magnetic Resonance Imaging
Male
Meta-Analysis as Topic
Models
Neurological
Reproducibility of Results
Research Personnel
Software
General Science & Technology
Language
Abstract
Data analysis workflows in many scientific domains have become increasingly complex and flexible. Here we assess the effect of this flexibility on the results of functional magnetic resonance imaging by asking 70 independent teams to analyse the same dataset, testing the same 9 ex-ante hypotheses1. The flexibility of analytical approaches is exemplified by the fact that no two teams chose identical workflows to analyse the data. This flexibility resulted in sizeable variation in the results of hypothesis tests, even for teams whose statistical maps were highly correlated at intermediate stages of the analysis pipeline. Variation in reported results was related to several aspects of analysis methodology. Notably, a meta-analytical approach that aggregated information across teams yielded a significant consensus in activated regions. Furthermore, prediction markets of researchers in the field revealed an overestimation of the likelihood of significant findings, even by researchers with direct knowledge of the dataset2-5. Our findings show that analytical flexibility can have substantial effects on scientific conclusions, and identify factors that may be related to variability in the analysis of functional magnetic resonance imaging. The results emphasize the importance of validating and sharing complex analysis workflows, and demonstrate the need for performing and reporting multiple analyses of the same data. Potential approaches that could be used to mitigate issues related to analytical variability are discussed.