Back to Search
Start Over
Cross-validation failure: small sample sizes lead to large error bars
- Source :
- NeuroImage, NeuroImage, 2017, ⟨10.1016/j.neuroimage.2017.06.061⟩, NeuroImage, Elsevier, 2017, ⟨10.1016/j.neuroimage.2017.06.061⟩
- Publication Year :
- 2017
-
Abstract
- International audience; Predictive models ground many state-of-the-art developments in statistical brain image analysis: decoding, MVPA, searchlight, or extraction of biomarkers. The principled approach to establish their validity and usefulness is cross-validation, testing prediction on unseen data. Here, I would like to raise awareness on error bars of cross-validation, which are often underestimated. Simple experiments show that sample sizes of many neuroimaging studies inherently lead to large error bars, eg ±10% for 100 samples. The standard error across folds strongly underestimates them. These large error bars compromise the reliability of conclusions drawn with predictive models, such as biomarkers or methods developments where, unlike with cognitive neuroimaging MVPA approaches, more samples cannot be acquired by repeating the experiment across many subjects. Solutions to increase sample size must be investigated, tackling possible increases in heterogeneity of the data.
- Subjects :
- 0301 basic medicine
model selection
decoding
Computer science
Cognitive Neuroscience
cross-validation
Quantitative Biology - Quantitative Methods
Cross-validation
03 medical and health sciences
0302 clinical medicine
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
Neuroimaging
Statistics - Machine Learning
MVPA
Error bar
Statistics
Image Processing, Computer-Assisted
Humans
Lead (electronics)
Reliability (statistics)
Statistics - Methodology
ACM: I.: Computing Methodologies/I.5: PATTERN RECOGNITION
Brain Mapping
[SCCO.NEUR]Cognitive science/Neuroscience
Model selection
fMRI
biomarkers
Reproducibility of Results
Magnetic Resonance Imaging
Comments and Controversies
030104 developmental biology
Standard error
Neurology
statistics
Sample size determination
Sample Size
[SCCO.PSYC]Cognitive science/Psychology
[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM]
[STAT.ME]Statistics [stat]/Methodology [stat.ME]
030217 neurology & neurosurgery
Subjects
Details
- Language :
- English
- ISSN :
- 10538119 and 10959572
- Database :
- OpenAIRE
- Journal :
- NeuroImage, NeuroImage, 2017, ⟨10.1016/j.neuroimage.2017.06.061⟩, NeuroImage, Elsevier, 2017, ⟨10.1016/j.neuroimage.2017.06.061⟩
- Accession number :
- edsair.doi.dedup.....f2e19910f53b34a411ef5ae42daa75a3
- Full Text :
- https://doi.org/10.1016/j.neuroimage.2017.06.061⟩