Back to Search
Start Over
Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data
- Source :
- Nature Communications, Vol 12, Iss 1, Pp 1-13 (2021)
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- Transcriptomic atlases have improved our understanding of the correlations between gene-expression patterns and spatially varying properties of brain structure and function. Gene-category enrichment analysis (GCEA) is a common method to identify functional gene categories that drive these associations, using gene-to-category annotation systems like the Gene Ontology (GO). Here, we show that applying standard GCEA methodology to spatial transcriptomic data is affected by substantial false-positive bias, with GO categories displaying an over 500-fold average inflation of false-positive associations with random neural phenotypes in mouse and human. The estimated false-positive rate of a GO category is associated with its rate of being reported as significantly enriched in the literature, suggesting that published reports are affected by this false-positive bias. We show that within-category gene–gene coexpression and spatial autocorrelation are key drivers of the false-positive bias and introduce flexible ensemble-based null models that can account for these effects, made available as a software toolbox. Identifying enriched gene sets in transcriptomic data is routine analysis. Here, the authors show that conventional gene category enrichment analysis (GCEA) applied to brain-wide atlas data yields biased results and develop a flexible ensemble-based null model framework to enable appropriate inference in GCEA.
- Subjects :
- Male
0301 basic medicine
Computer science
Science
General Physics and Astronomy
Inference
Computational biology
General Biochemistry, Genetics and Molecular Biology
03 medical and health sciences
Annotation
0302 clinical medicine
Animals
Humans
Gene
Spatial analysis
Multidisciplinary
Computational neuroscience
Null model
Gene Expression Profiling
Brain atlas
Brain
Reproducibility of Results
Molecular Sequence Annotation
General Chemistry
Mice, Inbred C57BL
Gene Ontology
030104 developmental biology
Null (SQL)
Software
030217 neurology & neurosurgery
Subjects
Details
- ISSN :
- 20411723
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- Nature Communications
- Accession number :
- edsair.doi.dedup.....f7654e4f829914aad4243ed04abc0633