1. MOCHA's advanced statistical modeling of scATAC-seq data enables functional genomic inference in large human cohorts.
- Author
-
Rachid Zaim S, Pebworth MP, McGrath I, Okada L, Weiss M, Reading J, Czartoski JL, Torgerson TR, McElrath MJ, Bumol TF, Skene PJ, and Li XJ
- Subjects
- Humans, SARS-CoV-2 genetics, Transposases metabolism, Transposases genetics, Chromatin Immunoprecipitation Sequencing methods, Cohort Studies, Gene Expression Regulation, COVID-19 genetics, COVID-19 virology, Models, Statistical, Single-Cell Analysis methods, Gene Regulatory Networks, Genomics methods, Chromatin genetics, Chromatin metabolism
- Abstract
Single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) is being increasingly used to study gene regulation. However, major analytical gaps limit its utility in studying gene regulatory programs in complex diseases. In response, MOCHA (Model-based single cell Open CHromatin Analysis) presents major advances over existing analysis tools, including: 1) improving identification of sample-specific open chromatin, 2) statistical modeling of technical drop-out with zero-inflated methods, 3) mitigation of false positives in single cell analysis, 4) identification of alternative transcription-starting-site regulation, and 5) modules for inferring temporal gene regulatory networks from longitudinal data. These advances, in addition to open chromatin analyses, provide a robust framework after quality control and cell labeling to study gene regulatory programs in human disease. We benchmark MOCHA with four state-of-the-art tools to demonstrate its advances. We also construct cross-sectional and longitudinal gene regulatory networks, identifying potential mechanisms of COVID-19 response. MOCHA provides researchers with a robust analytical tool for functional genomic inference from scATAC-seq data., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF