1. Chromatyping: Reconstructing Nucleosome Profiles from NOMe Sequencing Data
- Author
-
Stefan Canzar, Marcel H. Schulz, Tobias Marschall, and Shounak Chakraborty
- Subjects
Computer science ,Sequencing data ,Enumeration algorithm ,Computational biology ,Clique (graph theory) ,Nucleosome occupancy ,Epigenesis, Genetic ,Genetics ,Humans ,Nucleosome ,Promoter Regions, Genetic ,Hidden Markov model ,Molecular Biology ,Computational Biology ,Sequence Analysis, DNA ,DNA Methylation ,Markov Chains ,Nucleosomes ,Computational Mathematics ,Gene Expression Regulation ,Computational Theory and Mathematics ,Modeling and Simulation ,Graph (abstract data type) ,CpG Islands ,Deconvolution ,Algorithms - Abstract
Measuring nucleosome positioning in cells is crucial for the analysis of epigenetic gene regulation. Reconstruction of nucleosome profiles of individual cells or subpopulations of cells remains challenging because most genome-wide assays measure nucleosome positioning and DNA accessibility for thousands of cells using bulk sequencing. In this study we use characteristics of the NOMe (nucleosome occupancy and methylation)-sequencing assay to derive a new approach, called ChromaClique, for deconvolution of different nucleosome profiles (chromatypes) from cell subpopulations of one NOMe-seq measurement. ChromaClique uses a maximal clique enumeration algorithm on a newly defined NOMe read graph that is able to group reads according to their nucleosome profiles. We show that the edge probabilities of that graph can be efficiently computed using hidden Markov models. We demonstrate using simulated data that ChromaClique is more accurate than a related method and scales favorably, allowing genome-wide analyses of chromatypes in cell subpopulations.
- Published
- 2020