1. Detection of differentially methylated regions in whole genome bisulfite sequencing data using local Getis-Ord statistics
- Author
-
Fushun Chen, Yalu Wen, Zhiguang Li, Qingzheng Zhang, and Yan Zhuang
- Subjects
0301 basic medicine ,Statistics and Probability ,Sequence analysis ,Bisulfite sequencing ,Biology ,Biochemistry ,Genome ,Mice ,03 medical and health sciences ,Statistics ,Animals ,Humans ,Sulfites ,Methylated DNA immunoprecipitation ,Epigenetics ,Molecular Biology ,Sequence Analysis, DNA ,DNA Methylation ,Computer Science Applications ,Computational Mathematics ,030104 developmental biology ,Differentially methylated regions ,Computational Theory and Mathematics ,CpG site ,DNA methylation ,Software - Abstract
Motivation: DNA methylation is an important epigenetic modification that has essential role in gene regulation, cell differentiation and cancer development. Bisulfite sequencing is a widely used technique to obtain genome-wide DNA methylation profiles, and one of the key tasks of analyzing bisulfite sequencing data is to detect differentially methylated regions (DMRs) among samples under different treatment conditions. Although numerous tools have been proposed to detect differentially methylated single CpG site (DMC) between samples, methods for direct DMR detection, especially for complex study designs, are largely limited. Results: We present a new software, GetisDMR, for direct DMR detection. We use beta-binomial regression to model the whole-genome bisulfite sequencing data, where variations in methylation levels and confounding effects have been accounted for. We employ a region-wise test statistic, which is derived from local Getis-Ord statistics and considers the spatial correlation between nearby CpG sites, to detect DMRs. Unlike existing methods, that attempt to infer DMRs from DMCs based on empirical criteria, we provide statistical inference for direct DMR detection. Through extensive simulations and an application to two mouse datasets, we demonstrate that GetisDMR achieves better sensitivities, positive predictive values, more exact locations and better agreement of DMRs with current biological knowledge. Availability and Implementation: It is available at https://github.com/DMU-lilab/GetisDMR. Contacts: y.wen@auckland.ac.nz or zhiguangli@dlmedu.edu.cn Supplementary information: Supplementary data are available at Bioinformatics online.
- Published
- 2016
- Full Text
- View/download PDF