Back to Search
Start Over
An integrated package for bisulfite DNA methylation data analysis with Indel-sensitive mapping
- Source :
- BMC Bioinformatics, BMC Bioinformatics, Vol 20, Iss 1, Pp 1-11 (2019)
- Publication Year :
- 2018
-
Abstract
- Background DNA methylation plays crucial roles in most eukaryotic organisms. Bisulfite sequencing (BS-Seq) is a sequencing approach that provides quantitative cytosine methylation levels in genome-wide scope and single-base resolution. However, genomic variations such as insertions and deletions (indels) affect methylation calling, and the alignment of reads near/across indels becomes inaccurate in the presence of polymorphisms. Hence, the simultaneous detection of DNA methylation and indels is important for exploring the mechanisms of functional regulation in organisms. Results These problems motivated us to develop the algorithm BatMeth2, which can align BS reads with high accuracy while allowing for variable-length indels with respect to the reference genome. The results from simulated and real bisulfite DNA methylation data demonstrated that our proposed method increases alignment accuracy. Additionally, BatMeth2 can calculate the methylation levels of individual loci, genomic regions or functional regions such as genes/transposable elements. Additional programs were also developed to provide methylation data annotation, visualization, and differentially methylated cytosine/region (DMC/DMR) detection. The whole package provides new tools and will benefit bisulfite data analysis. Conclusion BatMeth2 improves DNA methylation calling, particularly for regions close to indels. It is an autorun package and easy to use. In addition, a DNA methylation visualization program and a differential analysis program are provided in BatMeth2. We believe that BatMeth2 will facilitate the study of the mechanisms of DNA methylation in development and disease. BatMeth2 is an open source software program and is available on GitHub (https://github.com/GuoliangLi-HZAU/BatMeth2/). Electronic supplementary material The online version of this article (10.1186/s12859-018-2593-4) contains supplementary material, which is available to authorized users.
- Subjects :
- Transposable element
Data Analysis
Bisulfite sequencing
Computational biology
Biology
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
03 medical and health sciences
0302 clinical medicine
Structural Biology
Pipeline
Humans
Sulfites
Indel
lcsh:QH301-705.5
Molecular Biology
030304 developmental biology
Alignment
0303 health sciences
DNA methylation
Applied Mathematics
food and beverages
Methylation
Sequence Analysis, DNA
Computer Science Applications
Bisulfite
lcsh:Biology (General)
030220 oncology & carcinogenesis
lcsh:R858-859.7
DNA microarray
Algorithms
Software
Reference genome
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 20
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC bioinformatics
- Accession number :
- edsair.doi.dedup.....8a8f07b1b055c924b6585138479be364