Back to Search
Start Over
Deletion Detection Method Using the Distribution of Insert Size and a Precise Alignment Strategy
- Source :
- IEEE/ACM Transactions on Computational Biology and Bioinformatics. 18:1070-1081
- Publication Year :
- 2021
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2021.
-
Abstract
- Homozygous and heterozygous deletions commonly exist in the human genome. For current structural variation detection tools, it is significant to determine whether a deletion is homozygous or heterozygous. However, the problems of sequencing errors, micro-homologies, and micro-insertions prohibit common alignment tools from identifying accurate breakpoint locations, and often result in detecting false structural variations. In this study, we present a novel deletion detection tool called Sprites2. Comparing with Sprites, Sprites2 makes the following modifications: (1) The distribution of insert size is used in Sprites2, which can identify the type of deletions and improve the accuracy of deletion calls. (2) A precise alignment method based on AGE (one algorithm simultaneously aligning 5’ and 3’ ends between two sequences) is adopted in Sprites2 to identify breakpoints, which is helpful to resolve the problems introduced by sequencing errors, micro-homologies, and micro-insertions. In order to test and verify the performance of Sprites2, some simulated and real datasets are adopted in our experiments, and Sprites2 is compared with five popular tools. The experimental results show that Sprites2 can improve the performance of deletion detection. Sprites2 can be downloaded from https://github.com/zhangzhen/sprites2 .
- Subjects :
- Genome, Human
Computer science
business.industry
Applied Mathematics
Breakpoint
Computational Biology
Genomics
Pattern recognition
Sequence Analysis, DNA
Insert (molecular biology)
Structural variation
Mutagenesis, Insertional
Distribution (mathematics)
Genomic Structural Variation
Genetics
Humans
Human genome
Artificial intelligence
business
Sequence Alignment
Gene Deletion
Biotechnology
Subjects
Details
- ISSN :
- 23740043 and 15455963
- Volume :
- 18
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Computational Biology and Bioinformatics
- Accession number :
- edsair.doi.dedup.....6a1f08e69f4d49aeb935b69046640edc
- Full Text :
- https://doi.org/10.1109/tcbb.2019.2934407