A New Efficient Algorithm for Quorum Planted Motif Search on Large DNA Datasets
- Source :
- IEEE Access, Vol 7, Pp 129617-129626 (2019)
- Publication Year :
- 2019
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2019.
Abstract
- Quorum planted (l, d) motif search (qPMS) is a challenging computational problem in bioinformatics, used mainly to identify regulatory elements such as transcription factor binding sites in DNA sequences. Large DNA datasets play an important role in identifying high-quality (l, d) motifs, but most existing qPMS algorithms are too time-consuming to complete the search on such datasets in a reasonable time. We propose an approximate qPMS algorithm called APMS that handles large DNA datasets mainly by accelerating neighboring substring search and filtering redundant substrings. Experimental results on large DNA datasets show that APMS not only identifies the implanted (l, d) motifs but also runs orders of magnitude faster than state-of-the-art qPMS algorithms. The source code of APMS and a Python wrapper for it are freely available at https://github.com/qyu071/apms.
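For readers unfamiliar with the problem named in the abstract, the following is a minimal sketch of the standard qPMS definition: an l-mer counts as an (l, d) motif if it occurs with at most d mismatches in at least a fraction q of the input sequences. This is an exhaustive, definition-level illustration only, not the APMS algorithm or its API; the function names (`hamming`, `occurs_within`, `brute_force_qpms`) and the choice to express the quorum `q` as a fraction are assumptions made for this example.

```python
from itertools import product


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(motif, seq, d):
    """True if some substring of seq with len(motif) is within d mismatches of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def brute_force_qpms(seqs, l, d, q):
    """Return every l-mer over ACGT that occurs, with at most d mismatches,
    in at least q * len(seqs) sequences. Hypothetical helper: it enumerates
    all 4**l candidates, so it is only usable for very small l."""
    need = q * len(seqs)
    motifs = []
    for cand in product("ACGT", repeat=l):
        motif = "".join(cand)
        hits = sum(occurs_within(motif, s, d) for s in seqs)
        if hits >= need:
            motifs.append(motif)
    return motifs


# Toy input: three sequences carry a variant of ACGT, the fourth does not.
seqs = ["TTACGTTT", "GGACCTGG", "CCAAGTCC", "TTTTTTTT"]
found = brute_force_qpms(seqs, l=4, d=1, q=0.75)
print("ACGT" in found)
```

The cost of this enumeration grows as 4^l times the total input length, which is why practical qPMS tools rely on techniques such as the neighbor search acceleration and substring filtering that the abstract mentions.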
- Subjects :
- Source code
General Computer Science
Computer science
media_common.quotation_subject
General Engineering
String searching algorithm
DNA sequencing
Substring
DNA binding site
chemistry.chemical_compound
ComputingMethodologies_PATTERNRECOGNITION
Planted motif search
chemistry
large DNA datasets
transcription factor binding sites
General Materials Science
lcsh:Electrical engineering. Electronics. Nuclear engineering
Quorum planted (l, d) motif search
Computational problem
lcsh:TK1-9971
Algorithm
DNA
media_common
Details
- ISSN :
- 2169-3536
- Volume :
- 7
- Database :
- OpenAIRE
- Journal :
- IEEE Access
- Accession number :
- edsair.doi.dedup.....5686f867fe545f8907bac0e8db216c6f
- Full Text :
- https://doi.org/10.1109/access.2019.2940115