Back to Search Start Over

Procrastination Leads to Efficient Filtration for Local Multiple Alignment

Authors :
Todd J. Treangen
Xavier Messeguer
Carla Kuiken
Nicole T. Perna
Louxin Zhang
Aaron E. Darling
Source :
Lecture Notes in Computer Science ISBN: 9783540395836, WABI
Publication Year :
2006
Publisher :
Springer Berlin Heidelberg, 2006.

Abstract

We describe an efficient local multiple alignment filtration heuristic for identification of conserved regions in one or more DNA sequences. The method incorporates several novel ideas: (1) palindromic spaced seed patterns to match both DNA strands simultaneously, (2) seed extension (chaining) in order of decreasing multiplicity, and (3) procrastination when low multiplicity matches are encountered. The resulting local multiple alignments may have nucleotide substitutions and internal gaps as large as w characters in any occurrence of the motif. The algorithm consumes $\mathcal{O}(wN)$ memory and $\mathcal{O}(wN \log wN)$ time where N is the sequence length. We score the significance of multiple alignments using entropy-based motif scoring methods. We demonstrate the performance of our filtration method on Alu-repeat rich segments of the human genome and a large set of Hepatitis C virus genomes. The GPL implementation of our algorithm in C++ is called procrastAligner and is freely available from http://gel.ahabs.wisc.edu/procrastination

Details

ISBN :
978-3-540-39583-6
ISBNs :
9783540395836
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783540395836, WABI
Accession number :
edsair.doi...........126f46018b5eac6129a2ba04e4a0055b
Full Text :
https://doi.org/10.1007/11851561_12