Back to Search Start Over

Randomized And Parallel Algorithms For Distance Matrix Calculations In Multiple Sequence Alignment

Authors :
Chun-Hsi Huang
Sanguthevar Rajasekaran
Hardik Dave
Vishal Thapar
Source :
Journal of Clinical Monitoring and Computing. 19:351-359
Publication Year :
2005
Publisher :
Springer Science and Business Media LLC, 2005.

Abstract

Multiple sequence alignment (MSA) is a vital problem in biology. Optimal alignment of multiple sequences becomes impractical even for a modest number of sequences since the general version of the problem is NP-hard. Because of the high time complexity of traditional MSA algorithms, even today's fast computers are not able to solve the problem for large number of sequences. In this paper we present a randomized algorithm to calculate distance matrices, which is a major step in many multiple sequence alignment algorithms. The basic idea employed is sampling (along the lines of). We also illustrate how to parallelize this algorithm. In Section we introduce the problem of multiple sequence alignments. In Section we provide a discussion on various methods that have been employed in the literature for Multiple Sequence Alignment. In this section we also introduce our new sampling approach. We extend our randomized algorithm to the case of non-uniform length sequences as well. We show that our algorithms are amenable to parallelism in Section. In Section we back up our claim of speedup and accuracy with empirical data and examples. In Section we provide some concluding remarks.

Details

ISSN :
15732614 and 13871307
Volume :
19
Database :
OpenAIRE
Journal :
Journal of Clinical Monitoring and Computing
Accession number :
edsair.doi.dedup.....92a999452f35a0344a38e106ab50ccf9