Back to Search
Start Over
Randomized And Parallel Algorithms For Distance Matrix Calculations In Multiple Sequence Alignment
- Source :
- Journal of Clinical Monitoring and Computing. 19:351-359
- Publication Year :
- 2005
- Publisher :
- Springer Science and Business Media LLC, 2005.
-
Abstract
- Multiple sequence alignment (MSA) is a vital problem in biology. Optimal alignment of multiple sequences becomes impractical even for a modest number of sequences since the general version of the problem is NP-hard. Because of the high time complexity of traditional MSA algorithms, even today's fast computers are not able to solve the problem for large number of sequences. In this paper we present a randomized algorithm to calculate distance matrices, which is a major step in many multiple sequence alignment algorithms. The basic idea employed is sampling (along the lines of). We also illustrate how to parallelize this algorithm. In Section we introduce the problem of multiple sequence alignments. In Section we provide a discussion on various methods that have been employed in the literature for Multiple Sequence Alignment. In this section we also introduce our new sampling approach. We extend our randomized algorithm to the case of non-uniform length sequences as well. We show that our algorithms are amenable to parallelism in Section. In Section we back up our claim of speedup and accuracy with empirical data and examples. In Section we provide some concluding remarks.
- Subjects :
- Multiple sequence alignment
Theoretical computer science
Parallel algorithm
Computational Biology
Health Informatics
Sequence Analysis, DNA
Critical Care and Intensive Care Medicine
Computing Methodologies
Anesthesiology and Pain Medicine
Distance matrix
Sequence Analysis, Protein
Optimal alignment
Sequence Alignment
Time complexity
Algorithms
Mathematics
Subjects
Details
- ISSN :
- 15732614 and 13871307
- Volume :
- 19
- Database :
- OpenAIRE
- Journal :
- Journal of Clinical Monitoring and Computing
- Accession number :
- edsair.doi.dedup.....92a999452f35a0344a38e106ab50ccf9