Back to Search
Start Over
Fast and exact gap-affine partial order alignment with POASTA.
- Source :
-
Bioinformatics (Oxford, England) [Bioinformatics] 2024 Dec 26; Vol. 41 (1). - Publication Year :
- 2024
-
Abstract
- Motivation: Partial order alignment is a widely used method for computing multiple sequence alignments, with applications in genome assembly and pangenomics, among many others. Current algorithms to compute the optimal, gap-affine partial order alignment do not scale well to larger graphs and sequences. While heuristic approaches exist, they do not guarantee optimal alignment and sacrifice alignment accuracy.<br />Results: We present POASTA, a new optimal algorithm for partial order alignment that exploits long stretches of matching sequence between the graph and a query. We benchmarked POASTA against the state-of-the-art on several diverse bacterial gene datasets and demonstrated an average speed-up of 4.1× and up to 9.8×, using less memory. POASTA's memory scaling characteristics enabled the construction of much larger POA graphs than previously possible, as demonstrated by megabase-length alignments of 342 Mycobacterium tuberculosis sequences.<br />Availability and Implementation: POASTA is available on Github at https://github.com/broadinstitute/poasta.<br /> (© The Author(s) 2025. Published by Oxford University Press.)
Details
- Language :
- English
- ISSN :
- 1367-4811
- Volume :
- 41
- Issue :
- 1
- Database :
- MEDLINE
- Journal :
- Bioinformatics (Oxford, England)
- Publication Type :
- Academic Journal
- Accession number :
- 39752324
- Full Text :
- https://doi.org/10.1093/bioinformatics/btae757