Back to Search Start Over

HaploBlocks: Efficient Detection of Positive Selection in Large Population Genomic Datasets.

Authors :
Kirsch-Gerweck, Benedikt
Bohnenkämper, Leonard
Henrichs, Michel T
Alanko, Jarno N
Bannai, Hideo
Cazaux, Bastien
Peterlongo, Pierre
Burger, Joachim
Stoye, Jens
Diekmann, Yoan
Source :
Molecular Biology & Evolution; Mar2023, Vol. 40 Issue 3, p1-12, 12p
Publication Year :
2023

Abstract

Genomic regions under positive selection harbor variation linked for example to adaptation. Most tools for detecting positively selected variants have computational resource requirements rendering them impractical on population genomic datasets with hundreds of thousands of individuals or more. We have developed and implemented an efficient haplotype-based approach able to scan large datasets and accurately detect positive selection. We achieve this by combining a pattern matching approach based on the positional Burrows–Wheeler transform with model-based inference which only requires the evaluation of closed-form expressions. We evaluate our approach with simulations, and find it to be both sensitive and specific. The computational resource requirements quantified using UK Biobank data indicate that our implementation is scalable to population genomic datasets with millions of individuals. Our approach may serve as an algorithmic blueprint for the era of "big data" genomics: a combinatorial core coupled with statistical inference in closed form. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
07374038
Volume :
40
Issue :
3
Database :
Complementary Index
Journal :
Molecular Biology & Evolution
Publication Type :
Academic Journal
Accession number :
162858412
Full Text :
https://doi.org/10.1093/molbev/msad027