Back to Search Start Over

mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection.

Authors :
Cui, Yingbo
Peng, Shaoliang
Lu, Yutong
Zhu, Xiaoqian
Wang, Bingqiang
Wu, Chengkun
Liao, Xiangke
Source :
IEEE Transactions on Parallel & Distributed Systems. 11/1/2018, Vol. 29 Issue 11, p2557-2567. 11p.
Publication Year :
2018

Abstract

Single Nucleotide Polymorphism (SNP) detection is a fundamental procedure of whole genome analysis. SOAPsnp, a classic tool for detection, would take more than one week to analyze one typical human genome, which limits the efficiency of downstream analyses. In this paper, we present mSNP, an optimized version of SOAPsnp, which leverages Intel Xeon Phi coprocessors for large-scale SNP detection. Firstly, we redesigned the essential data structures of SOAPsnp, which significantly reduces memory footprint and improves computing efficiency. Then we developed a coordinated parallel framework for a higher hardware utilization of both CPU and Xeon Phi. Also, we tailored the data structures and operations to utilize the wide VPU of Xeon Phi to improve data throughput. Last but not the least, we proposeed a read-based window division strategy to improve throughput and obtain better load balance. mSNP is the first SNP detection tool empowered by Xeon Phi. We achieved a 38x single thread speedup on CPU, without any loss in precision. Moreover, mSNP successfully scaled to 4,096 nodes on Tianhe-2. Our experiments demonstrate that mSNP is efficient and scalable for large-scale human genome SNP detection. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10459219
Volume :
29
Issue :
11
Database :
Academic Search Index
Journal :
IEEE Transactions on Parallel & Distributed Systems
Publication Type :
Academic Journal
Accession number :
132313673
Full Text :
https://doi.org/10.1109/TPDS.2018.2839578