Back to Search Start Over

GPA: A Microbial Genetic Polymorphisms Assignments Tool in Metagenomic Analysis by Bayesian Estimation

Authors :
Jiarui Li
Pengcheng Du
Adam Yongxin Ye
Yuanyuan Zhang
Chuan Song
Hui Zeng
Chen Chen
Source :
Genomics, Proteomics & Bioinformatics, Vol 17, Iss 1, Pp 106-117 (2019)
Publication Year :
2019
Publisher :
Oxford University Press, 2019.

Abstract

Identifying antimicrobial resistant (AMR) bacteria in metagenomics samples is essential for public health and food safety. Next-generation sequencing (NGS) technology has provided a powerful tool in identifying the genetic variation and constructing the correlations between genotype and phenotype in humans and other species. However, for complex bacterial samples, there lacks a powerful bioinformatic tool to identify genetic polymorphisms or copy number variations (CNVs) for given genes. Here we provide a Bayesian framework for genotype estimation for mixtures of multiple bacteria, named as Genetic Polymorphisms Assignments (GPA). Simulation results showed that GPA has reduced the false discovery rate (FDR) and mean absolute error (MAE) in CNV and single nucleotide variant (SNV) identification. This framework was validated by whole-genome sequencing and Pool-seq data from Klebsiella pneumoniae with multiple bacteria mixture models, and showed the high accuracy in the allele fraction detections of CNVs and SNVs in AMR genes between two populations. The quantitative study on the changes of AMR genes fraction between two samples showed a good consistency with the AMR pattern observed in the individual strains. Also, the framework together with the genome annotation and population comparison tools has been integrated into an application, which could provide a complete solution for AMR gene identification and quantification in unculturable clinical samples. The GPA package is available at https://github.com/IID-DTH/GPA-package. Keywords: Next-generation sequencing, Pool-seq, Bayesian model, Metagenomics, Genetic polymorphisms

Details

Language :
English
ISSN :
16720229
Volume :
17
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Genomics, Proteomics & Bioinformatics
Publication Type :
Academic Journal
Accession number :
edsdoj.4c4a78153984bdebfe0c9732250c9b2
Document Type :
article
Full Text :
https://doi.org/10.1016/j.gpb.2018.12.005