Back to Search Start Over

Hierarchical Meta-Storms enables comprehensive and rapid comparison of microbiome functional profiles on a large scale using hierarchical dissimilarity metrics and parallel computing

Authors :
Gongchao Jing
Xiaoquan Su
Yuzhu Chen
Yufeng Zhang
Jinhua Li
Source :
Bioinformatics Advances. 1
Publication Year :
2021
Publisher :
Oxford University Press (OUP), 2021.

Abstract

Functional beta-diversity analysis on numerous microbiomes interprets the linkages between metabolic functions and their meta-data. To evaluate the microbiome beta-diversity, widely used distance metrices only count overlapped gene families but omit their inherent relationships, resulting in erroneous distances due to the sparsity of high-dimensional function profiles. Here we propose Hierarchical Meta-Storms (HMS) to tackle such problem. HMS contains two core components: (i) a dissimilarity algorithm that comprehensively measures functional distances among microbiomes using multi-level metabolic hierarchy and (ii) a fast Principal Co-ordinates Analysis (PCoA) implementation that deduces the beta-diversity pattern optimized by parallel computing. Results showed HMS can detect the variations of microbial functions in upper-level metabolic pathways, however, always missed by other methods. In addition, HMS accomplished the pairwise distance matrix and PCoA for 20 000 microbiomes in 3.9 h on a single computing node, which was 23 times faster and 80% less RAM consumption compared to existing methods, enabling the in-depth data mining among microbiomes on a high resolution. HMS takes microbiome functional profiles as input, produces their pairwise distance matrix and PCoA coordinates. Availability and implementation It is coded in C/C++ with parallel computing and released in two alternative forms: a standalone software (https://github.com/qdu-bioinfo/hierarchical-meta-storms) and an equivalent R package (https://github.com/qdu-bioinfo/hrms). Supplementary information Supplementary data are available at Bioinformatics Advances online.

Details

ISSN :
26350041
Volume :
1
Database :
OpenAIRE
Journal :
Bioinformatics Advances
Accession number :
edsair.doi...........4b6834f230ae3813417a8e474a981fed
Full Text :
https://doi.org/10.1093/bioadv/vbab003