Back to Search Start Over

GNPI: Graph normalization to integrate phylogenetic information for metagenomic host phenotype prediction.

Authors :
Li, Bojing
Zhong, Duo
Qiao, Jimei
Jiang, Xingpeng
Source :
Methods. Sep2022, Vol. 205, p11-17. 7p.
Publication Year :
2022

Abstract

• A graph normalization method was proposed to integrate phylogenetic information for metagenomic host phenotype prediction. • The method learned better the structure of phylogenetic trees. • The proposed method enhanced the accuracy of machine learning and deep learning models in different metagenomic datasets. Microorganisms play important roles in our lives especially on metabolism and diseases. Determining the probability of human suffering from specific diseases and the severity of the disease based on microbial genes is the crucial research for understanding the relationship between microbes and diseases. Previous could extract the topological information of phylogenetic trees and integrate them to metagenomic datasets, thus enable classifiers to learn more information in limited datasets and thus improve the performance of the models. In this paper, we proposed a GNPI model to better learn the structure of phylogenetic trees. GNPI maintained the original vector format of metagenomic datasets, while previous research had to change the input form to matrices. The vector-like form of the input data can be easily adopted in the baseline machine learning models and is available for deep learning models. The datasets processed with GNPI help enhance the accuracy of machine learning and deep learning models in three different datasets. GNPI is an interpretable data processing method for host phenotype prediction and other bioinformatics tasks. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10462023
Volume :
205
Database :
Academic Search Index
Journal :
Methods
Publication Type :
Academic Journal
Accession number :
158609452
Full Text :
https://doi.org/10.1016/j.ymeth.2022.05.007