Back to Search Start Over

Selective of informative metabolites using random forests based on model population analysis.

Authors :
Huang, Jian-Hua
Yan, Jun
Wu, Qing-Hua
Duarte Ferro, Miguel
Yi, Lun-Zhao
Lu, Hong-Mei
Xu, Qing-Song
Liang, Yi-Zeng
Source :
Talanta. Dec2013, Vol. 117, p549-555. 7p.
Publication Year :
2013

Abstract

Abstract: One of the main goals of metabolomics studies is to discover informative metabolites or biomarkers, which may be used to diagnose diseases and to find out pathology. Sophisticated feature selection approaches are required to extract the information hidden in such complex ‘omics’ data. In this study, it is proposed a new and robust selective method by combining random forests (RF) with model population analysis (MPA), for selecting informative metabolites from three metabolomic datasets. According to the contribution to the classification accuracy, the metabolites were classified into three kinds: informative, no-informative, and interfering metabolites. Based on the proposed method, some informative metabolites were selected for three datasets; further analyses of these metabolites between healthy and diseased groups were then performed, showing by T-test that the P values for all these selected metabolites were lower than 0.05. Moreover, the informative metabolites identified by the current method were demonstrated to be correlated with the clinical outcome under investigation. The source codes of MPA-RF in Matlab can be freely downloaded from http://code.google.com/p/my-research-list/downloads/list [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
00399140
Volume :
117
Database :
Academic Search Index
Journal :
Talanta
Publication Type :
Academic Journal
Accession number :
91865270
Full Text :
https://doi.org/10.1016/j.talanta.2013.07.070