Back to Search Start Over

Research on geographical origin traceability of Coix seed based on a modified random forest algorithm.

Authors :
ZHAO Hanqing
WANG Bin
CHEN Yao
TANG Zhangfeng
FANG Xin
CHEN Zengping
YANG Jian
DENG Ting
Source :
Journal of Light Industry. Dec2023, Vol. 38 Issue 6, p70-77. 8p.
Publication Year :
2023

Abstract

Coix seeds from 9 different origins were taken as the research object. An attempt was made to achieve geographical origin traceability of Coix seeds through the combination of excitation-emission matrix (EEM) fluorescence spectroscopy with improved random forest algorithm. The improvements to the random forest algorithm mainly include two aspects, firstly, principal component analysis (PCA) was adopted to reduce the dimension of EEM fluorescent data; secondly, a grid search method was used to identify the optimal number of principal components(PCs) to retain and the hyperparameters of the discriminant model during the PCA dimension reduction process. The results showed that an improved random forest model, incorporating standard deviation normalization and PCA dimension reduction modules, based on Coix seeds EEM fluorescence spectroscopy data, accurately predicted the geographical origin of Coix seed samples from 9 different areas. The optimal model was constructed by combining 100 decision trees with a maximum depth of 3 and a minimum sample size of 1 at the leaf node, using 16 principal components (PCs). This model achieved 100% prediction accuracy for both the validation and test sets, which consisted of a total of 108 samples, outperforming the PLS-DA model constructed by the partial least squares method (96% prediction accuracy). [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
20961553
Volume :
38
Issue :
6
Database :
Academic Search Index
Journal :
Journal of Light Industry
Publication Type :
Academic Journal
Accession number :
175585681
Full Text :
https://doi.org/10.12187/2023.06.009