Back to Search Start Over

A Robust TabNet-Based Multi-Classification Algorithm for Infrared Spectral Data of Chinese Herbal Medicine with High-Dimensional Small Samples.

Authors :
Wang, Yongjun
Jin, Chengliang
Ma, Li
Liu, Xiao
Source :
Journal of Pharmaceutical & Biomedical Analysis. May2024, Vol. 242, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Robust classification algorithms for high-dimensional, small-sample datasets are valuable in practical applications. Faced with the infrared spectroscopic dataset with 568 samples and 3448 wavelengths (features) to identify the origins of Chinese medicinal materials, this paper proposed a novel embedded multiclassification algorithm, ITabNet, derived from the framework of TabNet. Firstly, a refined data pre-processing (DP) mechanism was designed to efficiently find the best adaptive one among 50 DP methods with the help of Support Vector Machine (SVM). Following this, an innovative focal loss function was designed and joined with a cross-validation experiment strategy to mitigate the impact of sample imbalance on algorithm. Detailed investigations on ITabNet were conducted, including comparisons of ITabNet with SVM for the conditions of DP and Non-DP, GPU and CPU computer settings, as well as ITabNet against XGBT (Extreme Gradient Boosting). The numerical results demonstrate that ITabNet can significantly improve the effectiveness of prediction. The best accuracy score is 1.0000, and the best Area Under the Curve (AUC) score is 1.0000. Suggestions on how to use models effectively were given. Furthermore, ITabNet shows the potential to apply the analysis of medicinal efficacy and chemical composition of medicinal materials. The paper also provides ideas for multi-classification modeling data with small sample size and high-dimensional feature. • A TabNet-based deep learning method for high-dimensional small-sample datasets. • A rapid data preprocessing search strategy using Support Vector Machine is established. • An adaptive Focal loss function with adjusting parameters counters sample imbalance. • Chinese herb origin identification via infrared spectroscopy shows impressive results. • Suggestions for Chinese herb further research and method generalization are offered. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
07317085
Volume :
242
Database :
Academic Search Index
Journal :
Journal of Pharmaceutical & Biomedical Analysis
Publication Type :
Academic Journal
Accession number :
175934505
Full Text :
https://doi.org/10.1016/j.jpba.2024.116031