Back to Search
Start Over
An efficient classification framework for Type 2 Diabetes incorporating feature interactions.
- Source :
-
Expert Systems with Applications . Apr2024, Vol. 239, pN.PAG-N.PAG. 1p. - Publication Year :
- 2024
-
Abstract
- Accurate and timely diagnosis of Type 2 Diabetes is a highly challenging task due to its initial asymptomatic nature and complex risk factor composition. Recently, Machine Learning (ML) has been actively used to build improved Type 2 Diabetes classification systems. One important aspect of these systems has been feature selection. Filter feature selection techniques especially based on mutual information have been popularly employed in recent works. However, most of them have focused on selecting relevant features and eliminating redundant ones. A third relationship called feature interaction may exist if input features are highly correlated, as in the case of Type 2 Diabetes. Feature interaction signifies the additional information about the target provided by the interaction between a subset of input features, that may not be relevant to the target individually. Second, many of the existing ML models are black-box, making the model interpretability very difficult. This paper proposes an efficient ML framework for the classification of Prediabetes and Type 2 Diabetes by incorporating feature interactions. It presents a hybrid filter-wrapper technique called Feature Interaction-based Greedy Sequential Feature selection. Agglomerative Feature Clustering and Dendrogram visualization for the analysis of interactive features is performed. A model-agnostic explainability technique of SHapley Additive explanations (SHAP) is augmented to provide local and global explanations of model predictions. The performance of the proposed classification framework was found to be interpretable as well as efficient with an accuracy of 98.8669%, precision of 98.8660%, recall of 98.8665%, and F-score of 0.9886 for Diabetes. For Prediabetes, an accuracy of 90.1187%, precision of 94.6958%, recall of 90.1187%, and F-score of 0.9403 was obtained. [ABSTRACT FROM AUTHOR]
- Subjects :
- *TYPE 2 diabetes
*FEATURE selection
*TYPE 2 diabetes diagnosis
*MACHINE learning
Subjects
Details
- Language :
- English
- ISSN :
- 09574174
- Volume :
- 239
- Database :
- Academic Search Index
- Journal :
- Expert Systems with Applications
- Publication Type :
- Academic Journal
- Accession number :
- 174875278
- Full Text :
- https://doi.org/10.1016/j.eswa.2023.122138