Back to Search Start Over

Hybrid diabetes disease prediction framework based on data imputation and outlier detection techniques.

Authors :
Srivastava, Anand Kumar
Kumar, Yugal
Singh, Pradeep Kumar
Source :
Expert Systems. Mar2022, Vol. 39 Issue 3, p1-17. 17p.
Publication Year :
2022

Abstract

In the field of medical science, accurate prediction is a difficult and challenging task. But, the presence of missing values and outliers can make the prediction task more complicated. Many researchers address the issue of missing value in medical data, either detect the missing value and delete the respective data instances from the dataset or adopt some default methods such as mean, median, neighbour etc., for filling the missing value. However, both methods are lacking to produce optimal results. Furthermore, outliers are also presented in data and degraded the performance of classifier. Few researchers also focus on the outlier detection in medical dataset, but it is not fully explored till date. This work considers the two well‐known problems of data that is, (i) missing value imputation, and (ii) outlier. The missing value imputation issue is addressed through K‐Mean++ based data imputation technique. This technique also validates the data through clustering and also compute the values for missing data. The outlier can be detected through an ABC based outlier detection technique. Further, the final outcome is determined using LS‐SVM classifiers. Hence, this work presents a hybrid disease diagnosis framework for diabetes prediction, called hybrid diabetes prediction framework. The reason behind to choose the diabetes dataset for implementation as it contains 763 missing values and several outliers. The simulation results showed that proposed hybrid framework effectively determines the missing values and outliers in diabetes dataset. Further, the performance of proposed hybrid diabetes prediction framework is evaluated using accuracy, sensitivity, specificity, kappa and AUC parameters and compared with 34 state of art techniques. Results confirmed that proposed hybrid framework obtains 96.57%, 93.37%, 98.12%, 98.17%, and 95.43% accuracy, sensitivity, specificity, kappa and AUC rate respectively. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02664720
Volume :
39
Issue :
3
Database :
Academic Search Index
Journal :
Expert Systems
Publication Type :
Academic Journal
Accession number :
155435363
Full Text :
https://doi.org/10.1111/exsy.12785