Back to Search Start Over

Performance evaluation of machine learning techniques for breast cancer detection using WDBC dataset.

Authors :
Chhillar, Indu
Singh, Ajmer
Source :
AIP Conference Proceedings. 2024, Vol. 3919 Issue 1, p1-8. 8p.
Publication Year :
2024

Abstract

Recently, the study of machine learning has gained prominence in the medical field. One of the most challeng-ing tasks in machine learning is creating accurate and reliable classifiers for medical applications. To find the best classi-fier, a comparative analysis of the most prominent machine learning classifiers—SVM, DT, RF, NB, KNN, XGBoost, AdaBoost, and CatBoost—has been done using the Wisconsin Diagnostic Breast Cancer (WDBC) dataset. The perfor-mance of classifiers is evaluated in terms of accuracy, precision, recall, AUC, confusion matrices, and ROC. The results of the analytical comparison revealed that the SVM and RF classifiers outperformed other classifiers. Furthermore, the WDBC dataset used in this study has a class imbalance issue (malignant 212; benign 357). Imbalanced data sets produce skewed outcomes and deceptive accuracy. Data resampling was performed to fix the problem of class imbalance. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0094243X
Volume :
3919
Issue :
1
Database :
Academic Search Index
Journal :
AIP Conference Proceedings
Publication Type :
Conference
Accession number :
176251292
Full Text :
https://doi.org/10.1063/5.0184603