Back to Search
Start Over
Performance evaluation of machine learning techniques for breast cancer detection using WDBC dataset.
- Source :
-
AIP Conference Proceedings . 2024, Vol. 3919 Issue 1, p1-8. 8p. - Publication Year :
- 2024
-
Abstract
- Recently, the study of machine learning has gained prominence in the medical field. One of the most challeng-ing tasks in machine learning is creating accurate and reliable classifiers for medical applications. To find the best classi-fier, a comparative analysis of the most prominent machine learning classifiers—SVM, DT, RF, NB, KNN, XGBoost, AdaBoost, and CatBoost—has been done using the Wisconsin Diagnostic Breast Cancer (WDBC) dataset. The perfor-mance of classifiers is evaluated in terms of accuracy, precision, recall, AUC, confusion matrices, and ROC. The results of the analytical comparison revealed that the SVM and RF classifiers outperformed other classifiers. Furthermore, the WDBC dataset used in this study has a class imbalance issue (malignant 212; benign 357). Imbalanced data sets produce skewed outcomes and deceptive accuracy. Data resampling was performed to fix the problem of class imbalance. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 0094243X
- Volume :
- 3919
- Issue :
- 1
- Database :
- Academic Search Index
- Journal :
- AIP Conference Proceedings
- Publication Type :
- Conference
- Accession number :
- 176251292
- Full Text :
- https://doi.org/10.1063/5.0184603