1. Prediction of Atypical Ductal Hyperplasia Upgrades Through a Machine Learning Approach to Reduce Unnecessary Surgical Excisions
- Author
-
Roberta M. diFlorio-Alexander, Saeed Hassanpour, Arief A. Suriawinata, Todd A. MacKenzie, Lia X. Harrington, and Katherine Trinh
- Subjects
0301 basic medicine ,Core needle ,Logistic regression ,Machine learning ,computer.software_genre ,Malignancy ,Cohort Studies ,Machine Learning ,03 medical and health sciences ,0302 clinical medicine ,Biopsy ,Original Report ,Humans ,Medicine ,Ductal Hyperplasia ,Medical diagnosis ,Hyperplasia ,medicine.diagnostic_test ,business.industry ,General Medicine ,medicine.disease ,Carcinoma, Intraductal, Noninfiltrating ,030104 developmental biology ,030220 oncology & carcinogenesis ,Cohort ,Hormonal therapy ,Female ,Artificial intelligence ,business ,computer - Abstract
Purpose Surgical excision is currently recommended for all occurrences of atypical ductal hyperplasia (ADH) found on core needle biopsies for malignancy diagnoses and treatment of lesions. The excision of all ADH lesions may lead to overtreatment, which results in invasive surgeries for benign lesions in many women. A machine learning method to predict ADH upgrade may help clinicians and patients decide whether combined active surveillance and hormonal therapy is a reasonable alternative to surgical excision. Methods The following six machine learning models were developed to predict ADH upgrade from core needle biopsy: gradient-boosting trees, random forest, radial support vector machine (SVM), weighted K-nearest neighbors (KNN), logistic elastic net, and logistic regression. The study cohort consisted of 128 lesions from 124 women at a tertiary academic care center in New Hampshire who had ADH on core needle biopsy and who underwent an associated surgical excision from 2011 to 2017. Results The best-performing models were gradient-boosting trees (area under the curve [AUC], 68%; accuracy, 78%) and random forest (AUC, 67%; accuracy, 77%). The top five most important features that determined ADH upgrade were age at biopsy, lesion size, number of biopsies, needle gauge, and personal and family history of breast cancer. Using the random forest model, 98% of all malignancies would have been diagnosed through surgical biopsies, whereas 16% of unnecessary surgeries on benign lesions could have been avoided (ie, 87% sensitivity at 45% specificity). Conclusion These results add to the growing body of support for machine learning models as useful aids for clinicians and patients in decisions about the clinical management of ADH.
- Published
- 2018