1. Weakly-supervised deep learning for ultrasound diagnosis of breast cancer
- Author
-
Jaeil, Kim, Hye Jung, Kim, Chanho, Kim, Jin Hwa, Lee, Keum Won, Kim, Young Mi, Park, Hye Won, Kim, So Yeon, Ki, You Me, Kim, and Won Hwa, Kim
- Subjects
Adult ,Adolescent ,Science ,Breast Neoplasms ,Article ,Diagnosis, Differential ,Young Adult ,Deep Learning ,Breast cancer ,Image Interpretation, Computer-Assisted ,Humans ,Breast ,Aged ,Retrospective Studies ,Ultrasonography ,Cancer ,Aged, 80 and over ,Multidisciplinary ,Health care ,Middle Aged ,Prognosis ,ROC Curve ,Oncology ,Case-Control Studies ,Medicine ,Female ,Neural Networks, Computer ,Ultrasonography, Mammary ,Algorithms ,Follow-Up Studies - Abstract
Conventional deep learning (DL) algorithm requires full supervision of annotating the region of interest (ROI) that is laborious and often biased. We aimed to develop a weakly-supervised DL algorithm that diagnosis breast cancer at ultrasound without image annotation. Weakly-supervised DL algorithms were implemented with three networks (VGG16, ResNet34, and GoogLeNet) and trained using 1000 unannotated US images (500 benign and 500 malignant masses). Two sets of 200 images (100 benign and 100 malignant masses) were used for internal and external validation sets. For comparison with fully-supervised algorithms, ROI annotation was performed manually and automatically. Diagnostic performances were calculated as the area under the receiver operating characteristic curve (AUC). Using the class activation map, we determined how accurately the weakly-supervised DL algorithms localized the breast masses. For internal validation sets, the weakly-supervised DL algorithms achieved excellent diagnostic performances, with AUC values of 0.92–0.96, which were not statistically different (all Ps > 0.05) from those of fully-supervised DL algorithms with either manual or automated ROI annotation (AUC, 0.92–0.96). For external validation sets, the weakly-supervised DL algorithms achieved AUC values of 0.86–0.90, which were not statistically different (Ps > 0.05) or higher (P = 0.04, VGG16 with automated ROI annotation) from those of fully-supervised DL algorithms (AUC, 0.84–0.92). In internal and external validation sets, weakly-supervised algorithms could localize 100% of malignant masses, except for ResNet34 (98%). The weakly-supervised DL algorithms developed in the present study were feasible for US diagnosis of breast cancer with well-performing localization and differential diagnosis.
- Published
- 2021