1. The impact of site-specific digital histology signatures on deep learning model accuracy and bias
- Author
-
James M. Dolezal, Lara R. Heij, Jefree J. Schulte, Dezheng Huo, Olufunmilayo I. Olopade, Robert L. Grossman, Heather Chen, Nicole A. Cipriani, Frederick M Howard, Sara Kochanny, Jakob Nikolas Kather, Rita Nanda, and Alexander T. Pearson
- Subjects
0301 basic medicine ,Color normalization ,Computer science ,Science ,DNA Mutational Analysis ,education ,General Physics and Astronomy ,Image processing ,Computational biology ,Risk Assessment ,General Biochemistry, Genetics and Molecular Biology ,Article ,Specimen Handling ,03 medical and health sciences ,0302 clinical medicine ,Deep Learning ,Cancer genome ,Neoplasms ,Tumor stage ,Biomarkers, Tumor ,Image Processing, Computer-Assisted ,Humans ,Neoplasm Staging ,Multidisciplinary ,business.industry ,Deep learning ,Gene Expression Profiling ,Diagnostic marker ,Histology ,Diagnostic markers ,General Chemistry ,Data Accuracy ,030104 developmental biology ,Neoplasms diagnosis ,030220 oncology & carcinogenesis ,Mutation ,Cancer imaging ,Artificial intelligence ,business - Abstract
The Cancer Genome Atlas (TCGA) is one of the largest biorepositories of digital histology. Deep learning (DL) models have been trained on TCGA to predict numerous features directly from histology, including survival, gene expression patterns, and driver mutations. However, we demonstrate that these features vary substantially across tissue submitting sites in TCGA for over 3,000 patients with six cancer subtypes. Additionally, we show that histologic image differences between submitting sites can easily be identified with DL. Site detection remains possible despite commonly used color normalization and augmentation methods, and we quantify the image characteristics constituting this site-specific digital histology signature. We demonstrate that these site-specific signatures lead to biased accuracy for prediction of features including survival, genomic mutations, and tumor stage. Furthermore, ethnicity can also be inferred from site-specific signatures, which must be accounted for to ensure equitable application of DL. These site-specific signatures can lead to overoptimistic estimates of model performance, and we propose a quadratic programming method that abrogates this bias by ensuring models are not trained and validated on samples from the same site., Deep learning models have been trained on The Cancer Genome Atlas to predict numerous features directly from histology, including survival, gene expression patterns, and driver mutations. Here, the authors demonstrate that site-specific histologic signatures can lead to biased estimates of accuracy for such models, and propose a method to minimize such bias.
- Published
- 2021