Back to Search Start Over

Comparison of the accuracy of human readers versus machine-learning algorithms for pigmented skin lesion classification: an open, web-based, international, diagnostic study

Authors :
Allan C. Halpern
Iris Zalaudek
Michael A. Marchetti
Ashfaq A. Marghoob
Christoph Sinz
Caterina Longo
Josep Malvehy
Amanda Oakley
Rainer Hofmann-Wellenhof
Brian Helba
Luc Thomas
Harald Kittler
John Paoli
H. Peter Soyer
Cliff Rosendahl
Alon Scope
Aimilios Lallas
Horacio Cabo
Philipp Tschandl
Christoph Rinner
Giuseppe Argenziano
Jan Lapins
Scott W. Menzies
David A. Gutman
Ralph P. Braun
Noel C. F. Codella
Susana Puig
Bengü Nisa Akay
Tschandl, Philipp
Codella, Noel
Akay, Bengü Nisa
Argenziano, Giuseppe
Braun, Ralph P
Cabo, Horacio
Gutman, David
Halpern, Allan
Helba, Brian
Hofmann-Wellenhof, Rainer
Lallas, Aimilio
Lapins, Jan
Longo, Caterina
Malvehy, Josep
Marchetti, Michael A
Marghoob, Ashfaq
Menzies, Scott
Oakley, Amanda
Paoli, John
Puig, Susana
Rinner, Christoph
Rosendahl, Cliff
Scope, Alon
Sinz, Christoph
Soyer, H Peter
Thomas, Luc
Zalaudek, Iri
Kittler, Harald
Tschandl, P
Codella, N
Akay, Bn
Argenziano, G
Braun, Rp
Cabo, H
Gutman, D
Halpern, A
Helba, B
Hofmann-Wellenhof, R
Lallas, A
Lapins, J
Longo, C
Malvehy, J
Marchetti, Ma
Marghoob, A
Menzies, S
Oakley, A
Paoli, J
Puig, S
Rinner, C
Rosendahl, C
Scope, A
Sinz, C
Soyer, Hp
Thomas, L
Zalaudek, I
Kittler, H.
Source :
Lancet Oncol
Publication Year :
2019

Abstract

BACKGROUND: Whether machine-learning algorithms can diagnose all pigmented skin lesions as accurately as human experts is unclear. The aim of this study was to compare the diagnostic accuracy of state-of-the-art machine-learning algorithms with human readers for all clinically relevant types of benign and malignant pigmented skin lesions. METHODS: For this open, web-based, international, diagnostic study, human readers were asked to diagnose dermatoscopic images selected randomly in 30-image batches from a test set of 1511 images. The diagnoses from human readers were compared with those of 139 algorithms created by 77 machine-learning labs, who participated in the International Skin Imaging Collaboration 2018 challenge and received a training set of 10 015 images in advance. The ground truth of each lesion fell into one of seven predefined disease categories: intraepithelial carcinoma including actinic keratoses and Bowen’s disease; basal cell carcinoma; benign keratinocytic lesions including solar lentigo, seborrheic keratosis and lichen planus-like keratosis; dermatofibroma; melanoma; melanocytic nevus; and vascular lesions. The two main outcomes were the differences in the number of correct specific diagnoses per batch between all human readers and the top three algorithms, and between human experts and the top three algorithms. FINDINGS: Between Aug 4, 2018, and Sept 30, 2018, 511 human readers from 63 countries had at least one attempt in the reader study. 283 (55·4%) of 511 human readers were board-certified dermatologists, 118 (23·1%) were dermatology residents, and 83 (16·2%) were general practitioners. When comparing all human readers with all machine-learning algorithms, the algorithms achieved a mean of 2·01 (95% CI 1·97 to 2·04; p

Details

Language :
English
Database :
OpenAIRE
Journal :
Lancet Oncol
Accession number :
edsair.doi.dedup.....e926000ebe8515a3942114889f27be6c