1. Efficient and Effective Training of COVID-19 Classification Networks With Self-Supervised Dual-Track Learning to Rank.
- Author
-
Li Y, Wei D, Chen J, Cao S, Zhou H, Zhu Y, Wu J, Lan L, Sun W, Qian T, Ma K, Xu H, and Zheng Y
- Subjects
- Algorithms, COVID-19, COVID-19 Testing, Cohort Studies, Computational Biology, Coronavirus Infections classification, Deep Learning, Diagnostic Errors statistics & numerical data, Humans, Neural Networks, Computer, Pneumonia, Viral classification, Retrospective Studies, SARS-CoV-2, Betacoronavirus, Clinical Laboratory Techniques statistics & numerical data, Coronavirus Infections diagnosis, Coronavirus Infections diagnostic imaging, Pandemics classification, Pneumonia, Viral diagnosis, Pneumonia, Viral diagnostic imaging, Radiographic Image Interpretation, Computer-Assisted statistics & numerical data, Supervised Machine Learning, Tomography, X-Ray Computed statistics & numerical data
- Abstract
Coronavirus Disease 2019 (COVID-19) has rapidly spread worldwide since first reported. Timely diagnosis of COVID-19 is crucial both for disease control and patient care. Non-contrast thoracic computed tomography (CT) has been identified as an effective tool for the diagnosis, yet the disease outbreak has placed tremendous pressure on radiologists for reading the exams and may potentially lead to fatigue-related mis-diagnosis. Reliable automatic classification algorithms can be really helpful; however, they usually require a considerable number of COVID-19 cases for training, which is difficult to acquire in a timely manner. Meanwhile, how to effectively utilize the existing archive of non-COVID-19 data (the negative samples) in the presence of severe class imbalance is another challenge. In addition, the sudden disease outbreak necessitates fast algorithm development. In this work, we propose a novel approach for effective and efficient training of COVID-19 classification networks using a small number of COVID-19 CT exams and an archive of negative samples. Concretely, a novel self-supervised learning method is proposed to extract features from the COVID-19 and negative samples. Then, two kinds of soft-labels ('difficulty' and 'diversity') are generated for the negative samples by computing the earth mover's distances between the features of the negative and COVID-19 samples, from which data 'values' of the negative samples can be assessed. A pre-set number of negative samples are selected accordingly and fed to the neural network for training. Experimental results show that our approach can achieve superior performance using about half of the negative samples, substantially reducing model training time.
- Published
- 2020
- Full Text
- View/download PDF