1. Introducing the GEV Activation Function for Highly Unbalanced Data to Develop COVID-19 Diagnostic Models
- Author
-
Renrong Sun, Joshua Bridge, Yanda Meng, Yalin Zheng, Mingfeng Zhao, Yong Du, and Yitian Zhao
- Subjects
0301 basic medicine ,Artificial intelligence ,extreme value theory ,Databases, Factual ,Computer science ,Pneumonia, Viral ,Activation function ,Health Informatics ,Machine learning ,computer.software_genre ,Article ,lung ,030218 nuclear medicine & medical imaging ,Betacoronavirus ,03 medical and health sciences ,Bayes' theorem ,COVID-19 Testing ,Deep Learning ,0302 clinical medicine ,Health Information Management ,computer-aided detection and diagnosis ,Humans ,x-ray and computed tomography ,Electrical and Electronic Engineering ,Extreme value theory ,Pandemics ,Receiver operating characteristic ,Clinical Laboratory Techniques ,SARS-CoV-2 ,business.industry ,Deep learning ,COVID-19 ,Computational Biology ,Bayes Theorem ,Sigmoid function ,Computer Science Applications ,030104 developmental biology ,Binary classification ,Generalized extreme value distribution ,Radiographic Image Interpretation, Computer-Assisted ,Neural Networks, Computer ,Coronavirus Infections ,Tomography, X-Ray Computed ,business ,computer ,Algorithms ,Biotechnology - Abstract
Fast and accurate diagnosis is essential for the efficient and effective control of the COVID-19 pandemic that is currently disrupting the whole world. Despite the prevalence of the COVID-19 outbreak, relatively few diagnostic images are openly available to develop automatic diagnosis algorithms. Traditional deep learning methods often struggle when data is highly unbalanced with many cases in one class and only a few cases in another; new methods must be developed to overcome this challenge. We propose a novel activation function based on the generalized extreme value (GEV) distribution from extreme value theory, which improves performance over the traditional sigmoid activation function when one class significantly outweighs the other. We demonstrate the proposed activation function on a publicly available dataset and externally validate on a dataset consisting of 1,909 healthy chest X-rays and 84 COVID-19 X-rays. The proposed method achieves an improved area under the receiver operating characteristic (DeLong's p-value < 0.05) compared to the sigmoid activation. Our method is also demonstrated on a dataset of healthy and pneumonia vs. COVID-19 X-rays and a set of computerized tomography images, achieving improved sensitivity. The proposed GEV activation function significantly improves upon the previously used sigmoid activation for binary classification. This new paradigm is expected to play a significant role in the fight against COVID-19 and other diseases, with relatively few training cases available.
- Published
- 2020
- Full Text
- View/download PDF