1. Eleven routine clinical features predict COVID-19 severity uncovered by machine learning of longitudinal measurements
- Author
-
Shigao Huang, Xindong Sun, Chunyu Tu, Haihong Zhao, Xuejun Dong, Baofu Chen, Lu Li, Hai Yang, Xiaomai Wu, Liang Yue, Bo Shen, Fangfei Zhang, Minfei Peng, Dongqing Lv, Luxiao Hong, Stan Z. Li, Tiannan Guo, Shiyong Chen, Hongguo Zhu, Qiushi Zhang, Zelin Zang, Chao Zhang, Yaoting Sun, Jing Wang, Yi Zhu, Jiaqin Xu, Zhehan Zhou, Junbo Liang, Weigang Ge, Jun Li, Hao Chen, Minghui Li, Haixiao Chen, and Kai Zhou
- Subjects
Declaration ,LOESS, locally estimated scatterplot smoothing ,AST, aspartate aminotransferase ,Research purpose ,computer.software_genre ,Biochemistry ,AUC, area under the curve ,0302 clinical medicine ,Informed consent ,Structural Biology ,SHAP, SHapley Additive exPlanations ,GA, genetic algorithm ,Medicine ,Severity prediction ,Predictive dynamics ,TT, thrombin time ,0303 health sciences ,LDH, lactate dehydrogenase ,eGFR, estimated glomerular filtration rate ,ROC, receiver operating characteristics ,Subject (documents) ,Predictive value ,PCT, procalcitonin ,CT, computed tomography ,Test (assessment) ,Computer Science Applications ,NPV, negative predictive value ,GGT, gamma glutamyl transpeptidase ,030220 oncology & carcinogenesis ,Cohort ,CRP, C-reactive protein ,LOS, length of stay ,Biotechnology ,NETs, neutrophil extracellular traps ,LAC, lactate ,medicine.medical_specialty ,Coronavirus disease 2019 (COVID-19) ,Biophysics ,Feature selection ,Machine learning ,Article ,Mg, magnesium ,03 medical and health sciences ,Text mining ,Routine clinical test ,Genetics ,APTT, activated partial thromboplastin time ,Normal range ,ComputingMethodologies_COMPUTERGRAPHICS ,030304 developmental biology ,HIS, hospital information system ,ESR, erythrocyte sedimentation rate ,SVM, support vector machine ,SARS-CoV-2 ,business.industry ,Disease progression ,BASO#, basophil counts ,COVID-19 ,Retrospective cohort study ,CFDA, China Food and Drug Administration ,PPV, positive predictive value ,RT-PCR, reverse transcriptase -polymerase chain reaction ,Family medicine ,SaO2, oxygen saturation ,ABG, arterial blood gas ,Longitudinal dynamics ,Artificial intelligence ,business ,computer ,CK, creatine kinase ,Medical ethics ,TP248.13-248.65 - Abstract
Graphical abstract, Severity prediction of COVID-19 remains one of the major clinical challenges for the ongoing pandemic. Here, we have recruited a 144 COVID-19 patient cohort, resulting in a data matrix containing 3,065 readings for 124 types of measurements over 52 days. A machine learning model was established to predict the disease progression based on the cohort consisting of training, validation, and internal test sets. A panel of eleven routine clinical factors constructed a classifier for COVID-19 severity prediction, achieving accuracy of over 98% in the discovery set. Validation of the model in an independent cohort containing 25 patients achieved accuracy of 80%. The overall sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were 0.70, 0.99, 0.93, and 0.93, respectively. Our model captured predictive dynamics of lactate dehydrogenase (LDH) and creatine kinase (CK) while their levels were in the normal range. This model is accessible at https://www.guomics.com/covidAI/ for research purpose.
- Published
- 2021
- Full Text
- View/download PDF