1. Predicting COVID-19 mortality with electronic medical records
- Author
Zachary H. Strasser, Shawn N. Murphy, Pourandokht Naseri, Hossein Estiri, Jeffy G. Klann, and Kavishwar B. Wagholikar
- Subjects
medicine.medical_specialty, Population, Computer applications to medicine. Medical informatics, MEDLINE, R858-859.7, Medicine (miscellaneous), Health Informatics, Information technology, 01 natural sciences, Article, 03 medical and health sciences, 0302 clinical medicine, Health Information Management, Epidemiology, medicine, 030212 general & internal medicine, 0101 mathematics, Lung cancer, Intensive care medicine, education, education.field_of_study, business.industry, Medical record, Statistics, Computational science, 010102 general mathematics, Interstitial lung disease, COVID-19, Cancer, medicine.disease, Computer Science Applications, Coronavirus, Pneumonia, Risk factors, business
- Abstract
This study aims to predict death after COVID-19 using only the past medical information routinely collected in electronic health records (EHRs) and to understand the differences in risk factors across age groups. Combining computational methods and clinical expertise, we curated clusters that represent 46 clinical conditions as potential risk factors for death after a COVID-19 infection. We trained age-stratified generalized linear models (GLMs) with component-wise gradient boosting to predict the probability of death based on what was known about patients before they contracted the virus. Despite relying only on previously documented demographics and comorbidities, our models demonstrated similar performance to other prognostic models that require an assortment of symptoms, laboratory values, and images at the time of diagnosis or during the course of the illness. In general, we found age to be the most important predictor of mortality in COVID-19 patients. A history of pneumonia, which is rarely asked about in typical epidemiology studies, was one of the most important risk factors for predicting COVID-19 mortality. A history of diabetes with complications and cancer (breast and prostate) were notable risk factors for patients between the ages of 45 and 65 years. In patients aged 65–85 years, diseases that affect the pulmonary system, including interstitial lung disease, chronic obstructive pulmonary disease, lung cancer, and a smoking history, were important for predicting mortality. The ability to compute precise individual-level risk scores exclusively based on the EHR is crucial for effectively allocating and distributing resources, such as prioritizing vaccination among the general population.
- Published
2021
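
The abstract above mentions age-stratified GLMs fit with component-wise gradient boosting. As a rough illustration of that general technique (not the authors' code or data), the sketch below fits a logistic GLM per age stratum by repeatedly updating only the single feature that best explains the current negative gradient. Everything here is an assumption made for illustration: the function names, the step size `nu`, the iteration count, the fixed intercept, the two age strata (taken from the ranges named in the abstract), and the synthetic comorbidity flags.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def componentwise_boost(X, y, n_iter=200, nu=0.1):
    """Component-wise gradient boosting for a logistic GLM (illustrative).

    X: (n, p) standardized features; y: (n,) array of 0/1 outcomes.
    Simplification: the intercept is fixed at the base-rate log-odds.
    """
    n, p = X.shape
    base = np.clip(y.mean(), 1e-6, 1 - 1e-6)
    intercept = np.log(base / (1 - base))
    coef = np.zeros(p)
    col_ss = (X ** 2).sum(axis=0)  # assumes no constant (all-zero) column
    for _ in range(n_iter):
        eta = intercept + X @ coef
        u = y - sigmoid(eta)           # negative gradient of the log-loss
        betas = X.T @ u / col_ss       # least-squares slope of u on each feature
        gain = betas ** 2 * col_ss     # residual sum of squares removed by each
        j = int(np.argmax(gain))       # pick the single best component
        coef[j] += nu * betas[j]       # shrunken update of that coefficient only
    return intercept, coef

def fit_by_age_stratum(X, y, age, edges):
    """Fit one boosted GLM per age stratum [lo, hi). Edges are hypothetical."""
    models = {}
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (age >= lo) & (age < hi)
        Xs = X[mask]
        Xs = (Xs - Xs.mean(axis=0)) / Xs.std(axis=0)  # standardize within stratum
        models[(lo, hi)] = componentwise_boost(Xs, y[mask])
    return models

# Synthetic data for illustration only: five binary "comorbidity" flags,
# of which only the first one actually raises the risk of the outcome.
rng = np.random.default_rng(0)
n = 2000
age = rng.uniform(45, 85, size=n)
X = (rng.random((n, 5)) < 0.3).astype(float)
y = (rng.random(n) < sigmoid(-2.0 + 1.5 * X[:, 0])).astype(float)

models = fit_by_age_stratum(X, y, age, edges=[45, 65, 85])
for stratum, (b0, coef) in models.items():
    print(stratum, np.round(coef, 2))
```

Because each iteration touches one coefficient, features that never win the selection step keep a coefficient of exactly zero, which is why this style of boosting is often used for variable selection as well as prediction.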