Back to Search Start Over

The benefit of augmenting open data with clinical data-warehouse EHR for forecasting SARS-CoV-2 hospitalizations in Bordeaux area, France

Authors :
Thomas, Ferté
Vianney, Jouhet
Romain, Griffier
Boris P, Hejblum
Rodolphe, Thiébaut
Francois, Rouanet
Statistics In System biology and Translational Medicine (SISTM)
Inria Bordeaux - Sud-Ouest
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)- Bordeaux population health (BPH)
Université de Bordeaux (UB)-Institut de Santé Publique, d'Épidémiologie et de Développement (ISPED)-Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Bordeaux (UB)-Institut de Santé Publique, d'Épidémiologie et de Développement (ISPED)-Institut National de la Santé et de la Recherche Médicale (INSERM)
Bordeaux population health (BPH)
Université de Bordeaux (UB)-Institut de Santé Publique, d'Épidémiologie et de Développement (ISPED)-Institut National de la Santé et de la Recherche Médicale (INSERM)
Hôpital Pellegrin
CHU Bordeaux [Bordeaux]-Groupe hospitalier Pellegrin
Bordeaux University Hospital Covid-19 Crisis Task Force: Isabelle Faure, Philippe Revel, Eric Tentillier, Jean-Michel Dindart, Didier Gruson, Olivier Joannes-Boyau, Jean-Marie Denis Malvy, Thierry Pistone, Didier Neau, Duc Nguyen, Marie-Edith Lafon, Mathieu Molimard, Thierry Schaeverbeke, Nicolas Grenier, Nathalie Salles, Francois Rouanet
Hejblum, Boris
Source :
JAMIA open, JAMIA open, 2022, 5 (4), ⟨10.1093/jamiaopen/ooac086⟩
Publication Year :
2022
Publisher :
HAL CCSD, 2022.

Abstract

Objective The aim of this study was to develop an accurate regional forecast algorithm to predict the number of hospitalized patients and to assess the benefit of the Electronic Health Records (EHR) information to perform those predictions. Materials and Methods Aggregated data from SARS-CoV-2 and weather public database and data warehouse of the Bordeaux hospital were extracted from May 16, 2020 to January 17, 2022. The outcomes were the number of hospitalized patients in the Bordeaux Hospital at 7 and 14 days. We compared the performance of different data sources, feature engineering, and machine learning models. Results During the period of 88 weeks, 2561 hospitalizations due to COVID-19 were recorded at the Bordeaux Hospital. The model achieving the best performance was an elastic-net penalized linear regression using all available data with a median relative error at 7 and 14 days of 0.136 [0.063; 0.223] and 0.198 [0.105; 0.302] hospitalizations, respectively. Electronic health records (EHRs) from the hospital data warehouse improved median relative error at 7 and 14 days by 10.9% and 19.8%, respectively. Graphical evaluation showed remaining forecast error was mainly due to delay in slope shift detection. Discussion Forecast model showed overall good performance both at 7 and 14 days which were improved by the addition of the data from Bordeaux Hospital data warehouse. Conclusions The development of hospital data warehouse might help to get more specific and faster information than traditional surveillance system, which in turn will help to improve epidemic forecasting at a larger and finer scale.

Details

Language :
English
ISSN :
25742531
Database :
OpenAIRE
Journal :
JAMIA open, JAMIA open, 2022, 5 (4), ⟨10.1093/jamiaopen/ooac086⟩
Accession number :
edsair.doi.dedup.....3e6bfd36bc4907b343292f35722d56ea
Full Text :
https://doi.org/10.1093/jamiaopen/ooac086⟩