1. Using big data to predict pertussis infections in Jinan city, China: a time series analysis.
- Author
- Zhang, Yuzhou; Bambrick, Hilary; Mengersen, Kerrie; Tong, Shilu; Feng, Lei; Zhang, Li; Liu, Guifang; Xu, Aiqiang; and Hu, Wenbiao
- Subjects
- *TIME series analysis, *REGRESSION trees, *BOX-Jenkins forecasting, *BIG data, *STANDARD deviations, *SCHOOL schedules, *WHOOPING cough
- Abstract
- This study aims to use big data (climate data, internet query data and school calendar patterns (SCP)) to improve pertussis surveillance and prediction, and to develop an early warning model for pertussis epidemics. We collected weekly pertussis notifications, SCP, climate and internet search query data (Baidu index (BI)) in Jinan, China between 2013 and 2017. Time series decomposition and temporal risk assessment were used to examine the epidemic features of pertussis infections. A seasonal autoregressive integrated moving average (SARIMA) model and a regression tree model were developed to predict pertussis occurrence using the identified predictors. Our study demonstrates clear seasonal patterns in pertussis epidemics, and pertussis activity was most significantly associated with BI at a 2-week lag (r_BI = 0.73, p < 0.05), temperature at a 1-week lag (r_temp = 0.19, p < 0.05) and rainfall at a 2-week lag (r_rainfall = 0.27, p < 0.05). No obvious relationship between pertussis peaks and school attendance was found in the study. Pertussis cases were more likely to be temporally concentrated throughout the epidemics during the study period. SARIMA models with 2-week-lagged BI and 1-week-lagged temperature had better predictive performance (β_search query = 0.06, p = 0.02; β_temp = 0.16, p = 0.03), with a large correlation coefficient (r = 0.67, p < 0.01) and a low root mean squared error (RMSE = 3.59). The regression tree model identified threshold values of potential predictors (search query, climate and SCP) for pertussis epidemics. Our results showed that internet query data in conjunction with social and climatic data can predict pertussis epidemics, which provides a foundation for using such data to develop early warning systems. [ABSTRACT FROM AUTHOR]
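The lagged associations reported in the abstract (for example, BI at a 2-week lag) can be illustrated with a minimal sketch of a lagged Pearson correlation. This is not the authors' code: the function names `pearson`, `lagged_correlation` and `best_lag` are hypothetical, and the two short series are synthetic values invented for illustration, not data from the study.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("sequences must have equal length of at least 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def lagged_correlation(predictor, cases, lag):
    """Correlate the predictor at week t - lag with case counts at week t."""
    if len(predictor) != len(cases):
        raise ValueError("series must have equal length")
    if lag < 0 or lag > len(cases) - 2:
        raise ValueError("lag out of range for this series length")
    if lag == 0:
        return pearson(predictor, cases)
    # Drop the last `lag` predictor weeks and the first `lag` case weeks
    # so that each predictor value is paired with the count `lag` weeks later.
    return pearson(predictor[:-lag], cases[lag:])


def best_lag(predictor, cases, max_lag):
    """Return the lag (0..max_lag) with the largest absolute correlation."""
    scores = {lag: lagged_correlation(predictor, cases, lag)
              for lag in range(max_lag + 1)}
    return max(scores, key=lambda lag: abs(scores[lag])), scores


# Synthetic weekly series (illustrative only): case counts follow the
# search index with a 2-week delay by construction.
search_index = [1, 3, 2, 5, 4, 7, 6, 9, 8, 10, 3, 1]
weekly_cases = [0, 0] + [2 * v + 1 for v in search_index[:-2]]

lag, scores = best_lag(search_index, weekly_cases, max_lag=4)
print(lag, round(scores[lag], 2))
```

A sketch like this only screens candidate lags; the predictive models named in the abstract (SARIMA with lagged covariates, regression trees) would require a time series library and are not reproduced here.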
- Published
- 2020