학술논문

A Novel Framework to Forecast COVID-19 Incidence Based on Google Trends Search Data
Document Type
Periodical
Source
IEEE Transactions on Computational Social Systems IEEE Trans. Comput. Soc. Syst. Computational Social Systems, IEEE Transactions on. 11(1):1352-1361 Feb, 2024
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
General Topics for Engineers
COVID-19
Predictive models
Market research
Internet
Data models
Wavelet transforms
Meteorology
Coronavirus disease 2019 (COVID-19)
forecasting
Google Trends
Wiener model
Language
ISSN
2329-924X
2373-7476
Abstract
The global outbreak of coronavirus disease 2019 (COVID-19) has spread to more than 200 countries worldwide, leading to severe health and socioeconomic consequences. As such, the topic of monitoring and predicting epidemics has been attracting a lot of interest. Previous work reported search volumes from Google Trends are beneficial in decoding influenza dynamics, implying its potential for COVID-19 prediction. Therefore, a predictive model using the Wiener methods was built based on epidemic-related search queries from Google Trends, along with climate variables, aiming to forecast the dynamics of the weekly COVID-19 incidence in Washington, DC, USA. The Wiener model, which shares the merits of interpretability, low computation costs, and adaptation to nonlinear fluctuations, was used in this study. Models with multiple sets of features were constructed and further optimized by the highest weight selecting strategy. Furthermore, comparisons to the other two commonly used prediction models based on the autoregressive integrated moving average (ARIMA) and long short-term memory (LSTM) were also performed. Our results showed the predicted COVID-19 trends significantly correlated with the actual (rho = 0.88, $p < 0.0001$ ), outperforming those with ARIMA and LSTM approaches, indicating Google Trends data as a useful tool in terms of COVID-19 prediction. Also, the model using 20 search queries with the highest weighting outperformed all other models, supporting the highest weight feature selection as a feasible criterion. Google Trends search query data can be used to forecast the outbreak of COVID-19, which might assist health policymakers to allocate health care resources and taking preventive strategies.