Forecasting the west nile virus in the United States: An extensive novel data streams-based time series analysis and structural equation modeling of related digital searching behavior

Abdulla Watad, Samaa Watad, Naim Mahroum, Kassem Sharif, Howard Amital, Nicola Luigi Bragazzi, Mohammad Adawi

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

Background: West Nile virus is an arbovirus responsible for an infection that tends to peak during the late summer and early fall. Tools monitoring Web searches are emerging as powerful sources of data, especially concerning infectious diseases such as West Nile virus. Objective: This study aimed at exploring the potential predictive power of West Nile virus-related Web searches. Methods: Different novel data streams, including Google Trends, WikiTrends, YouTube, and Google News, were used to extract search trends. Data regarding West Nile virus cases were obtained from the Centers for Disease Control and Prevention. Data were analyzed using regression, times series analysis, structural equation modeling, and clustering analysis. Results: In the regression analysis, an association between Web searches and “real-world” epidemiological figures was found. The best seasonal autoregressive integrated moving average model with explicative variable (SARIMAX) was found to be (0,1,1)x(0,1,1)4. Using data from 2004 to 2015, we were able to predict data for 2016. From the structural equation modeling, the consumption of West Nile virus-related news fully mediated the relation between Google Trends and the consumption of YouTube videos, as well as the relation between the latter variable and the number of West Nile virus cases. Web searches fully mediated the relation between epidemiological figures and the consumption of YouTube videos, as well as the relation between epidemiological data and the number of accesses to the West Nile virus-related Wikipedia page. In the clustering analysis, the consumption of news was most similar to the Web searches pattern, which was less close to the consumption of YouTube videos and least similar to the behavior of accessing West Nile virus-related Wikipedia pages. Conclusions: Our study demonstrated an association between epidemiological data and search patterns related to the West Nile virus. Based on this correlation, further studies are needed to examine the practicality of these findings.

Original languageEnglish
Article numbere9176
JournalJMIR Public Health and Surveillance
Volume5
Issue number1
DOIs
StatePublished - 28 Feb 2019

Bibliographical note

Publisher Copyright:
© Abdulla Watad, Samaa Watad, Naim Mahroum, Kassem Sharif, Howard Amital, Nicola Luigi Bragazzi, Mohammad Adawi. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 28.02.2019. This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.

Keywords

  • Forecasting model
  • Google trends
  • Infodemiology
  • Infoveillance
  • Seasonal autoregressive integrated moving average model with explicative variable (SARIMAX)
  • West nile virus

Fingerprint

Dive into the research topics of 'Forecasting the west nile virus in the United States: An extensive novel data streams-based time series analysis and structural equation modeling of related digital searching behavior'. Together they form a unique fingerprint.

Cite this