151. Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study.
- Author
-
Cai, Mingxiang, Shah, Neal, Li, Jiawei, Chen, Wen-Hao, Cuomo, Raphael E, Obradovich, Nick, and Mackey, Tim K
- Abstract
Introduction: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policymaking. In response, this retrospective infoveillance study identifies and characterizes user-generated messages related to opioid abuse, heroin injection drug use, and HIV status using natural language processing (NLP) among Twitter users in Indiana during the period of this HIV outbreak.
Materials and methods: Our study consisted of two phases: data collection and processing, and data analysis. We collected Indiana geolocated tweets from the public Twitter API using Amazon Web Services EC2 instances filtered for geocoded messages in the immediate pre and post period of the outbreak. In the data analysis phase we applied an unsupervised machine learning approach using NLP called the Biterm Topic Model (BTM) to identify tweets related to opioid, heroin/injection, and HIV behavior and then examined these messages for HIV risk-related topics that could be associated with the outbreak.
Results: More than 10 million geocoded tweets occurring in Indiana during the immediate pre and post period of the outbreak were collected for analysis. Using BTM, we identified 1350 tweets thought to be relevant to the outbreak and then confirmed 358 tweets using human annotation. The most prevalent themes identified were tweets related to self-reported abuse of illicit and prescription drugs, opioid use disorder, self-reported HIV status, and public sentiment regarding the outbreak. Geospatial analysis found that these messages clustered in population dense areas outside of the outbreak, including Indianapolis and neighboring Clark County.
Discussion: This infoveillance study characterized the social media conversations of communities in Indiana in the pre and post period of the 2015 HIV outbreak. Behav
- Published
- 2020