9 results
Search Results
2. Leveraging machine learning approaches for predicting potential Lyme disease cases and incidence rates in the United States using Twitter.
- Author
-
Boligarla, Srikanth, Laison, Elda Kokoè Elolo, Li, Jiaxin, Mahadevan, Raja, Ng, Austen, Lin, Yangming, Thioub, Mamadou Yamar, Huang, Bruce, Ibrahim, Mohamed Hamza, and Nasri, Bouchra
- Subjects
DISEASE incidence ,MACHINE learning ,VECTOR-borne diseases ,REPORTING of diseases ,LYME disease ,COMMUNICABLE diseases ,SOCIAL media - Abstract
Background: Lyme disease is one of the most commonly reported infectious diseases in the United States (US), accounting for more than 90 % of all vector-borne diseases in North America. Objective: In this paper, self-reported tweets on Twitter were analyzed in order to predict potential Lyme disease cases and accurately assess incidence rates in the US. Methods: The study was done in three stages: (1) Approximately 1.3 million tweets were collected and pre-processed to extract the most relevant Lyme disease tweets with geolocations. A subset of tweets were semi-automatically labelled as relevant or irrelevant to Lyme disease using a set of precise keywords, and the remaining portion were manually labelled, yielding a curated labelled dataset of 77, 500 tweets. (2) This labelled data set was used to train, validate, and test various combinations of NLP word embedding methods and prominent ML classification models, such as TF-IDF and logistic regression, Word2vec and XGboost, and BERTweet, among others, to identify potential Lyme disease tweets. (3) Lastly, the presence of spatio-temporal patterns in the US over a 10-year period were studied. Results: Preliminary results showed that BERTweet outperformed all tested NLP classifiers for identifying Lyme disease tweets, achieving the highest classification accuracy and F1-score of 90 % . There was also a consistent pattern indicating that the West and Northeast regions of the US had a higher tweet rate over time. Conclusions: We focused on the less-studied problem of using Twitter data as a surveillance tool for Lyme disease in the US. Several crucial findings have emerged from the study. First, there is a fairly strong correlation between classified tweet counts and Lyme disease counts, with both following similar trends. Second, in 2015 and early 2016, the social media network like Twitter was essential in raising popular awareness of Lyme disease. Third, counties with a high incidence rate were not necessarily related with a high tweet rate, and vice versa. Fourth, BERTweet can be used as a reliable NLP classifier for detecting relevant Lyme disease tweets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
3. Aspect-based classification of vaccine misinformation: a spatiotemporal analysis using Twitter chatter.
- Author
-
Ismail, Heba, Hussein, Nada, Elabyad, Rawan, Abdelhalim, Salma, and Elhadef, Mourad
- Subjects
MACHINE learning ,SOCIAL media ,MISINFORMATION ,PEARSON correlation (Statistics) ,COVID-19 vaccines - Abstract
Background: The spread of misinformation of all types threatens people's safety and interrupts resolutions. COVID-19 vaccination has been a widely discussed topic on social media platforms with numerous misleading and fallacious information. This false information has a critical impact on the safety of society as it prevents many people from taking the vaccine, decelerating the world's ability to go back to normal. Therefore, it is vital to analyze the content shared on social media platforms, detect misinformation, identify aspects of misinformation, and efficiently represent related statistics to combat the spread of misleading information about the vaccine. This paper aims to support stakeholders in decision-making by providing solid and current insights into the spatiotemporal progression of the common misinformation aspects of the various available vaccines. Methods: Approximately 3800 tweets were annotated into four expert-verified aspects of vaccine misinformation obtained from reliable medical resources. Next, an Aspect-based Misinformation Analysis Framework was designed using the Light Gradient Boosting Machine (LightGBM) model, which is one of the most advanced, fast, and efficient machine learning models to date. Based on this dataset, spatiotemporal statistical analysis was performed to infer insights into the progression of aspects of vaccine misinformation among the public. Finally, the Pearson correlation coefficient and p-values are calculated for the global misinformation count against the vaccination counts of 43 countries from December 2020 until July 2021. Results: The optimized classification per class (i.e., per an aspect of misinformation) accuracy was 87.4%, 92.7%, 80.1%, and 82.5% for the "Vaccine Constituent," "Adverse Effects," "Agenda," "Efficacy and Clinical Trials" aspects, respectively. The model achieved an Area Under the ROC Curve (AUC) of 90.3% and 89.6% for validation and testing, respectively, which indicates the reliability of the proposed framework in detecting aspects of vaccine misinformation on Twitter. The correlation analysis shows that 37% of the countries addressed in this study were negatively affected by the spread of misinformation on Twitter resulting in reduced number of administered vaccines during the same timeframe. Conclusions: Twitter is a rich source of insight on the progression of vaccine misinformation among the public. Machine Learning models, such as LightGBM, are efficient for multi-class classification and proved reliable in classifying vaccine misinformation aspects even with limited samples in social media datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. Text classification models for the automatic detection of nonmedical prescription medication use from social media
- Author
-
Al-Garadi, Mohammed Ali, Yang, Yuan-Chi, Cai, Haitao, Ruan, Yucheng, O’Connor, Karen, Graciela, Gonzalez-Hernandez, Perrone, Jeanmarie, and Sarker, Abeed
- Published
- 2021
- Full Text
- View/download PDF
5. Potential benefits and limitations of machine learning in the field of eating disorders: current research and future directions
- Author
-
Fardouly, Jasmine, Crosby, Ross D., and Sukunesan, Suku
- Published
- 2022
- Full Text
- View/download PDF
6. A framework for multi-faceted content analysis of social media chatter regarding non-medical use of prescription medications
- Author
-
Raza, Shaina, Schwartz, Brian, Lakamana, Sahithi, Ge, Yao, and Sarker, Abeed
- Published
- 2023
- Full Text
- View/download PDF
7. Prediction of vaccine hesitancy based on social media traffic among Israeli parents using machine learning strategies
- Author
-
Bar-Lev, Shirly, Reichman, Shahar, and Barnett-Itzhaki, Zohar
- Published
- 2021
- Full Text
- View/download PDF
8. Machine learning for distinguishing saudi children with and without autism via eye-tracking data.
- Author
-
Alarifi, Hana, Aldhalaan, Hesham, Hadjikhani, Nouchine, Johnels, Jakob Åsberg, Alarifi, Jhan, Ascenso, Guido, and Alabdulaziz, Reem
- Subjects
DIAGNOSIS of autism ,BIOMARKERS ,COMPUTER software ,EYE movements ,VISUAL fields ,TIME ,CLASSIFICATION ,SOCIAL media ,EFFECT sizes (Statistics) ,MACHINE learning ,DIFFERENTIAL diagnosis ,TASK performance ,ACCURACY ,RANDOM forest algorithms ,PATIENTS ,EYE ,FACE ,AUTISM in children ,AUTISM ,ATTENTION ,VISUAL perception ,DESCRIPTIVE statistics ,RESEARCH funding ,QUESTIONNAIRES ,DATA analysis software ,ALGORITHMS ,EMAIL - Abstract
Background: Despite the prevalence of Autism Spectrum Disorder (ASD) globally, there's a knowledge gap pertaining to autism in Arabic nations. Recognizing the need for validated biomarkers for ASD, our study leverages eye-tracking technology to understand gaze patterns associated with ASD, focusing on joint attention (JA) and atypical gaze patterns during face perception. While previous studies typically evaluate a single eye-tracking metric, our research combines multiple metrics to capture the multidimensional nature of autism, focusing on dwell times on eyes, left facial side, and joint attention. Methods: We recorded data from 104 participants (41 neurotypical, mean age: 8.21 ± 4.12 years; 63 with ASD, mean age 8 ± 3.89 years). The data collection consisted of a series of visual stimuli of cartoon faces of humans and animals, presented to the participants in a controlled environment. During each stimulus, the eye movements of the participants were recorded and analyzed, extracting metrics such as time to first fixation and dwell time. We then used these data to train a number of machine learning classification algorithms, to determine if these biomarkers can be used to diagnose ASD. Results: We found no significant difference in eye-dwell time between autistic and control groups on human or animal eyes. However, autistic individuals focused less on the left side of both human and animal faces, indicating reduced left visual field (LVF) bias. They also showed slower response times and shorter dwell times on congruent objects during joint attention (JA) tasks, indicating diminished reflexive joint attention. No significant difference was found in time spent on incongruent objects during JA tasks. These results suggest potential eye-tracking biomarkers for autism. The best-performing algorithm was the random forest one, which achieved accuracy = 0.76 ± 0.08, precision = 0.78 ± 0.13, recall = 0.84 ± 0.07, and F1 = 0.80 ± 0.09. Conclusions: Although the autism group displayed notable differences in reflexive joint attention and left visual field bias, the dwell time on eyes was not significantly different. Nevertheless, the machine algorithm model trained on these data proved effective at diagnosing ASD, showing the potential of these biomarkers. Our study shows promising results and opens up potential for further exploration in this under-researched geographical context. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. Optimization on machine learning based approaches for sentiment analysis on HPV vaccines related tweets.
- Author
-
Jingcheng Du, Jun Xu, Hsingyi Song, Xiangyu Liu, and Cui Tao
- Subjects
HUMAN papillomavirus vaccines ,MACHINE learning ,INDUSTRIAL efficiency ,SENTIMENT analysis ,SUPPORT vector machines - Abstract
Background: Analysing public opinions on HPV vaccines on social media using machine learning based approaches will help us understand the reasons behind the low vaccine coverage and come up with corresponding strategies to improve vaccine uptake. Objective: To propose a machine learning system that is able to extract comprehensive public sentiment on HPV vaccines on Twitter with satisfying performance. Method: We collected and manually annotated 6,000 HPV vaccines related tweets as a gold standard. SVM model was chosen and a hierarchical classification method was proposed and evaluated. Additional feature sets evaluation and model parameters optimization was done to maximize the machine learning model performance. Results: A hierarchical classification scheme that contains 10 categories was built to access public opinions toward HPV vaccines comprehensively. A 6,000 annotated tweets gold corpus with Kappa annotation agreement at 0.851 was created and made public available. The hierarchical classification model with optimized feature sets and model parameters has increased the micro-averaging and macro-averaging F score from 0.6732 and 0.3967 to 0.7442 and 0.5883 respectively, compared with baseline model. Conclusions: Our work provides a systematical way to improve the machine learning model performance on the highly unbalanced HPV vaccines related tweets corpus. Our system can be further applied on a large tweets corpus to extract large-scale public opinion towards HPV vaccines. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.