Back to Search
Start Over
Identifying tweets of personal health experience through word embedding and LSTM neural network
- Source :
- BMC Bioinformatics, BMC Bioinformatics, Vol 19, Iss S8, Pp 67-74 (2018)
- Publication Year :
- 2018
- Publisher :
- Springer Science and Business Media LLC, 2018.
-
Abstract
- Background As Twitter has become an active data source for health surveillance research, it is important that efficient and effective methods are developed to identify tweets related to personal health experience. Conventional classification algorithms rely on features engineered by human domain experts, and engineering such features is a challenging task and requires much human intelligence. The resultant features may not be optimal for the classification problem, and can make it challenging for conventional classifiers to correctly predict personal experience tweets (PETs) due to the various ways to express and/or describe personal experience in tweets. In this study, we developed a method that combines word embedding and long short-term memory (LSTM) model without the need to engineer any specific features. Through word embedding, tweet texts were represented as dense vectors which in turn were fed to the LSTM neural network as sequences. Results Statistical analyses of the results of 10-fold cross-validations of our method and conventional methods indicate that there exist significant differences (p
- Subjects :
- Word embedding
020205 medical informatics
Process (engineering)
Computer science
Twitter
02 engineering and technology
lcsh:Computer applications to medicine. Medical informatics
Machine learning
computer.software_genre
Vocabulary
Unsupervised feature learning
Biochemistry
LSTM neural network
Domain (software engineering)
Social media
Pharmacovigilance
Structural Biology
0202 electrical engineering, electronic engineering, information engineering
Humans
lcsh:QH301-705.5
Molecular Biology
Artificial neural network
Human intelligence
business.industry
Research
Applied Mathematics
Deep learning
Computer Science Applications
Statistical classification
lcsh:Biology (General)
Health
lcsh:R858-859.7
020201 artificial intelligence & image processing
Neural Networks, Computer
Artificial intelligence
business
Health surveillance
computer
Algorithms
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 19
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....4dfd37fce7ef366258e9c4a6145beb40
- Full Text :
- https://doi.org/10.1186/s12859-018-2198-y