Back to Search
Start Over
A survey on sentiment analysis in Urdu: A resource-poor language
A survey on sentiment analysis in Urdu: A resource-poor language
- Source :
- Egyptian Informatics Journal, Egyptian Informatics Journal, Vol 22, Iss 1, Pp 53-74 (2021)
- Publication Year :
- 2021
- Publisher :
- Elsevier BV, 2021.
-
Abstract
- Background/introduction The dawn of the internet opened the doors to the easy and widespread sharing of information on subject matters such as products, services, events and political opinions. While the volume of studies conducted on sentiment analysis is rapidly expanding, these studies mostly address English language concerns. The primary goal of this study is to present state-of-art survey for identifying the progress and shortcomings saddling Urdu sentiment analysis and propose rectifications. Methods We described the advancements made thus far in this area by categorising the studies along three dimensions, namely: text pre-processing lexical resources and sentiment classification. These pre-processing operations include word segmentation, text cleaning, spell checking and part-of-speech tagging. An evaluation of sophisticated lexical resources including corpuses and lexicons was carried out, and investigations were conducted on sentiment analysis constructs such as opinion words, modifiers, negations. Results and conclusions Performance is reported for each of the reviewed study. Based on experimental results and proposals forwarded through this paper provides the groundwork for further studies on Urdu sentiment analysis. Previous article in issueNext article in issue Keywords Urdu sentiment analysisPre-processingSentiment lexiconDatasetsCorpusUrdu sentiment classificationSemantic orientation
- Subjects :
- Computer science
02 engineering and technology
Pre-processing
Management Science and Operations Research
Corpus
computer.software_genre
Negation
0202 electrical engineering, electronic engineering, information engineering
Datasets
Resource poor
Urdu sentiment analysis
business.industry
Sentiment analysis
Text segmentation
Spell
020206 networking & telecommunications
Subject (documents)
QA75.5-76.95
language.human_language
Computer Science Applications
Sentiment lexicon
Electronic computers. Computer science
language
Urdu sentiment classification
020201 artificial intelligence & image processing
The Internet
Urdu
Artificial intelligence
business
computer
Natural language processing
Information Systems
Subjects
Details
- ISSN :
- 11108665
- Volume :
- 22
- Database :
- OpenAIRE
- Journal :
- Egyptian Informatics Journal
- Accession number :
- edsair.doi.dedup.....18322a85c0575f1541e1810531345707
- Full Text :
- https://doi.org/10.1016/j.eij.2020.04.003