Back to Search Start Over

Data Mining Techniques to Categorize Single Paragraph-Formed Self-narrated Stories

Authors :
Niloy Biswas
Md. Mahmudul Haque
Niloy Saha Roy
Syeda Saiara Lubaba
Rakibul Islam
Amzad Hossain Rafi
Rashedur M. Rahman
Sajid-ul Islam
Source :
ICT Analysis and Applications ISBN: 9789811583537
Publication Year :
2020
Publisher :
Springer Singapore, 2020.

Abstract

In this age of natural language processing, most of the sentiment analysis tasks are done by polarization, for example, 0 for negative or 1 for positive of the given context/text. In some work, the tasks are done using fine-grained polarization, such as very negative or very positive. The proposed system of this paper includes the categorization of the paragraphs using its nature. All the paragraphs are self-narrated, and the number of words in those self-narrated paragraphs contains 50–4200 words. The paragraphs are categorized using three categorizations: “work stress,” “bullying,” and “sexual harassment” in both real and cyber worlds. Artificial neural network paragraph vectors, a distributed bag-of-words and distributed memory, are used to get the embedding of each paragraph and later for classification by data mining techniques. The accuracy of each algorithm lies between 70 and 94%. The best model gives a 77.46% F1 score in the test set.

Details

Database :
OpenAIRE
Journal :
ICT Analysis and Applications ISBN: 9789811583537
Accession number :
edsair.doi...........cfaf041d06f98c93fc65f022ce63a6a1
Full Text :
https://doi.org/10.1007/978-981-15-8354-4_70