Start Over

Filtering offensive language from multilingual social media contents: A deep learning approach.

Authors :: Saumya, Sunil
Kumar, Abhinav
Singh, Jyoti Prakash
Source :: Engineering Applications of Artificial Intelligence. Jul2024:Part A, Vol. 133, pN.PAG-N.PAG. 1p.
Publication Year :: 2024
Abstract: In the face of uncontrolled offensive content on social media, automated detection emerges as a critical need. This paper tackles this challenge by proposing a novel approach for identifying offensive language in multilingual, code-mixed, and script-mixed settings. The study presents a novel multilingual hybrid dataset constructed by merging diverse monolingual and bilingual resources. Further, we systematically evaluate the impact of input representations (Word2Vec, Global Vectors for Word Representation (or GloVe), Bidirectional Encoder Representations from Transformers (or BERT), and uniform initialization) and deep learning models (Convolutional Neural Network (or CNN), Bidirectional Long Short Term Memory (or Bi-LSTM), Bi-LSTM-Attention, and fine-tuned BERT) on detection accuracy. Our comprehensive experiments on a dataset of 42,560 social media comments from five languages (English, Hindi, German, Tamil, and Malayalam) reveal the superiority of fine-tuned BERT. Notably, it achieves a macro average F 1 -score of 0.79 for monolingual tasks and an impressive 0.86 for code-mixed and script-mixed tasks. These findings significantly advance offensive language detection methodologies and shed light on the complex dynamics of multilingual social media, paving the way for more inclusive and safer online communities. [ABSTRACT FROM AUTHOR]

Subjects :: *DEEP learning
*LANGUAGE models
*CONVOLUTIONAL neural networks
*SOCIAL media
*SURGICAL gloves
*LONG-term memory

Details

Language :: English
ISSN :: 09521976
Volume :: 133
Database :: Academic Search Index
Journal :: Engineering Applications of Artificial Intelligence
Publication Type :: Academic Journal
Accession number :: 177605529
Full Text :: https://doi.org/10.1016/j.engappai.2024.108159

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Filtering offensive language from multilingual social media contents: A deep learning approach.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Filtering offensive language from multilingual social media contents: A deep learning approach.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources