
Sentiment analysis on Malay-English mixed language text using artificial neural network.

Authors :
Yann, Lim May
Zahri, N. A. H.
Amir, Amiza
Romli, R.
Ghazali, N. H.
Anwar, S. A.
Hashim, N. M. Z.
Source :
AIP Conference Proceedings. 2024, Vol. 2898 Issue 1, p1-13. 13p.
Publication Year :
2024

Abstract

Sentiment analysis (SA) is the study of people's emotions and attitudes toward a particular topic. It is useful for monitoring and analyzing social media text to gauge public opinion. Although SA applications exist for monolingual text in English and in other languages such as Hindi, Chinese and French, far fewer works address the Malay language, and fewer still address mixed-language text such as Malay-English (also known as Manglish). Beyond the comments and posts on websites and social media, the emoji used by internet users can provide better insight into how they truly feel about a topic. Our work focuses on Malay-English mixed-language comments and posts that express how Malaysians feel about daily new cases of Covid-19 in Malaysia. We propose a neural network framework that performs SA on the languages spoken by Malaysians, namely Malay, English, and Malay-English, while also taking into account the emoji used by internet users. The data were pre-processed to remove noise and then transformed into word vector representations using a word embedding technique. The framework trains and tests a bidirectional Long Short-Term Memory (biLSTM) neural network on the mixed-language text together with emoji analysis. For comparison, several machine learning models and a Long Short-Term Memory (LSTM) network with word vectorization were used. Compared with machine learning models such as Naïve Bayes and Logistic Regression and with neural networks such as LSTM, the proposed biLSTM with tuned hyper-parameters for Malay-English mixed-language text achieved the highest accuracy, 76.6%, with a macro F1-score of 69.6%. [ABSTRACT FROM AUTHOR]
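
The abstract reports both accuracy and macro F1-score, and the two figures differ by seven points. A gap of this kind typically appears when classes are imbalanced, because macro F1 averages the per-class F1 scores without weighting by class size. The sketch below illustrates the two metrics under their standard definitions. The function names, the three labels ("neg", "neu", "pos") and the toy data are assumptions made for illustration; they are not taken from the paper, and the abstract does not state which classes or tooling the authors used.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the gold labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical, imbalanced toy labels (not data from the paper).
y_true = ["neg"] * 6 + ["neu"] * 2 + ["pos"] * 2
y_pred = ["neg"] * 6 + ["neg", "neu"] + ["neg", "pos"]

if __name__ == "__main__":
    print(f"accuracy = {accuracy(y_true, y_pred):.3f}")
    print(f"macro F1 = {macro_f1(y_true, y_pred):.3f}")
```

On this toy data the majority class is predicted perfectly while each minority class loses half its examples, so accuracy stays higher than macro F1. This is the same direction as the gap reported in the abstract, though the toy numbers say nothing about the paper's actual class distribution.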

Details

Language :
English
ISSN :
0094243X
Volume :
2898
Issue :
1
Database :
Academic Search Index
Journal :
AIP Conference Proceedings
Publication Type :
Conference
Accession number :
175345784
Full Text :
https://doi.org/10.1063/5.0192401