Back to Search Start Over

A novel approach to word sense disambiguation in Bengali language using supervised methodology.

Authors :
Pal, Alok Ranjan
Saha, Diganta
Dash, Niladri Sekhar
Naskar, Sudip Kumar
Pal, Antara
Source :
Sādhanā: Academy Proceedings in Engineering Sciences. Aug2019, Vol. 44 Issue 8, pN.PAG-N.PAG. 1p.
Publication Year :
2019

Abstract

An attempt is made in this paper to report how a supervised methodology has been adopted for the task of Word Sense Disambiguation (WSD) in Bengali with necessary modifications. At the initial stage, four commonly used supervised methods, Decision Tree (DT), Support Vector Machine (SVM), Artificial Neural Network (ANN) and Naïve Bayes (NB), are developed at the baseline. These algorithms are applied individually on a data set of 13 most frequently used Bengali ambiguous words. On experimental basis, the baseline strategy is modified with two extensions: (a) inclusion of lemmatization process into the system and (b) bootstrapping of the operational process. As a result, the levels of accuracy of the baseline methods are slightly improved, which is a positive signal for the whole process of disambiguation as it opens scope for further modification of the existing method for better result. In this experiment, the data sets are prepared from the Bengali corpus, developed in the Technology Development for Indian Languages (TDIL) project of the Government of India and from the Bengali WordNet, which is developed at the Indian Statistical Institute, Kolkata. The paper reports the challenges and pitfalls of the work that have been closely observed during the experiment. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02562499
Volume :
44
Issue :
8
Database :
Academic Search Index
Journal :
Sādhanā: Academy Proceedings in Engineering Sciences
Publication Type :
Academic Journal
Accession number :
138171537
Full Text :
https://doi.org/10.1007/s12046-019-1165-2