A clustering‐based feature selection framework for handwritten Indic script classification.

Authors :
Chatterjee, Iman
Ghosh, Manosij
Singh, Pawan Kumar
Sarkar, Ram
Nasipuri, Mita
Source :
Expert Systems. Dec2019, Vol. 36 Issue 6, pN.PAG-N.PAG. 1p.
Publication Year :
2019

Abstract

India has numerous officially recognized scripts, so documents often need to be categorized by the script used in them. Identifying the script used in a document is essential for handling it effectively, both manually and digitally. Script identification in a document image is an important research problem in pattern recognition; it can suffer from the growing dimensionality of the feature vector and therefore requires an efficient feature selection technique. With this in mind, we propose a clustering‐based filter feature selection framework that extracts an optimal and effective feature subset from the original feature vector. The feature selection methodology is evaluated on a script classification problem involving handwritten documents in 12 major Indic scripts. Experiments are conducted at word level, text‐line level, and block level. They demonstrate a reasonable increase in classification accuracy using comparatively fewer features. The proposed framework is computationally inexpensive and can be applied to other pattern recognition problems as well. [ABSTRACT FROM AUTHOR]
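
The abstract does not describe the clustering step itself, so the following is only a generic illustration of what a clustering-based filter for feature selection can look like, not the authors' method. It assumes that features are grouped greedily by absolute Pearson correlation and that the highest-variance feature represents each group. The function names, the 0.9 threshold, and the toy data are all hypothetical.

```python
from math import sqrt
from statistics import mean, pvariance


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def select_features(rows, threshold=0.9):
    """Group correlated feature columns and keep one representative per group.

    Assumption (not from the paper): a feature joins the first cluster whose
    leader it correlates with at |r| >= threshold; otherwise it starts a new
    cluster. The representative is the cluster member with the largest variance.
    """
    columns = [list(col) for col in zip(*rows)]
    clusters = []  # each cluster is a list of column indices; index 0 is the leader
    for j, col in enumerate(columns):
        for cluster in clusters:
            if abs(pearson(col, columns[cluster[0]])) >= threshold:
                cluster.append(j)
                break
        else:
            clusters.append([j])
    selected = [max(c, key=lambda j: pvariance(columns[j])) for c in clusters]
    return sorted(selected), clusters


# Toy data: feature 1 is exactly twice feature 0; feature 2 is unrelated.
rows = [[1, 2, 5], [2, 4, 1], [3, 6, 4], [4, 8, 2]]
selected, clusters = select_features(rows)
print(selected, clusters)
```

Because this is a filter approach, the selection uses only the feature values and never consults a classifier, which is one reason such methods tend to be computationally cheap.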

Details

Language :
English
ISSN :
02664720
Volume :
36
Issue :
6
Database :
Academic Search Index
Journal :
Expert Systems
Publication Type :
Academic Journal
Accession number :
140319721
Full Text :
https://doi.org/10.1111/exsy.12459