1. Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models
- Author
-
Maksat Kanatov, Kairat Bostanbekov, Abdelrahman Abdallah, Galymzhan Abdimanap, Daniyar Nurseitov, and Anel Alimova
- Subjects
FOS: Computer and information sciences ,Physics and Astronomy (miscellaneous) ,Relation (database) ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Feature extraction ,Computer Science - Computer Vision and Pattern Recognition ,02 engineering and technology ,Kazakh ,computer.software_genre ,Convolutional neural network ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,0302 clinical medicine ,Handwriting ,Management of Technology and Innovation ,0202 electrical engineering, electronic engineering, information engineering ,Engineering (miscellaneous) ,business.industry ,Deep learning ,language.human_language ,ComputingMethodologies_PATTERNRECOGNITION ,Recurrent neural network ,Handwriting recognition ,language ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
This article discusses the problem of handwriting recognition in Kazakh and Russian languages. This area is poorly studied since in the literature there are almost no works in this direction. We have tried to describe various approaches and achievements of recent years in the development of handwritten recognition models in relation to Cyrillic graphics. The first model uses deep convolutional neural networks (CNNs) for feature extraction and a fully connected multilayer perceptron neural network (MLP) for word classification. The second model, called SimpleHTR, uses CNN and recurrent neural network (RNN) layers to extract information from images. We also proposed the Bluechet and Puchserver models to compare the results. Due to the lack of available open datasets in Russian and Kazakh languages, we carried out work to collect data that included handwritten names of countries and cities from 42 different Cyrillic words, written more than 500 times in different handwriting. We also used a handwritten database of Kazakh and Russian languages (HKR). This is a new database of Cyrillic words (not only countries and cities) for the Russian and Kazakh languages, created by the authors of this work.
- Published
- 2020
- Full Text
- View/download PDF