Back to Search
Start Over
CALText: Contextual Attention Localization for Offline Handwritten Text
- Source :
- Neural Processing Letters.
- Publication Year :
- 2023
- Publisher :
- Springer Science and Business Media LLC, 2023.
-
Abstract
- Recognition of Arabic-like scripts such as Persian and Urdu is more challenging than Latin-based scripts. This is due to the presence of a two-dimensional structure, context-dependent character shapes, spaces and overlaps, and placement of diacritics. Not much research exists for offline handwritten Urdu script which is the 10th most spoken language in the world. We present an attention based encoder-decoder model that learns to read Urdu in context. A novel localization penalty is introduced to encourage the model to attend only one location at a time when recognizing the next character. In addition, we comprehensively refine the only complete and publicly available handwritten Urdu dataset in terms of ground-truth annotations. We evaluate the model on both Urdu and Arabic datasets and show that contextual attention localization outperforms both simple attention and multi-directional LSTM models.<br />25 pages, 15 figures and 6 tables
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
I.2.6
Computer Networks and Communications
Computer Vision and Pattern Recognition (cs.CV)
General Neuroscience
Computer Science - Computer Vision and Pattern Recognition
Computer Science - Neural and Evolutionary Computing
Machine Learning (cs.LG)
I.7.5
Artificial Intelligence
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
68T07 (Primary), 68U10 (Secondary)
Neural and Evolutionary Computing (cs.NE)
Software
Subjects
Details
- ISSN :
- 1573773X and 13704621
- Database :
- OpenAIRE
- Journal :
- Neural Processing Letters
- Accession number :
- edsair.doi.dedup.....22522a10129a447677907121fc814c57