Back to Search Start Over

Data Augmentation for Offline Arabic Handwritten Text Recognition Using Moving Least Squares.

Authors :
Chadli, Mohamed Amine
Bouiadjra, Rochdi Bachir
Fekir, Abdelkader
Martínez-Gómez, Jesus
Gámez, José A.
Source :
Revue d'Intelligence Artificielle; Feb2024, Vol. 38 Issue 1, p1-9, 9p
Publication Year :
2024

Abstract

This paper addresses the research problem of Offline Arabic Handwriting Text Recognition (HTR). One of the most important approaches to HTR systems is deep learning. A large amount of annotated data is needed to train deep learning-based HTR systems. The Arabic language is spoken by hundreds of millions of people in North Africa and the Middle East. Writing styles and common words differ significantly between those regions. Due to the great diversity possible, designing a statistically represented and balanced database of Arabic handwritten texts by gathering and labeling the texts is an arduous task to achieve. One of the ways to enrich the training databases is by augmenting the existing data. We have developed a new data augmentation technique for Arabic handwritten texts using Moving Least Squares (MLS) to deform the images. This technique results in realistic images that look like manipulating real-world images, and the deformations are done using linear functions that produce deformations in real time. We aim to deform the training data images randomly in a way that the text present in the images is still recognizable by a human. This augmentation technique can be used directly on images to augment them unlike other techniques such as Generative Adversarial Networks (GAN) where they must be trained beforehand. At the same time, it produces new complex augmented images compared to simple traditional augmentation techniques such as rotations and translations. In addition to this augmentation technique, we used a deep learning system called Convolutional Recurrent Neural Networks (CRNN) to test the new technique, and we have experimented with a CRNN model that accepts small input-size images to boost the time needed for both training and image augmentations. All the experimentations are carried out on the Arabic IFN/ENIT database. The results show that the small input size CRNN model outperforms the large input size CRNN model by a big margin. The results also show that the integration of images augmented by the MLS technique can help the recognition system to generalize better on the test data, therefore, it can slightly improve the performance of the recognition system. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0992499X
Volume :
38
Issue :
1
Database :
Complementary Index
Journal :
Revue d'Intelligence Artificielle
Publication Type :
Academic Journal
Accession number :
176040519
Full Text :
https://doi.org/10.18280/ria.380101