Start Over

ET-Network: A novel efficient transformer deep learning model for automated Urdu handwritten text recognition.

Authors :: Hamza, Ameer
Ren, Shengbing
Saeed, Usman
Source :: PLoS ONE. 5/17/2024, Vol. 19 Issue 5, p1-21. 21p.
Publication Year :: 2024
Abstract: Automatic Urdu handwritten text recognition is a challenging task in the OCR industry. Unlike printed text, Urdu handwriting lacks a uniform font and structure. This lack of uniformity causes data inconsistencies and recognition issues. Different writing styles, cursive scripts, and limited data make Urdu text recognition a complicated task. Major languages, such as English, have experienced advances in automated recognition, whereas low-resource languages, such as Urdu, still lag. Transformer-based models are promising for automated recognition in high- and low-resource languages such as Urdu. This paper presents a transformer-based method called ET-Network that integrates self-attention into EfficientNet for feature extraction and a transformer for language modeling. The use of self-attention layers in EfficientNet helps to extract global and local features that capture long-range dependencies. These features proceeded into a vanilla transformer to generate text, and a prefix beam search is used for the finest outcome. NUST-UHWR, UPTI2.0, and MMU-OCR-21 are three datasets used to train and test the ET Network for a handwritten Urdu script. The ET-Network improved the character error rate by 4% and the word error rate by 1.55%, while establishing a new state-of-the-art character error rate of 5.27% and a word error rate of 19.09% for Urdu handwritten text. [ABSTRACT FROM AUTHOR]

Subjects :: *DEEP learning
*TEXT recognition
*TRANSFORMER models
*FEATURE extraction
*ERROR rates
*HANDWRITING

Details

Language :: English
ISSN :: 19326203
Volume :: 19
Issue :: 5
Database :: Academic Search Index
Journal :: PLoS ONE
Publication Type :: Academic Journal
Accession number :: 177326091
Full Text :: https://doi.org/10.1371/journal.pone.0302590

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

ET-Network: A novel efficient transformer deep learning model for automated Urdu handwritten text recognition.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

ET-Network: A novel efficient transformer deep learning model for automated Urdu handwritten text recognition.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources