Back to Search Start Over

DPF-S2S: A novel dual-pathway-fusion-based sequence-to-sequence text recognition model.

Authors :
Zhang, Yuqing
Wu, Peishu
Li, Han
Liu, Yurong
Alsaadi, Fuad E.
Zeng, Nianyin
Source :
Neurocomputing. Feb2023, Vol. 523, p182-190. 9p.
Publication Year :
2023

Abstract

In this paper, a novel dual-pathway-fusion-based sequence-to-sequence learning model (DPF-S2S) is proposed for text recognition in the wild, which mainly focuses on enriching the spatial information and extracting high-dimensional representation features to assist decoding. In particular, a double alignment module is developed to solve the problem of text misalignment, where both position and vision information are well considered. Moreover, a global fusion module is deployed to enrich 2D information in the aligned attention maps, which benefits accurate recognition from complicated scenes with arbitrary text shapes and poor imaging conditions. Benchmark evaluations on seven datasets have demonstrated the superiority of proposed DPF-S2S model in comparison to other state-of-the-art text recognition methods, which presents great competitiveness on identifying texts in both regular and irregular scenes. In addition, extensive ablation studies have been carried out, which validate the effectiveness of applied strategies in proposed DPF-S2S. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*TEXT recognition
*PROBLEM solving

Details

Language :
English
ISSN :
09252312
Volume :
523
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
161174730
Full Text :
https://doi.org/10.1016/j.neucom.2022.12.034