Back to Search Start Over

Self-information of radicals: A new clue for zero-shot Chinese character recognition.

Authors :
Luo, Guo-Feng
Wang, Da-Han
Du, Xia
Yin, Hua-Yi
Zhang, Xu-Yao
Zhu, Shunzhi
Source :
Pattern Recognition. Aug2023, Vol. 140, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• We propose the self-information of radicals (SIR) from the information theory perspective to measure the importance of radicals in recognizing Chinese characters. • The proposed SIR can be easily adopted by two commonly used radical-based zero-shot Chinese Character Recognition (ZSCCR) frameworks, i.e., sequence matching based and attribute embedding based. • For sequence matching based ZSCCR, we propose a novel Chinese character uncertainty elimination (CUE) framework, which is capable of alleviating the sequence mismatch problem. • For attribute embedding based ZSCCR, we propose a novel radical information embedding (RIE) method, which can highlight the importance of indispensable radicals. • Comprehensive experiments on the CASIA-HWDB, ICDAR2013, CTW, and AHCDB datasets demonstrate the effectiveness and high extensibility of the proposed SIR. Zero-shot Chinese character recognition (ZSCCR) is an important research topic in Chinese character recognition as it attempts to recognize unseen Chinese characters. As basic components and mid-level representations, radicals are significant for ZSCCR. However, previous methods treat the importance of radicals equally, ignoring the different contributions of radicals in distinguishing characters. In this paper, we propose the self-information of radicals (SIR) to measure the importance of radicals in recognizing Chinese characters. The proposed SIR can be easily adopted by two commonly used radical-based ZSCCR frameworks, i.e., sequence matching based and attribute embedding based. For sequence matching based ZSCCR, we propose a novel Chinese character uncertainty elimination (CUE) framework to alleviate the radical sequence mismatch problem. For attribute embedding based ZSCCR, we propose a novel radical information embedding (RIE) method that can highlight the importance of indispensable radicals and weaken the influence of some unnecessary radicals. We conducted comprehensive experiments on the CASIA-HWDB, ICDAR2013, CTW datasets, and AHCDB datasets to evaluate the proposed method. Experiments show that our proposed methods can achieve superior performance to the state-of-the-art methods, which demonstrate the effectiveness and the high extensibility of the proposed SIR. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00313203
Volume :
140
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
163267124
Full Text :
https://doi.org/10.1016/j.patcog.2023.109598