Back to Search Start Over

Pinyin-to-Chinese conversion on sentence-level for domain-specific applications using self-attention model.

Authors :
Xiong, Shufeng
Ma, Li
Cheng, Ming
Wang, Bingkun
Source :
Multimedia Systems. Apr2022, Vol. 28 Issue 2, p375-386. 12p.
Publication Year :
2022

Abstract

In the pinyin-based Chinese input method engine (IME), its performance depends mainly on the Pinyin-to-Chinese (P2C) conversion module. Traditional methods for P2C follow a pipeline procedure, which typically suffers from error propagation. Also, the ability to input the whole sentence of pinyin-based Chinese IME for domain-specific application needs to be improved. In this paper, we propose a neural self-attention model for Pinyin Sequence to Chinese Sequence (PS2CS) conversion method, which directly infers the entire Chinese sequence by feeding the unsegmented pinyin character sequence into. Our experimental results show that the proposed method outperforms baselines and the commercial IME on specific medical domain dataset, and also achieves comparable performance on the domain-general dataset. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*DEEP learning

Details

Language :
English
ISSN :
09424962
Volume :
28
Issue :
2
Database :
Academic Search Index
Journal :
Multimedia Systems
Publication Type :
Academic Journal
Accession number :
156342383
Full Text :
https://doi.org/10.1007/s00530-021-00829-y