Back to Search
Start Over
An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech
- Source :
- ICASSP
- Publication Year :
- 2018
- Publisher :
- IEEE, 2018.
-
Abstract
- End-to-end automatic speech recognition (ASR) can significantly reduce the burden of developing ASR systems for new languages, by eliminating the need for linguistic information such as pronunciation dictionaries. This also creates an opportunity to build a monolithic multilingual ASR system with a language-independent neural network architecture. In our previous work, we proposed a monolithic neural network architecture that can recognize multiple languages, and showed its effectiveness compared with conventional language-dependent models. However, the model is not guaranteed to properly handle switches in language within an utterance, thus lacking the flexibility to recognize mixed-language speech such as code-switching. In this paper, we extend our model to enable dynamic tracking of the language within an utterance, and propose a training procedure that takes advantage of a newly created mixed-language speech corpus. Experimental results show that the extended model outperforms both language-dependent models and our previous model without suffering from performance degradation that could be associated with language switching.
- Subjects :
- Flexibility (engineering)
Artificial neural network
Language identification
Computer science
Speech recognition
020206 networking & telecommunications
Speech corpus
02 engineering and technology
Pronunciation
01 natural sciences
Mixed language
Rule-based machine translation
0103 physical sciences
0202 electrical engineering, electronic engineering, information engineering
010301 acoustics
Utterance
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Accession number :
- edsair.doi...........523ea08d42926ac7a851b4eda753ffe0
- Full Text :
- https://doi.org/10.1109/icassp.2018.8462180