Back to Search Start Over

Automatic Speech Recognition for Irish: the ABAIR-ÉIST System

Authors :
Lonergan, L
Qian, M
Berthelsen, H
Murphy, A
Wendler, C
Ní Chiaráin, N
Gobl, C
Ní Chasaide, A
Publication Year :
2023
Publisher :
Apollo - University of Cambridge Repository, 2023.

Abstract

This paper describes ÉIST, automatic speech recogniser for Irish, developed as part of the ongoing ABAIR initiative, combining (1) acoustic models, (2) pronunciation lexicons and (3) language models into a hybrid system. A priority for now is a system that can deal with the multiple diverse native-speaker dialects. Consequently, (1) was built using predominately native-speaker speech, which included earlier recordings used for synthesis development as well as more diverse recordings obtained using the MíleGlór platform. The pronunciation variation across the dialects is a particular challenge in the development of (2) and is explored by testing both Trans-dialect and Multi-dialect letter-to-sound rules. Two approaches to language modelling (3) are used in the hybrid system, a simple n-gram model and recurrent neural network lattice rescoring, the latter garnering impressive performance improvements. The system is evaluated using a test set that is comprised of both native and non-native speakers, which allows for some inferences to be made on the performance of the system on both cohorts.

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........05115378682edda7a702ad7462a2f1b3
Full Text :
https://doi.org/10.17863/cam.93269