Back to Search
Start Over
Automatic Speech Recognition for Irish: the ABAIR-ÉIST System
- Publication Year :
- 2023
- Publisher :
- Apollo - University of Cambridge Repository, 2023.
-
Abstract
- This paper describes ÉIST, automatic speech recogniser for Irish, developed as part of the ongoing ABAIR initiative, combining (1) acoustic models, (2) pronunciation lexicons and (3) language models into a hybrid system. A priority for now is a system that can deal with the multiple diverse native-speaker dialects. Consequently, (1) was built using predominately native-speaker speech, which included earlier recordings used for synthesis development as well as more diverse recordings obtained using the MíleGlór platform. The pronunciation variation across the dialects is a particular challenge in the development of (2) and is explored by testing both Trans-dialect and Multi-dialect letter-to-sound rules. Two approaches to language modelling (3) are used in the hybrid system, a simple n-gram model and recurrent neural network lattice rescoring, the latter garnering impressive performance improvements. The system is evaluated using a test set that is comprised of both native and non-native speakers, which allows for some inferences to be made on the performance of the system on both cohorts.
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.doi...........05115378682edda7a702ad7462a2f1b3
- Full Text :
- https://doi.org/10.17863/cam.93269