Back to Search Start Over

A system dedicated to Polish automatic speech recognition – overview of solutions.

Authors :
PONDEL-SYCZ, Karolina
BILSKI, Piotr
Source :
Bulletin of the Polish Academy of Sciences: Technical Sciences. Jul2024, Vol. 72 Issue 4, p1-13. 13p.
Publication Year :
2024

Abstract

The paper presents the analysis of modern Artificial Intelligence algorithms for the automated system supporting human beings during their conversation in Polish language. Their task is to perform Automatic Speech Recognition (ASR) and process it further, for instance fill the computer-based form or perform the Natural Language Processing (NLP) to assign the conversation to one of predefined categories. The state-of-the-art review is required to select the optimal set of tools to process speech in the difficult conditions, which degrade accuracy of ASR. The paper presents the top-level architecture of the system applicable for the task. Characteristics of Polish language are discussed. Next, existing ASR solutions and architectures with the End-To-End (E2E) Deep Neural Network (DNN) based ASR models are presented in detail. Differences between Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN) and transformers in the context of ASR technology are also discussed. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02397528
Volume :
72
Issue :
4
Database :
Academic Search Index
Journal :
Bulletin of the Polish Academy of Sciences: Technical Sciences
Publication Type :
Academic Journal
Accession number :
178470665
Full Text :
https://doi.org/10.24425/bpasts.2024.149818