Start Over

Deep neural networks for automatic speech processing: a survey from large corpora to limited data.

Authors :: Roger, Vincent
Farinas, Jérôme
Pinquier, Julien
Source :: EURASIP Journal on Audio Speech & Music Processing; 8/17/2022, Vol. 2022 Issue 1, p1-15, 15p
Publication Year :: 2022
Abstract: Most state-of-the-art speech systems use deep neural networks (DNNs). These systems require a large amount of data to be learned. Hence, training state-of-the-art frameworks on under-resourced speech challenges are difficult tasks. As an example, a challenge could be the limited amount of data to model impaired speech. Furthermore, acquiring more data and/or expertise is time-consuming and expensive. In this paper, we focus on the following speech processing tasks: automatic speech recognition, speaker identification, and emotion recognition. To assess the problem of limited data, we firstly investigate state-of-the-art automatic speech recognition systems, as this is the hardest task (due to the wide variability in each language). Next, we provide an overview of techniques and tasks requiring fewer data. In the last section, we investigate few-shot techniques by interpreting under-resourced speech as a few-shot problem. In that sense, we propose an overview of few-shot techniques and the possibility of using such techniques for the speech problems addressed in this survey. It is true that the reviewed techniques are not well adapted for large datasets. Nevertheless, some promising results from the literature encourage the usage of such techniques for speech processing. [ABSTRACT FROM AUTHOR]

Subjects :: ARTIFICIAL neural networks
SPEECH processing systems
COMPUTATIONAL linguistics
AUTOMATIC speech recognition
SPEECH
EMOTION recognition
CORPORA
WORD recognition

Details

Language :: English
ISSN :: 16874714
Volume :: 2022
Issue :: 1
Database :: Complementary Index
Journal :: EURASIP Journal on Audio Speech & Music Processing
Publication Type :: Academic Journal
Accession number :: 158562950
Full Text :: https://doi.org/10.1186/s13636-022-00251-w

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Deep neural networks for automatic speech processing: a survey from large corpora to limited data.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Deep neural networks for automatic speech processing: a survey from large corpora to limited data.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources