Start Over

Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments.

Authors :: Wang, Chunxi
Jia, Maoshen
Zhang, Xinfeng
Source :: EURASIP Journal on Audio Speech & Music Processing; 10/12/2023, Vol. 2023 Issue 1, p1-16, 16p
Publication Year :: 2023
Abstract: In recent years, the speaker-independent, single-channel speech separation problem has made significant progress with the development of deep neural networks (DNNs). However, separating the speech of each interested speaker from an environment that includes the speech of other speakers, background noise, and room reverberation remains challenging. In order to solve this problem, a speech separation method for a noisy reverberation environment is proposed. Firstly, the time-domain end-to-end network structure of a deep encoder/decoder dual-path neural network is introduced in this paper for speech separation. Secondly, to make the model not fall into local optimum during training, a loss function stretched optimal scale-invariant signal-to-noise ratio (SOSISNR) was proposed, inspired by the scale-invariant signal-to-noise ratio (SISNR). At the same time, in order to make the training more appropriate to the human auditory system, the joint loss function is extended based on short-time objective intelligibility (STOI). Thirdly, an alignment operation is proposed to reduce the influence of time delay caused by reverberation on separation performance. Combining the above methods, the subjective and objective evaluation metrics show that this study has better separation performance in complex sound field environments compared to the baseline methods. [ABSTRACT FROM AUTHOR]

Subjects :: ARTIFICIAL neural networks
SPEECH
ACOUSTIC field
AUDITORY pathways
SIGNAL-to-noise ratio

Details

Language :: English
ISSN :: 16874714
Volume :: 2023
Issue :: 1
Database :: Complementary Index
Journal :: EURASIP Journal on Audio Speech & Music Processing
Publication Type :: Academic Journal
Accession number :: 173430602
Full Text :: https://doi.org/10.1186/s13636-023-00307-5

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources