Back to Search Start Over

Joint Deep Neural Network for Single-Channel Speech Separation on Masking-Based Training Targets

Authors :
Peng Chen
Binh Thien Nguyen
Yuting Geng
Kenta Iwai
Takanobu Nishiura
Source :
IEEE Access, Vol 12, Pp 152036-152044 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

Single-channel speech separation can be adopted in many applications. Time-frequency (T-F) masking is an effective method for single-channel speech separation. With advancements in deep learning, T-F masks have become used as a training target, achieving notable separation results. Among the numerous masks that have been proposed, the ideal binary mask (IBM), ideal ratio mask (IRM), Wiener filter (WF) and spectral magnitude mask (SMM) are commonly used and have proven effective, though their separation performance varies depending on the speech mixture and separation model. The existing approach mainly utilizes a single network to approximate the mask of the target speech. However, in mixed speech, there are segments where speech is mixed with other speech, segments where speech is mixed with silent intervals, and segments where high signal-to-noise ratio (SNR) speech is mixed due to pauses and variations in the speakers’ intonation and emphasis. In this paper, we attempt to use different networks to handle speech segments containing various mixtures. In addition to the existing network, we introduce a network (using the Rectified Linear Unit as activation functions) to specifically address segments containing a mixture of speech and silence, as well as segments with high SNR speech mixtures. We conducted evaluation experiments on the speech separation of two speakers using the four aforementioned masks as training targets. The performance improvements observed in the evaluation experiments demonstrate the effectiveness of our proposed method based on the joint network compared to the conventional method based on the single network.

Details

Language :
English
ISSN :
21693536 and 18648541
Volume :
12
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.396c8c1864854165a624388e4c742608
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2024.3479292