Harmonic-aware tri-path convolution recurrent network for singing voice separation.
- Source :
- JASA express letters [JASA Express Lett] 2023 Jul 01; Vol. 3 (7).
- Publication Year :
- 2023
Abstract
- Temporal coherence and spectral regularity are critical cues for human auditory streaming processes and are considered in many sound separation models. Examples include the Conv-TasNet model, which focuses on temporal coherence by using short-length kernels to analyze sound, and the dual-path convolution recurrent network (DPCRN) model, which uses two recurrent neural networks (RNNs) to analyze general patterns along the temporal and spectral dimensions of a spectrogram. By extending DPCRN with an additional inter-band RNN, a harmonic-aware tri-path convolution recurrent network model is proposed. Evaluation results on public datasets show that this addition can further improve the separation performance of DPCRN. (© 2023 Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).)
Details
- Language :
- English
- ISSN :
- 2691-1191
- Volume :
- 3
- Issue :
- 7
- Database :
- MEDLINE
- Journal :
- JASA express letters
- Publication Type :
- Academic Journal
- Accession number :
- 37404168
- Full Text :
- https://doi.org/10.1121/10.0019997