What's All the FUSS About Free Universal Sound Separation Data?
- Source :
- ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414774⟩
- Publication Year :
- 2020
- Publisher :
- arXiv, 2020.
- Abstract :
- We introduce the Free Universal Sound Separation (FUSS) dataset, a new corpus for experiments in separating mixtures of an unknown number of sounds from an open domain of sound types. The dataset consists of 23 hours of single-source audio data drawn from 357 classes, which are used to create mixtures of one to four sources. To simulate reverberation, an acoustic room simulator is used to generate impulse responses of box-shaped rooms with frequency-dependent reflective walls. Additional open-source data augmentation tools are also provided to produce new mixtures with different combinations of sources and room simulations. Finally, we introduce an open-source baseline separation model, based on an improved time-domain convolutional network (TDCN++), that can separate a variable number of sources in a mixture. This model achieves 9.8 dB of scale-invariant signal-to-noise ratio improvement (SI-SNRi) on mixtures with two to four sources, while reconstructing single-source inputs with 35.5 dB absolute SI-SNR. We hope this dataset will lower the barrier to new research and allow for fast iteration and application of novel techniques from other machine learning domains to the sound separation challenge.
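The two metrics quoted in the abstract, SI-SNR and SI-SNRi, can be illustrated with a minimal sketch. It assumes the commonly used definition in which the reference is rescaled by a least-squares projection before the signal-to-noise ratio is taken; the function names `si_snr` and `si_snr_improvement`, the `eps` stabilizer, and the absence of mean removal are illustrative choices, not details taken from this record or confirmed against the authors' released code.

```python
import numpy as np


def si_snr(estimate, reference, eps=1e-8):
    """Scale-invariant SNR in dB between a 1-D estimate and its reference.

    Assumed definition: 10*log10(||a*s||^2 / ||a*s - s_hat||^2), where
    a = <s_hat, s> / ||s||^2 is the least-squares scaling of the reference.
    """
    estimate = np.asarray(estimate, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)
    # Project the estimate onto the reference to get the rescaled target.
    scale = np.dot(estimate, reference) / (np.dot(reference, reference) + eps)
    target = scale * reference
    # Everything in the estimate that is not explained by the target is noise.
    noise = estimate - target
    return 10.0 * np.log10(
        (np.dot(target, target) + eps) / (np.dot(noise, noise) + eps)
    )


def si_snr_improvement(estimate, reference, mixture, eps=1e-8):
    """SI-SNRi: SI-SNR of the estimate minus SI-SNR of the unprocessed mixture."""
    return si_snr(estimate, reference, eps) - si_snr(mixture, reference, eps)
```

Under this definition, multiplying the estimate by any nonzero gain leaves the score essentially unchanged, so a separation model is not rewarded or penalized for output level. For a single-source input the mixture already equals the reference, which is presumably why the abstract reports an absolute SI-SNR for that case instead of an improvement.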
- Subjects :
- FOS: Computer and information sciences
Reverberation
Sound (cs.SD)
open-source datasets
Computer science
Sound separation
Separation (aeronautics)
02 engineering and technology
Impulse (physics)
Computer Science - Sound
Data modeling
[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Audio and Speech Processing (eess.AS)
0202 electrical engineering, electronic engineering, information engineering
Open domain
FOS: Electrical engineering, electronic engineering, information engineering
business.industry
Deep learning
deep learning
020206 networking & telecommunications
Universal sound separation
variable source separation
[INFO.INFO-SD]Computer Science [cs]/Sound [cs.SD]
020201 artificial intelligence & image processing
Artificial intelligence
Variable number
business
Algorithm
Electrical Engineering and Systems Science - Audio and Speech Processing
Details
- Database :
- OpenAIRE
- Journal :
- ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto/Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414774⟩
- Accession number :
- edsair.doi.dedup.....6c5c6ecdfe75c9b15c8478ab28ba2999
- Full Text :
- https://doi.org/10.48550/arxiv.2011.00803