Back to Search Start Over

多任务实时声音事件检测卷积模型与复合数据扩增.

Authors :
刘 臣
倪仁絰
周立欣
Source :
Application Research of Computers / Jisuanji Yingyong Yanjiu. Apr2023, Vol. 40 Issue 4, p1080-1087. 8p.
Publication Year :
2023

Abstract

Most of the existing sound event detection researches analyzes from recorded audio, and these models had massive parameters with low efficiency, which is not suitable for real-time detection. This paper proposed an efficient convolutional neural networks model for multi-task and real-time sound event detection. The model considered triggering and detecting task as multi-task learning mission, and its structure combined densely connection, ghost module and SE block. Moreover, this paper proposed a data augmentation method that combined with audio shifting, random cropping and SpecAugment. The experimental results show that the mean prediction accuracies of the model on ESC-10 and Urbansound8K datasets are at least 2% higher than latest baseline models, but also has better robustness as well as less parameters and memory. Additionally, multi-task learning saves computation, and due to the reusing of feature maps, the model can quickly and accurately feedback the results. Compared with the traditional data augmentation methods, the proposed model achieves better performance and robustness. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10013695
Volume :
40
Issue :
4
Database :
Academic Search Index
Journal :
Application Research of Computers / Jisuanji Yingyong Yanjiu
Publication Type :
Academic Journal
Accession number :
163102339
Full Text :
https://doi.org/10.19734/j.issn.1001-3695.2022.07.0415