Back to Search
Start Over
多任务实时声音事件检测卷积模型与复合数据扩增.
- Source :
-
Application Research of Computers / Jisuanji Yingyong Yanjiu . Apr2023, Vol. 40 Issue 4, p1080-1087. 8p. - Publication Year :
- 2023
-
Abstract
- Most of the existing sound event detection researches analyzes from recorded audio, and these models had massive parameters with low efficiency, which is not suitable for real-time detection. This paper proposed an efficient convolutional neural networks model for multi-task and real-time sound event detection. The model considered triggering and detecting task as multi-task learning mission, and its structure combined densely connection, ghost module and SE block. Moreover, this paper proposed a data augmentation method that combined with audio shifting, random cropping and SpecAugment. The experimental results show that the mean prediction accuracies of the model on ESC-10 and Urbansound8K datasets are at least 2% higher than latest baseline models, but also has better robustness as well as less parameters and memory. Additionally, multi-task learning saves computation, and due to the reusing of feature maps, the model can quickly and accurately feedback the results. Compared with the traditional data augmentation methods, the proposed model achieves better performance and robustness. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Chinese
- ISSN :
- 10013695
- Volume :
- 40
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- Application Research of Computers / Jisuanji Yingyong Yanjiu
- Publication Type :
- Academic Journal
- Accession number :
- 163102339
- Full Text :
- https://doi.org/10.19734/j.issn.1001-3695.2022.07.0415