1. 基于配对特征融合的声学场景分类方法.
- Author
-
沈昕昊, 陈嘉烨, and 宋晓宁
- Subjects
- *
CONVOLUTIONAL neural networks , *PROBLEM solving , *SPECTROGRAMS , *GENERALIZATION , *ALGORITHMS , *DATA compression - Abstract
In recent years, single device sound scene classification has achieved good results, however, progress in multi-device sound scene classification has been slow. To solve the problem of large differences in the number of multi-device samples, this paper proposed a pairwise feature fusion algorithm. By calculating the differences in the spectrogram for each pair of paired samples and averaging these differences after accumulation, obtaining the average spectral characteristics of each device for the conversion of device samples. The algorithm effectively improved the generalization ability of the model while increasing the number of device samples. Meanwhile, in order to obtain global information, it proposed a lightweight attention module, which could make the model focus on the training of the whole sound sequence information on the basis of reduced computation by performing self-attention operations on the input features after compression in the frequency domain. The experimental results show that the proposed algorithm has better advantages in terms of model size and classification accuracy compared with other methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF