Research on Robustness of Emotion Recognition Under Environmental Noise Conditions
- Author
Yongming Huang, Jing Xiao, Kexin Tian, Ao Wu, and Guobao Zhang
- Subjects
Robust noise, speech emotion recognition, LW-WPCC feature, feature extraction algorithm, bio-modal emotion recognition, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
- Abstract
Noise is a problem that cannot be neglected if emotion recognition is to be put into practice. To address noise in speech, we design a new acoustic feature, the Long-time-frame Analysis Weighted Wavelet Packet Cepstral Coefficient (LW-WPCC), for better robustness. To extract the LW-WPCC feature, the best wavelet packet basis is first constructed. On this basis, a robust wavelet packet cepstral coefficient is extracted by combining short-time-frame analysis with long-time-frame analysis. We then introduce a sub-band spectral centroid (center-of-mass) parameter that is robust to additive noise, and propose an extraction algorithm for LW-WPCC. Speech emotion recognition experiments at different SNR levels show that the proposed method achieves better noise robustness and recognition performance. In addition, because facial expressions are not affected by acoustic noise, we perform bimodal emotion recognition on audio-visual data, using decision-level fusion to improve robustness. Experiments on audio-visual data are conducted to evaluate the effectiveness of our method. The results show that bimodal emotion recognition based on audio-visual data improves robustness and achieves better performance by drawing on different kinds of emotion data.
- Published
- 2019