Back to Search
Start Over
Spectral and cepstral audio noise reduction techniques in speech emotion recognition
- Source :
- ACM Multimedia, Proceedings of the 24th ACM International Conference on Multimedia (ACM MM), Proceedings of the 24th ACM International Conference on Multimedia (ACM MM), 2016, Amsterdam, Netherlands. pp.670-674, ⟨10.1145/2964284.2967306⟩
- Publication Year :
- 2020
-
Abstract
- International audience; Signal noise reduction can improve the performance of machine learning systems dealing with time signals such as audio. Real-life applicability of these recognition technologies requires the system to uphold its performance level in variable, challenging conditions such as noisy environments. In this contribution, we investigate audio signal denoising methods in cepstral and log-spectral domains and compare them with common implementations of standard techniques. The different approaches are first compared generally using averaged acoustic distance metrics. They are then applied to automatic recognition of spontaneous and natural emotions under simulated smartphone-recorded noisy conditions. Emotion recognition is implemented as support vector regression for continuous-valued prediction of arousal and valence on a realistic multimodal database. In the experiments, the proposed methods are found to generally outperform standard noise reduction algorithms.
- Subjects :
- Speech emotion recognition
Computer science
Noise reduction
Speech recognition
02 engineering and technology
Signal
030507 speech-language pathology & audiology
03 medical and health sciences
Cepstrum
0202 electrical engineering, electronic engineering, information engineering
Emotion recognition
Valence (psychology)
Denoising
Audio signal
business.industry
020206 networking & telecommunications
Pattern recognition
Support vector machine
Variable (computer science)
Computer Science::Sound
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
Artificial intelligence
ddc:004
0305 other medical science
business
[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- ACM Multimedia, Proceedings of the 24th ACM International Conference on Multimedia (ACM MM), Proceedings of the 24th ACM International Conference on Multimedia (ACM MM), 2016, Amsterdam, Netherlands. pp.670-674, ⟨10.1145/2964284.2967306⟩
- Accession number :
- edsair.doi.dedup.....b2d955764d20bbbf3b37c56b58f0e5ae
- Full Text :
- https://doi.org/10.1145/2964284.2967306⟩