Back to Search
Start Over
Comparison of Cepstral Normalization Techniques in Whispered Speech Recognition
- Source :
- Advances in Electrical and Computer Engineering, Vol 17, Iss 1, Pp 21-26 (2017)
- Publication Year :
- 2017
- Publisher :
- Stefan cel Mare University of Suceava, 2017.
-
Abstract
- This article presents an analysis of different cepstral normalization techniques in automatic recognition of whispered and bimodal speech (speech+whisper). In these experiments, conventional GMM-HMM speech recognizer was used as speaker-dependant automatic speech recognition system with special Whi-Spe corpus containing utterance recordings in normally phonated speech and whisper. The following normalization techniques were tested and compared: CMN (Cepstral Mean Normalization), CVN (Cepstral Variance Normalization), MVN (Cepstral Mean and Variance Normalization), CGN (Cepstral Gain Normalization) and quantile-based dynamic normalization techniques such as QCN and QCN-RASTA. The experimental results show to what extent each of these cepstral normalization techniques can improve whisper recognition accuracy in mismatched train/test scenario. The best result is obtained using CMN in combination with inverse filtering which provides an average 39.9 percent improvement in whisper recognition accuracy for all tested speakers.
Details
- Language :
- English
- ISSN :
- 15827445 and 18447600
- Volume :
- 17
- Issue :
- 1
- Database :
- Directory of Open Access Journals
- Journal :
- Advances in Electrical and Computer Engineering
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.9cd6d513acde440db1d31508dcb3fb32
- Document Type :
- article
- Full Text :
- https://doi.org/10.4316/AECE.2017.01004