Back to Search Start Over

Comparison of Cepstral Normalization Techniques in Whispered Speech Recognition

Authors :
GROZDIC, D.
JOVICIC, S.
SUMARAC PAVLOVIC, D.
GALIC, J.
MARKOVIC, B.
Source :
Advances in Electrical and Computer Engineering, Vol 17, Iss 1, Pp 21-26 (2017)
Publication Year :
2017
Publisher :
Stefan cel Mare University of Suceava, 2017.

Abstract

This article presents an analysis of different cepstral normalization techniques in automatic recognition of whispered and bimodal speech (speech+whisper). In these experiments, conventional GMM-HMM speech recognizer was used as speaker-dependant automatic speech recognition system with special Whi-Spe corpus containing utterance recordings in normally phonated speech and whisper. The following normalization techniques were tested and compared: CMN (Cepstral Mean Normalization), CVN (Cepstral Variance Normalization), MVN (Cepstral Mean and Variance Normalization), CGN (Cepstral Gain Normalization) and quantile-based dynamic normalization techniques such as QCN and QCN-RASTA. The experimental results show to what extent each of these cepstral normalization techniques can improve whisper recognition accuracy in mismatched train/test scenario. The best result is obtained using CMN in combination with inverse filtering which provides an average 39.9 percent improvement in whisper recognition accuracy for all tested speakers.

Details

Language :
English
ISSN :
15827445 and 18447600
Volume :
17
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Advances in Electrical and Computer Engineering
Publication Type :
Academic Journal
Accession number :
edsdoj.9cd6d513acde440db1d31508dcb3fb32
Document Type :
article
Full Text :
https://doi.org/10.4316/AECE.2017.01004