Back to Search Start Over

Reducing word error rate on conversational speech from the Switchboard corpus

Authors :
Herbert Gish
John McDonough
U. Chaudhari
Phillippe Jeanrenaud
Kenney Ng
E. Eide
Man-Hung Siu
Source :
ICASSP
Publication Year :
2002
Publisher :
IEEE, 2002.

Abstract

Speech recognition of conversational speech is a difficult task. The performance levels on the Switchboard corpus had been in the vicinity of 70% word error rate. In this paper, we describe the results of applying a variety of modifications to our speech recognition system and we show their impact on improving the performance on conversational speech. These modifications include the use of more complex models, trigram language models, and cross-word triphone models. We also show the effect of using additional acoustic training on the recognition performance. Finally, we present an approach to dealing with the abundance of short words, and examine how the variable speaking rate found in conversational speech impacts on the performance. Currently, the level of performance is at the vicinity of 50% error, a significant improvement over recent levels.

Details

Database :
OpenAIRE
Journal :
1995 International Conference on Acoustics, Speech, and Signal Processing
Accession number :
edsair.doi...........cbcc332afcbdb398095a74758b4685be
Full Text :
https://doi.org/10.1109/icassp.1995.479271