Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system
- Source :
- IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006
- Publication Year :
- 2006
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2006.
Abstract
- This paper describes the progress made in the transcription of broadcast news (BN) and conversational telephone speech (CTS) within the combined BBN/LIMSI system from May 2002 to September 2004. During that period, BBN and LIMSI collaborated to produce significant reductions in the word error rate (WER), as directed by the aggressive goals of the Defense Advanced Research Projects Agency (DARPA) Effective, Affordable, Reusable Speech-to-text (EARS) program. The paper focuses on general modeling techniques that led to recognition accuracy improvements, as well as engineering approaches that enabled efficient use of large amounts of training data and fast decoding architectures. Special attention is given to efforts to integrate components of the BBN and LIMSI systems, including the tradeoff between speed and accuracy for various system combination strategies. Results on the EARS progress test sets show that the combined BBN/LIMSI system achieved relative WER reductions of 47% and 51% on the BN and CTS domains, respectively.
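Note on the metric: the abstract reports relative, not absolute, WER reductions. The sketch below is a minimal illustration of how WER and a relative reduction are conventionally computed. It is not the scoring tool used in the EARS evaluations; the function names and the example WER values (20.0% and 10.6%) are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction of the baseline error rate that was eliminated."""
    return (baseline_wer - new_wer) / baseline_wer


# Hypothetical values for illustration only (not results from the paper):
# a drop from 20.0% to 10.6% WER is a 47% relative reduction.
example = relative_reduction(0.200, 0.106)
```

A relative reduction of 47% therefore means the new error rate is 53% of the baseline error rate, whatever the baseline happened to be.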
- Subjects :
- Acoustics and Ultrasonics
business.industry
Computer science
Speech recognition
Speech coding
Word error rate
020206 networking & telecommunications
Speech synthesis
02 engineering and technology
[INFO] Computer Science [cs]
Broadcasting
Speech processing
computer.software_genre
Cable television
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Telephony
Electrical and Electronic Engineering
Transcription (software)
0305 other medical science
business
computer
ComputingMilieux_MISCELLANEOUS
Details
- ISSN :
- 1558-7916
- Volume :
- 14
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Audio, Speech and Language Processing
- Accession number :
- edsair.doi.dedup.....7290b91cb5bb690beb1ed1eca6943d5b
- Full Text :
- https://doi.org/10.1109/tasl.2006.878257