A 40 nm 144 mW VLSI Processor for Real-Time 60-kWord Continuous Speech Recognition.

Authors :
He, Guangji
Sugahara, Takanobu
Miyamoto, Yuki
Fujinaga, Tsuyoshi
Noguchi, Hiroki
Izumi, Shintaro
Kawaguchi, Hiroshi
Yoshimoto, Masahiko
Source :
IEEE Transactions on Circuits & Systems. Part I: Regular Papers. Aug2012, Vol. 59 Issue 8, p1656-1666. 11p.
Publication Year :
2012

Abstract

We have developed a low-power VLSI chip for 60-kWord real-time continuous speech recognition based on a context-dependent hidden Markov model (HMM). Our implementation includes a cache architecture that exploits the locality of speech recognition, beam pruning using a dynamic threshold, two-stage language model searching, highly parallel Gaussian mixture model (GMM) computation based on the mixture level, a variable-frame look-ahead scheme, and elastic pipeline operation between the Viterbi transition and GMM processing. The accuracy degradation of the important parameters in Viterbi computation is rigorously discussed. Results show that our implementation achieves a 95% bandwidth reduction (70.86 MB/s) and a 78% reduction in required frequency (126.5 MHz) compared with the reference Julius system. The test chip, fabricated using 40 nm CMOS technology, contains 1.9 M transistors for logic and 7.8 Mbit of on-chip memory. It dissipates 144 mW at 126.5 MHz and 1.1 V for 60-kWord real-time continuous speech recognition. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISSN :
1549-8328
Volume :
59
Issue :
8
Database :
Academic Search Index
Journal :
IEEE Transactions on Circuits & Systems. Part I: Regular Papers
Publication Type :
Periodical
Accession number :
79680192
Full Text :
https://doi.org/10.1109/TCSI.2012.2206501