Back to Search Start Over

Approaches to topic identification on the switchboard corpus

Authors :
Herbert Gish
John McDonough
Kenney Ng
Phillippe Jeanrenaud
Jan Robin Rohlicek
Source :
ICASSP (1)
Publication Year :
2002
Publisher :
IEEE, 2002.

Abstract

Topic identification (TID) is the automatic classification of speech messages into one of a known set of possible topics. The TID task can be view as having three principal components: 1) event generation, 2) keyword event selection, and 3) topic modeling. Using data from the Switchboard corpus, the authors present experimental results for various approaches to the TID problem and compare the relative effectiveness of each. In addition, they examine the effect of keyword set size on identification accuracy and gauge the loss in performance when mismatched topic modeling and keyword selection schemes are used. >

Details

Database :
OpenAIRE
Journal :
Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
Accession number :
edsair.doi...........265553631956b69796488487857efbb0