Back to Search
Start Over
Approaches to topic identification on the switchboard corpus
- Source :
- ICASSP (1)
- Publication Year :
- 2002
- Publisher :
- IEEE, 2002.
-
Abstract
- Topic identification (TID) is the automatic classification of speech messages into one of a known set of possible topics. The TID task can be view as having three principal components: 1) event generation, 2) keyword event selection, and 3) topic modeling. Using data from the Switchboard corpus, the authors present experimental results for various approaches to the TID problem and compare the relative effectiveness of each. In addition, they examine the effect of keyword set size on identification accuracy and gauge the loss in performance when mismatched topic modeling and keyword selection schemes are used. >
- Subjects :
- Topic model
Computer science
Event (computing)
business.industry
Speech processing
computer.software_genre
Task (project management)
Set (abstract data type)
Identification (information)
Principal component analysis
Selection (linguistics)
Artificial intelligence
business
computer
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
- Accession number :
- edsair.doi...........265553631956b69796488487857efbb0