Back to Search Start Over

Document classification system based on HMM word map

Authors :
Tsimboukakis Nikolaos
Tambouratzis George
Source :
CSTST
Publication Year :
2008
Publisher :
ACM Press, 2008.

Abstract

In this article, a system based on Hidden Markov Models (HMM) for document organization is presented. The purpose of the system is the classification of a document collection in terms of document content. The system possesses a two-level hybrid connectionist architecture that comprises (i) an automatically created word map using a HMM, which functions as a feature extraction module and (ii) a supervised MLP-based classifier, which provides the final classification result. A series of experiments, which have been performed on Modern Greek text-only documents, is presented. These experiments illustrate the effectiveness of the proposed system.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology - CSTST '08
Accession number :
edsair.doi...........5751a19126cd523164d46eb8aef4ebf4