Back to Search
Start Over
Document classification system based on HMM word map
- Source :
- CSTST
- Publication Year :
- 2008
- Publisher :
- ACM Press, 2008.
-
Abstract
- In this article, a system based on Hidden Markov Models (HMM) for document organization is presented. The purpose of the system is the classification of a document collection in terms of document content. The system possesses a two-level hybrid connectionist architecture that comprises (i) an automatically created word map using a HMM, which functions as a feature extraction module and (ii) a supervised MLP-based classifier, which provides the final classification result. A series of experiments, which have been performed on Modern Greek text-only documents, is presented. These experiments illustrate the effectiveness of the proposed system.
- Subjects :
- business.industry
Computer science
Document classification
Feature extraction
Modern Greek
Linear classifier
Pattern recognition
computer.software_genre
ComputingMethodologies_PATTERNRECOGNITION
Connectionism
Multilayer perceptron
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Artificial intelligence
business
Hidden Markov model
Classifier (UML)
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology - CSTST '08
- Accession number :
- edsair.doi...........5751a19126cd523164d46eb8aef4ebf4