Back to Search Start Over

Pairwise optimized Rocchio algorithm for text categorization

Authors :
Miao, Yun-Qian
Kamel, Mohamed
Source :
Pattern Recognition Letters. Jan2011, Vol. 32 Issue 2, p375-382. 8p.
Publication Year :
2011

Abstract

Abstract: This paper examines the Rocchio algorithm and its application in text categorization. Existing approaches using global parameters optimization of Rocchio algorithm result in choosing one fixed prototype representing each category for multi-category text categorization problems. Therefore, they have limited discriminating power on different category’s distribution and their parameter optimization methods are based on weak representation ability of the negative samples consisting of several categories. We present a pairwise optimized Rocchio algorithm, which dynamically adjusts the prototype position between pairs of categories. Experiments were conducted on three benchmark corpora, the 20-Newsgroup, Reuters-21578 and TDT2. The results confirm that our proposed pairwise method achieves encouraging performance improvement over the conventional Rocchio method. A comparative study with the top notch text classifier Support Vector Machine (SVM) also shows the pairwise Rocchio method achieves competitive results. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01678655
Volume :
32
Issue :
2
Database :
Academic Search Index
Journal :
Pattern Recognition Letters
Publication Type :
Academic Journal
Accession number :
55379360
Full Text :
https://doi.org/10.1016/j.patrec.2010.09.018