Back to Search Start Over

Using Nearest Neighbor Information to Improve Cross-Language Text Classification

Authors :
Manuel Montes-y-Gómez
Adelina Escobar-Acevedo
Luis Villaseñor-Pineda
Source :
MICAI 2009: Advances in Artificial Intelligence ISBN: 9783642052576, MICAI
Publication Year :
2009
Publisher :
Springer Berlin Heidelberg, 2009.

Abstract

Cross-language text classification (CLTC) aims to take advantage of existing training data from one language to construct a classifier for another language. In addition to the expected translation issues, CLTC is also complicated by the cultural distance between both languages, which causes that documents belonging to the same category concern very different topics. This paper proposes a re-classification method which purpose is to reduce the errors caused by this phenomenon by considering information from the own target language documents. Experimental results in a news corpus considering three pairs of languages and four categories demonstrated the appropriateness of the proposed method, which could improve the initial classification accuracy by up to 11%.

Details

ISBN :
978-3-642-05257-6
ISBNs :
9783642052576
Database :
OpenAIRE
Journal :
MICAI 2009: Advances in Artificial Intelligence ISBN: 9783642052576, MICAI
Accession number :
edsair.doi...........a19c2a9dacca5b593fce0f11a3ccb1e1
Full Text :
https://doi.org/10.1007/978-3-642-05258-3_14