A category-driven approach to deriving domain specific subsets of Wikipedia

Authors :
Anton V. Korshunov
Denis Yu. Turdakov
Jinguk Jeong
Minho Lee
Changsung Moon
Source :
Труды Института системного программирования РАН (Proceedings of the Institute for System Programming of the RAS), Vol 21, Iss 0 (2018)
Publication Year :
2018
Publisher :
Ivannikov Institute for System Programming of the Russian Academy of Sciences, 2018.

Abstract

While many researchers attempt to build different kinds of ontologies from Wikipedia, the possibility of deriving a high-quality, domain-specific subset of Wikipedia using its own category structure remains undervalued. In this paper we demonstrate the need for such processing and propose an appropriate technique. As a result, the size of the knowledge base for our text processing framework has been reduced by more than an order of magnitude, while the precision of disambiguating musical metadata (ID3 tags) has increased from 64% to 98%.

Details

Language :
English, Russian
ISSN :
2079-8156 and 2220-6426
Volume :
21
Issue :
0
Database :
Directory of Open Access Journals
Journal :
Труды Института системного программирования РАН (Proceedings of the Institute for System Programming of the RAS)
Publication Type :
Academic Journal
Accession number :
edsdoj.45d214b2ce495bb4b1429f68ee6f26
Document Type :
article