Back to Search Start Over

基于无向分块加权图的无模式实体识别方法研究.

Authors :
杨 宁
卢 菁
邵 清
刘 丛
Source :
Application Research of Computers / Jisuanji Yingyong Yanjiu. Jan2021, Vol. 38 Issue 1, p169-174. 6p.
Publication Year :
2021

Abstract

The blocking approach ignored the weight of block keys and ambiguity between blocking keys,leading to low accuracy. This paper proposed a schema-agnostic entity resolution method based on undirected weighted graph. The method extracted attributes from data sources,combined attribute information entropy and TF-IDF to get clustering attributes and established an unified block scheme. Through the relationship between clustering attribute weight and block key, it gave each group of block key a certain weight. The weight and the cooccurrence frequency of the edge were multiplied and weighted to form the undirected block weighted graph. Finally, it used the pruning scheme to prune the edges. The problem of multi-attribute and block key ambiguity in data was solved and increased accuracy. Experiments on seven real data sets show that the method is effective and scalable. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10013695
Volume :
38
Issue :
1
Database :
Academic Search Index
Journal :
Application Research of Computers / Jisuanji Yingyong Yanjiu
Publication Type :
Academic Journal
Accession number :
147932164
Full Text :
https://doi.org/10.19734/j.issn.1001-3695.2019.09.0526