1. Top-k Similarity Join in Heterogeneous Information Networks.
- Author
- Xiong, Yun; Zhu, Yangyong; and Yu, Philip S.
- Subjects
- INFORMATION networks, COMPUTER networks, DATA mining, SEMANTICS, DATA integration, INFORMATION retrieval
- Abstract
- As a newly emerging network model, heterogeneous information networks (HINs) have received growing attention. Many data mining tasks have been explored in HINs, including clustering, classification, and similarity search. Similarity join is a fundamental operation required for many problems, and it is attracting attention from various applications on network data, such as friend recommendation, link prediction, and online advertising. Although similarity join has been well studied in homogeneous networks, it has not yet been studied in heterogeneous networks. In particular, no existing research on similarity join takes into account the different semantic meanings behind paths, and almost all of it ignores the heterogeneity and diversity of HINs. In this paper, we propose a path-based similarity join (PS-join) method to return the top $k$
- Published
- 2015