Back to Search
Start Over
Temporal Activity Path Based Character Correction in Heterogeneous Social Networks via Multimedia Sources
- Source :
- Advances in Multimedia, Vol 2018 (2018)
- Publication Year :
- 2018
- Publisher :
- Hindawi Limited, 2018.
-
Abstract
- Vast amount of multimedia data contains massive and multifarious social information which is used to construct large-scale social networks. In a complex social network, a character should be ideally denoted by one and only one vertex. However, it is pervasive that a character is denoted by two or more vertices with different names; thus it is usually considered as multiple, different characters. This problem causes incorrectness of results in network analysis and mining. The factual challenge is that character uniqueness is hard to correctly confirm due to lots of complicated factors, for example, name changing and anonymization, leading to character duplication. Early, limited research has shown that previous methods depended overly upon supplementary attribute information from databases. In this paper, we propose a novel method to merge the character vertices which refer to the same entity but are denoted with different names. With this method, we firstly build the relationship network among characters based on records of social activities participating, which are extracted from multimedia sources. Then we define temporal activity paths (TAPs) for each character over time. After that, we measure similarity of the TAPs for any two characters. If the similarity is high enough, the two vertices should be considered as the same character. Based on TAPs, we can determine whether to merge the two character vertices. Our experiments showed that this solution can accurately confirm character uniqueness in large-scale social network.
- Subjects :
- Similarity (geometry)
Article Subject
General Computer Science
Multimedia
Social network
Computer science
business.industry
02 engineering and technology
computer.software_genre
lcsh:QA75.5-76.95
Vertex (geometry)
Character (mathematics)
020204 information systems
Path (graph theory)
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
lcsh:Electronic computers. Computer science
Uniqueness
business
computer
Merge (linguistics)
Network analysis
Subjects
Details
- ISSN :
- 16875699 and 16875680
- Volume :
- 2018
- Database :
- OpenAIRE
- Journal :
- Advances in Multimedia
- Accession number :
- edsair.doi.dedup.....01ff789e099097c237ac95d265e479c9