Back to Search
Start Over
Subjectively Interesting Connecting Trees
- Source :
- Machine Learning and Knowledge Discovery in Databases ISBN: 9783319712451, ECML/PKDD (2), Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2017, Skopje, Macedonia, September 18--22, 2017, Proceedings, Part II
- Publication Year :
- 2017
- Publisher :
- Springer International Publishing, 2017.
-
Abstract
- Consider a large network, and a user-provided set of query nodes between which the user wishes to explore relations. For example, a researcher may want to connect research papers in a citation network, an analyst may wish to connect organized crime suspects in a communication network, or an internet user may want to organize their bookmarks given their location in the world wide web. A natural way to show how query nodes are related is in the form of a tree in the network that connects them. However, in sufficiently dense networks, most such trees will be large or somehow trivial (e.g. involving high degree nodes) and thus not insightful. In this paper, we define and investigate the new problem of mining subjectively interesting trees connecting a set of query nodes in a network, i.e., trees that are highly surprising to the specific user at hand. Using information theoretic principles, we formalize the notion of interestingness of such trees mathematically, taking in account any prior beliefs the user has specified about the network. We then propose heuristic algorithms to find the best trees efficiently, given a specified prior belief model. Modeling the user's prior belief state is however not necessarily computationally tractable. Yet, we show how a highly generic class of prior beliefs, namely about individual node degrees in combination with the density of particular sub-networks, can be dealt with in a tractable manner. Such types of beliefs can be used to model knowledge of a partial or total order of the network nodes, e.g. where the nodes represent events in time (such as papers in a citation network). An empirical validation of our methods on a large real network evaluates the different heuristics and validates the interestingness of the given trees.
- Subjects :
- Technology and Engineering
Information theory
Theoretical computer science
Subjective interestingness
Exploratory Data Mining
business.industry
Computer science
Heuristic
Node (networking)
02 engineering and technology
Telecommunications network
Set (abstract data type)
Graph pattern mining
Tree (data structure)
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
The Internet
Heuristics
business
Graphs
Subjects
Details
- ISBN :
- 978-3-319-71245-1
978-3-319-71246-8 - ISSN :
- 03029743 and 16113349
- ISBNs :
- 9783319712451 and 9783319712468
- Database :
- OpenAIRE
- Journal :
- Machine Learning and Knowledge Discovery in Databases ISBN: 9783319712451, ECML/PKDD (2), Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2017, Skopje, Macedonia, September 18--22, 2017, Proceedings, Part II
- Accession number :
- edsair.doi.dedup.....9eb7488c1aba9d3a44eaa824c8ba9196
- Full Text :
- https://doi.org/10.1007/978-3-319-71246-8_4