Back to Search
Start Over
Subjectively Interesting Subgroup Discovery on Real-valued Targets
- Source :
- Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018), 1352-1355, STARTPAGE=1352;ENDPAGE=1355;TITLE=Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018), 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), ICDE
- Publication Year :
- 2018
-
Abstract
- Deriving insights from high-dimensional data is one of the core problems in data mining. The difficulty mainly stems from the fact that there are exponentially many variable combinations to potentially consider, and there are infinitely many if we consider weighted combinations, even for linear combinations. Hence, an obvious question is whether we can automate the search for interesting patterns and visualizations. In this paper, we consider the setting where a user wants to learn as efficiently as possible about real-valued attributes. For example, to understand the distribution of crime rates in different geographic areas in terms of other (numerical, ordinal and/or categorical) variables that describe the areas. We introduce a method to find subgroups in the data that are maximally informative (in the formal Information Theoretic sense) with respect to a single or set of real-valued target attributes. The subgroup descriptions are in terms of a succinct set of arbitrarily-typed other attributes. The approach is based on the Subjective Interestingness framework FORSIED to enable the use of prior knowledge when finding most informative non-redundant patterns, and hence the method also supports iterative data mining.<br />12 pages, 10 figures, 2 tables, conference submission
- Subjects :
- FOS: Computer and information sciences
ta113
Information retrieval
Technology and Engineering
Exploratory data mining
Computer science
Exploratory Data Mining
Information Theory (cs.IT)
Computer Science - Information Theory
Exceptional Model Mining
Machine Learning (stat.ML)
02 engineering and technology
Pattern Mining
16. Peace & justice
Variable (computer science)
Statistics - Machine Learning
020204 information systems
Core (graph theory)
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Subgroup Discovery
Subjective Interestingness
Subjects
Details
- Language :
- English
- ISBN :
- 978-1-5386-5520-7
- ISSN :
- 10844627 and 2375026X
- ISBNs :
- 9781538655207
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018), 1352-1355, STARTPAGE=1352;ENDPAGE=1355;TITLE=Proceedings of the 34th IEEE International Conference on Data Engineering (ICDE 2018), 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), ICDE
- Accession number :
- edsair.doi.dedup.....2c7f6ab1e99dd9c695f25dc62c2385e5