101. A new approach to hierarchical clustering and structuring of data with Self-Organizing Maps
- Author
-
Alvin Chan, Gerhard Widmer, and Elias Pampalk
- Subjects
Self-organizing map ,Computer science ,business.industry ,Stability (learning theory) ,computer.software_genre ,Machine learning ,Theoretical Computer Science ,Hierarchical clustering ,Exploratory data analysis ,Artificial Intelligence ,Computer Vision and Pattern Recognition ,Data mining ,Artificial intelligence ,Granularity ,Hierarchical network model ,Hierarchical clustering of networks ,business ,Heuristics ,computer - Abstract
The Self-Organizing Map (SOM) is a powerful tool for exploratory data analysis which has been employed in a wide range of data mining applications. We present a novel approach to reveal the inherent hierarchical structure of data using multiple SOMs together with heuristics which optimize the stability. In particular, we address shortcomings of the Growing Hierarchical Self-Organizing Map (GHSOM) regarding the decision which areas in the hierarchical structure need to be represented by a finer granularity and which areas do not. We introduce the Tension and Mapping Ratio extension to exploit specific characteristics of the SOM based on the topology preservation. As a main result, in contrast to the GHSOM, the inherent hierarchical structure of the data is revealed without requiring the user to define a threshold parameter which controls the map sizes of the individual SOMs. We evaluate our approach using data from real-world data mining projects in the music domain.