Back to Search Start Over

Comparative study of selected clustering algorithm based on Monte Carlo simulation study.

Authors :
Sharif, Shamshuritawati
Mohamad-Shamsuri, Nurshaziana
Source :
AIP Conference Proceedings. 2024, Vol. 3123 Issue 1, p1-6. 6p.
Publication Year :
2024

Abstract

Clustering is a technique that involves partitioning data to group similar data points in a larger dataset with unknown groups. It is very useful in managing large datasets that have no predetermined classification, making it a crucial tool for data analysis. Based on clustering algorithm, the results of clustering or object segmentation depend on the combination of distance and linkage methods used. Therefore, it is important to investigate which clustering method(s) outperform others for different distance measures and linkage methods. A simulation study for 5000 iteration is conducted in evaluating the statistical performance of several distance and linkages methods. The average value of cophenetic correlation coefficient is computed for two different distributions: multivariate normal distribution and gamma distribution using MATLAB programming in measuring the quality of clustering solution. In addition, simulated data are deployed under p = 3 and p = 5, with sample sizes n = 30, 50, and 100. Overall, these results can serve as a helpful guideline for researchers when choosing appropriate distance and linkage methods to make better decisions. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0094243X
Volume :
3123
Issue :
1
Database :
Academic Search Index
Journal :
AIP Conference Proceedings
Publication Type :
Conference
Accession number :
179273860
Full Text :
https://doi.org/10.1063/5.0224175