Back to Search
Start Over
Initializing partition-optimization algorithms.
- Source :
-
IEEE/ACM transactions on computational biology and bioinformatics [IEEE/ACM Trans Comput Biol Bioinform] 2009 Jan-Mar; Vol. 6 (1), pp. 144-57. - Publication Year :
- 2009
-
Abstract
- Clustering datasets is a challenging problem needed in a wide array of applications. Partition-optimization approaches, such as k-means or expectation-maximization (EM) algorithms, are sub-optimal and find solutions in the vicinity of their initialization. This paper proposes a staged approach to specifying initial values by finding a large number of local modes and then obtaining representatives from the most separated ones. Results on test experiments are excellent. We also provide a detailed comparative assessment of the suggested algorithm with many commonly-used initialization approaches in the literature. Finally, the methodology is applied to two datasets on diurnal microarray gene expressions and industrial releases of mercury.
- Subjects :
- Arabidopsis genetics
Arabidopsis metabolism
Chemical Hazard Release statistics & numerical data
Circadian Rhythm genetics
Escherichia coli Proteins genetics
Humans
Industrial Waste statistics & numerical data
Methylmercury Compounds
Normal Distribution
Oligonucleotide Array Sequence Analysis
Starch biosynthesis
Starch genetics
Algorithms
Cluster Analysis
Computational Biology methods
Data Interpretation, Statistical
Pattern Recognition, Automated methods
Subjects
Details
- Language :
- English
- ISSN :
- 1557-9964
- Volume :
- 6
- Issue :
- 1
- Database :
- MEDLINE
- Journal :
- IEEE/ACM transactions on computational biology and bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 19179708
- Full Text :
- https://doi.org/10.1109/TCBB.2007.70244