Back to Search
Start Over
A Fast Semi-Supervised Clustering Framework for Large-Scale Time Series Data.
- Source :
- IEEE Transactions on Systems, Man & Cybernetics. Systems; Jul2021, Vol. 51 Issue 7, p4201-4216, 16p
- Publication Year :
- 2021
-
Abstract
- Semi-supervised clustering algorithms have several limitations: 1) the computation complexity of them is very high, because calculating the similarity distances of pairs of examples is time-consuming; 2) traditional semi-supervised clustering methods have not considered how to make full use of must-link and cannot-link constraints. In the clustering, the contribution of a few pairwise constraints to the clustering performance is very limited, and some may negatively affect the outcome; and 3) these methods are not effective to handle high dimensional data, especially for time series data. Up to now, few work touched semi-supervised clustering on time series data. To efficiently cluster large-scale time series data, we first tackle contract time series clustering to produce the most accurate clustering results under a contracted time. We propose a semi-supervised time series clustering framework (STSC), which integrates a fast similarity measure and a constraint propagation approach. Based on the proposed framework, two valid semi-supervised clustering algorithms including fssK-means and fssDBSCAN are designed. Experiments on 11 datasets show that our proposed method is efficient and effective for clustering large-scale time series data. [ABSTRACT FROM AUTHOR]
- Subjects :
- TIME series analysis
Subjects
Details
- Language :
- English
- ISSN :
- 21682216
- Volume :
- 51
- Issue :
- 7
- Database :
- Complementary Index
- Journal :
- IEEE Transactions on Systems, Man & Cybernetics. Systems
- Publication Type :
- Academic Journal
- Accession number :
- 151249943
- Full Text :
- https://doi.org/10.1109/TSMC.2019.2931731