Back to Search Start Over

A Fast Semi-Supervised Clustering Framework for Large-Scale Time Series Data.

Authors :
He, Guoliang
Pan, Yanzhou
Xia, Xuewen
He, Jinrong
Peng, Rong
Xiong, Neal N.
Source :
IEEE Transactions on Systems, Man & Cybernetics. Systems; Jul2021, Vol. 51 Issue 7, p4201-4216, 16p
Publication Year :
2021

Abstract

Semi-supervised clustering algorithms have several limitations: 1) the computation complexity of them is very high, because calculating the similarity distances of pairs of examples is time-consuming; 2) traditional semi-supervised clustering methods have not considered how to make full use of must-link and cannot-link constraints. In the clustering, the contribution of a few pairwise constraints to the clustering performance is very limited, and some may negatively affect the outcome; and 3) these methods are not effective to handle high dimensional data, especially for time series data. Up to now, few work touched semi-supervised clustering on time series data. To efficiently cluster large-scale time series data, we first tackle contract time series clustering to produce the most accurate clustering results under a contracted time. We propose a semi-supervised time series clustering framework (STSC), which integrates a fast similarity measure and a constraint propagation approach. Based on the proposed framework, two valid semi-supervised clustering algorithms including fssK-means and fssDBSCAN are designed. Experiments on 11 datasets show that our proposed method is efficient and effective for clustering large-scale time series data. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
TIME series analysis

Details

Language :
English
ISSN :
21682216
Volume :
51
Issue :
7
Database :
Complementary Index
Journal :
IEEE Transactions on Systems, Man & Cybernetics. Systems
Publication Type :
Academic Journal
Accession number :
151249943
Full Text :
https://doi.org/10.1109/TSMC.2019.2931731