Back to Search Start Over

Application of Parallel Clustering Algorithm Based on R in Power Customer Classification

Authors :
Lipeng Zhu
Junfeng Qiao
Sen Pan
Source :
2019 IEEE 4th International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

The power load curve is describing the behavior of electricity consumption by customers. The curve characteristic information is obtained by mining and analyzing the power load curve, which can be used as the classification basis for power customers. Based on the massive characteristics of power load curve, the K-means clustering algorithm and Spark-based parallelization method are studied firstly, and on the basis of this, the parallel transformation of k-means algorithm is carried out. Then the preprocessing steps of the power load curve data and the detailed flow of the mining analysis are proposed, and the R parallelization algorithm is applied to the mining analysis of the actual power load curve data, through which the classification of electricity customers are achieved and verified. Finally, the R-based parallel clustering algorithm is briefly summarized.

Details

Database :
OpenAIRE
Journal :
2019 IEEE 4th International Conference on Cloud Computing and Big Data Analysis (ICCCBDA)
Accession number :
edsair.doi...........c825900f621e28695ae7a6b7c199d1bf
Full Text :
https://doi.org/10.1109/icccbda.2019.8725760