Back to Search
Start Over
Application of Parallel Clustering Algorithm Based on R in Power Customer Classification
- Source :
- 2019 IEEE 4th International Conference on Cloud Computing and Big Data Analysis (ICCCBDA).
- Publication Year :
- 2019
- Publisher :
- IEEE, 2019.
-
Abstract
- The power load curve is describing the behavior of electricity consumption by customers. The curve characteristic information is obtained by mining and analyzing the power load curve, which can be used as the classification basis for power customers. Based on the massive characteristics of power load curve, the K-means clustering algorithm and Spark-based parallelization method are studied firstly, and on the basis of this, the parallel transformation of k-means algorithm is carried out. Then the preprocessing steps of the power load curve data and the detailed flow of the mining analysis are proposed, and the R parallelization algorithm is applied to the mining analysis of the actual power load curve data, through which the classification of electricity customers are achieved and verified. Finally, the R-based parallel clustering algorithm is briefly summarized.
Details
- Database :
- OpenAIRE
- Journal :
- 2019 IEEE 4th International Conference on Cloud Computing and Big Data Analysis (ICCCBDA)
- Accession number :
- edsair.doi...........c825900f621e28695ae7a6b7c199d1bf
- Full Text :
- https://doi.org/10.1109/icccbda.2019.8725760