Back to Search Start Over

Spark 异构集群负载均衡调度策略.

Authors :
陶宇炜
谢爱娟
Source :
Journal of Changzhou University (Natural Science Edition) / Changzhou Daxue Xuebao (Ziran Kexue Ban). Sep2024, Vol. 36 Issue 5, p61-70. 10p.
Publication Year :
2024

Abstract

Aiming at the problem that the Spark scalable distributed platform does not consider the computing capabilities of heterogeneous cluster nodes and load balance during job task scheduling, which affects the system performance, this paper constructs heterogeneous cluster nodes load balance scheduling policy under the Spark environment, Heterogeneous cluster node predicts the data distribution characteristics according to the sampling algorithm. divides the data into balancing partitions. According to the static load and dynamic load weight distribution, heterogeneous cluster node obtains the real-time load, and dynamically schedules job tasks. Finally. Wordcount, TeraSort, and K-means three benchmark tests were used to compare and analyze during heterogeneous cluster operation. Experimental results show that this algorithm can reduce the execution time significantly, and improve the performance of heterogeneous cluster. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
20950411
Volume :
36
Issue :
5
Database :
Academic Search Index
Journal :
Journal of Changzhou University (Natural Science Edition) / Changzhou Daxue Xuebao (Ziran Kexue Ban)
Publication Type :
Academic Journal
Accession number :
180051605
Full Text :
https://doi.org/10.3969/j.issn.2095-0411.2024.05.007