Back to Search
Start Over
Spark 异构集群负载均衡调度策略.
- Source :
-
Journal of Changzhou University (Natural Science Edition) / Changzhou Daxue Xuebao (Ziran Kexue Ban) . Sep2024, Vol. 36 Issue 5, p61-70. 10p. - Publication Year :
- 2024
-
Abstract
- Aiming at the problem that the Spark scalable distributed platform does not consider the computing capabilities of heterogeneous cluster nodes and load balance during job task scheduling, which affects the system performance, this paper constructs heterogeneous cluster nodes load balance scheduling policy under the Spark environment, Heterogeneous cluster node predicts the data distribution characteristics according to the sampling algorithm. divides the data into balancing partitions. According to the static load and dynamic load weight distribution, heterogeneous cluster node obtains the real-time load, and dynamically schedules job tasks. Finally. Wordcount, TeraSort, and K-means three benchmark tests were used to compare and analyze during heterogeneous cluster operation. Experimental results show that this algorithm can reduce the execution time significantly, and improve the performance of heterogeneous cluster. [ABSTRACT FROM AUTHOR]
Details
- Language :
- Chinese
- ISSN :
- 20950411
- Volume :
- 36
- Issue :
- 5
- Database :
- Academic Search Index
- Journal :
- Journal of Changzhou University (Natural Science Edition) / Changzhou Daxue Xuebao (Ziran Kexue Ban)
- Publication Type :
- Academic Journal
- Accession number :
- 180051605
- Full Text :
- https://doi.org/10.3969/j.issn.2095-0411.2024.05.007