Back to Search Start Over

A unit-based, cost-efficient scheduler for heterogeneous Hadoop systems.

Authors :
Javanmardi, Abdol Karim
Yaghoubyan, S. Hadi
Bagherifard, Karamollah
Nejatian, Samad
Parvin, Hamid
Source :
Journal of Supercomputing. 2021, Vol. 77 Issue 1, p1-22. 22p.
Publication Year :
2021

Abstract

A significant amount of research in the field of job scheduling is carried out in Hadoop. However, there is still need for research to overcome some challenges regarding scheduling jobs in Hadoop clusters. There are various factors affecting the performance of scheduling policies like data volume (storage), data source format (different data), speed (data rate), security and privacy, cost, connection and data sharing. To reach a better utilization of resources and managing big data, scheduling policies have been designed. In this paper, an algorithm has been presented that can run on heterogeneous Hadoop clusters and runs job in parallel. This algorithm first distributes data based on the performance of the nodes and then schedules the jobs according to their cost of execution and decreases the cost of executing the jobs. The presented algorithm offers better performance in terms of execution time, cost and locality compared to FIFO and Fair schedulers. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09208542
Volume :
77
Issue :
1
Database :
Academic Search Index
Journal :
Journal of Supercomputing
Publication Type :
Academic Journal
Accession number :
147909189
Full Text :
https://doi.org/10.1007/s11227-020-03256-4