1. ALBERT: An automatic learning based execution and resource management system for optimizing Hadoop workload in clouds.
- Author
-
Chen, Chen-Chun, Wang, Kai-Siang, Hsiao, Yu-Tung, and Chou, Jerry
- Subjects
- *
DEEP learning , *RESOURCE management , *ARTIFICIAL neural networks , *BATCH processing , *SEARCH algorithms , *JOB hunting - Abstract
Hadoop is a popular computing framework designed to deliver timely and cost-effective data processing on a large cluster of commodity machines. It relieves the burden of the programmers dealing with distributed programming, and an ecosystem of Big Data solutions has developed around it. However, Hadoop's job execution time can greatly depend on its runtime configurations and resource selections. Given the more than 100 job configuration settings provided by Hadoop, and diverse resource instance options in a cloud or virtualized computing environment, running Hadoop jobs still requires a substantial amount of expertise and experience. To address this challenge, we apply a deep neural network to predict Hadoop's job time based on historical execution data, and propose optimization methods to reduce job execution time and cost. The results show that our prediction method achieves almost 90% time prediction accuracy and clearly outperforms three other state-of-the-art regression-based prediction methods. Based on the time prediction, our proposed configuration search method and job scheduling algorithm successfully shorten the execution time of a single Hadoop job by more than a factor of 2 and reduce the time of processing a batch of Hadoop jobs by 40%∼65%. • We aim to optimize the performance and cost of running Hadoop MapReduce workload in a cloud environment with on-demand resources. • We propose a deep learning approach to predict the execution time of MapReduce jobs under any given resource allocation and Hadoop configuration setting. • We proposed a 2-step approximate search algorithm for finding the optimal resource allocation and execution configuration of individual job, and a 2D variable size bin packing algorithm for maximize overall resource utilization. • The experiment results show that our prediction method achieved almost 90% of time prediction accuracy and clearly out-performed three other state-of-art machine learning prediction methods. • Our optimization strategies successfully shorten the execution time of a single Hadoop job by more than a factor of 2 and reduce the time of processing a batch of Hadoop jobs by 40%∼65%. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF