1. Fast Recovery MapReduce (FAR-MR) to accelerate failure recovery in big data applications
- Author
Yongqing Zhu, Juniarto Samsudin, Renuga Kanagavelu, Theint Theint Aye, Weiwen Zhang, Rick Siow Mong Goh, and Long Wang
- Subjects
020203 distributed computing; Computer science; business.industry; Distributed computing; Node (networking); Big data; Fault tolerance; 02 engineering and technology; Fast recovery; Cluster (spacecraft); Theoretical Computer Science; Task (computing); Hardware and Architecture; Distributed data store; Data_FILES; 0202 electrical engineering, electronic engineering, information engineering; business; Software; Information Systems
- Abstract
The existing Hadoop MapReduce fault tolerance strategy causes computing jobs to suffer a high performance penalty during failure recovery. In this paper, we propose Fast Recovery MapReduce (FAR-MR) to improve MapReduce performance during failure recovery. FAR-MR includes a novel fault tolerance strategy that combines distributed checkpointing with a proactive push mechanism to support fast recovery from both task failure and node failure. With distributed checkpointing, the computing progress of each task is periodically recorded as checkpoints and kept in distributed data storage. During failure recovery, the recovered task can obtain the last recorded progress of the failed task from the distributed storage. In addition, the proactive push mechanism enables the computing results of map tasks to be proactively transmitted to the nodes hosting the reduce tasks of the same computing job. When a failure happens, the partial output already pushed to the reducer nodes can be used by the reduce tasks without being recomputed. FAR-MR allows a failed task to be recovered efficiently at any node in the cluster. The performance evaluation shows that FAR-MR can improve computing job performance by up to 62% and 45% compared to Hadoop MapReduce in the cases of task failure recovery and node failure recovery, respectively.
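The checkpoint-and-resume idea described in the abstract can be illustrated with a minimal sketch. It is not the FAR-MR implementation: `CheckpointStore`, `run_map_task`, the `interval` and `fail_at` parameters, and the in-memory dictionary standing in for distributed storage are all assumptions made for illustration, and the proactive push mechanism is not modelled.

```python
# Illustrative sketch only: CheckpointStore and run_map_task are hypothetical
# names, not part of FAR-MR or Hadoop.

class CheckpointStore:
    """In-memory stand-in for the distributed data storage."""

    def __init__(self):
        self._data = {}

    def save(self, task_id, offset, partial):
        # Copy so later mutation by the task does not alter the checkpoint.
        self._data[task_id] = (offset, dict(partial))

    def load(self, task_id):
        # A task with no checkpoint starts from record 0 with empty state.
        offset, partial = self._data.get(task_id, (0, {}))
        return offset, dict(partial)


def run_map_task(task_id, records, store, interval=3, fail_at=None):
    """Count records by key, checkpointing every `interval` records.

    Resumes from the last checkpoint if one exists. `fail_at` injects a
    simulated failure just before the record at that index is processed.
    Returns (counts, number of records processed in this attempt).
    """
    offset, counts = store.load(task_id)
    processed = 0
    for i in range(offset, len(records)):
        if fail_at is not None and i == fail_at:
            raise RuntimeError(f"simulated failure at record {i}")
        counts[records[i]] = counts.get(records[i], 0) + 1
        processed += 1
        if (i + 1) % interval == 0:
            store.save(task_id, i + 1, counts)
    return counts, processed


store = CheckpointStore()
records = ["a", "b", "a", "c", "b", "a", "c", "a"]

try:
    run_map_task("map-0", records, store, fail_at=7)
except RuntimeError:
    pass  # the first attempt dies; a replacement attempt could run anywhere

# The replacement attempt resumes from the last checkpoint, not from record 0.
counts, reprocessed = run_map_task("map-0", records, store)
print(counts, reprocessed)
```

In this sketch the replacement attempt only reprocesses the records that came after the last checkpoint, which is the source of the recovery speed-up the abstract attributes to distributed checkpointing. A shorter `interval` reduces the work repeated after a failure at the cost of more frequent writes to storage.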
- Published
- 2018