Back to Search Start Over

Non-MapReduce computing for intelligent big data analysis.

Authors :
Sun, Xudong
Zhao, Lingxiang
Chen, Jiaqi
Cai, Yongda
Wu, Dingming
Huang, Joshua Zhexue
Source :
Engineering Applications of Artificial Intelligence. Mar2024, Vol. 129, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

MapReduce is a popular paradigm in distributed computing, but it is not efficient when executing iterative algorithms over a distributed big dataset due to its heavy data communication overhead. Non-MapReduce computing is an alternative for improving computing efficiency and data scalability when using iterative algorithms to process big distributed datasets on clusters. In this paper, we investigate Non-MapReduce approach in distributed computing and use Spark implementations of machine learning algorithms to discuss the problems of MapReduce in executing iterative algorithms over a big distributed dataset and the advantages of Non-MapReduce for the same tasks. We present a method to build a new machine learning library made of sequential algorithms for distributed computing. We use experiment results to show comparisons of computing efficiency and data scalability of MapReduce and Non-MapReduce in executing six machine learning algorithms over big datasets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09521976
Volume :
129
Database :
Academic Search Index
Journal :
Engineering Applications of Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
175410931
Full Text :
https://doi.org/10.1016/j.engappai.2023.107648