Author: "Li, Zhuozhao" / Topic: random access memory - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Li, Zhuozhao"' showing total 2 results

Start Over Author "Li, Zhuozhao" Topic random access memory

2 results on '"Li, Zhuozhao"'

1. Measuring Scale-Up and Scale-Out Hadoop with Remote and Local File Systems and Selecting the Best Platform.

Author: Li, Zhuozhao and Shen, Haiying
Subjects: *COMPUTING platforms, *BIG data, *COMMUNICATIONS software, *BUFFER storage (Computer science), *MANAGEMENT
Abstract: MapReduce is a popular computing model for parallel data processing on large-scale datasets, which can vary from gigabytes to terabytes and petabytes. Though Hadoop MapReduce normally uses Hadoop Distributed File System (HDFS) local file system, it can be configured to use a remote file system. Then, an interesting question is raised: for a given application, which is the best running platform among the different combinations of scale-up and scale-out Hadoop with remote and local file systems. However, there has been no previous research on how different types of applications (e.g., CPU-intensive, data-intensive) with different characteristics (e.g., input data size) can benefit from the different platforms. Thus, in this paper, we conduct a comprehensive performance measurement of different applications on scale-up and scale-out clusters configured with HDFS and a remote file system (i.e., OFS), respectively. We identify and study how different job characteristics (e.g., input data size, the number of file reads/writes, and the amount of computations) affect the performance of different applications on the different platforms. Based on the measurement results, we also propose a performance prediction model to help users select the best platforms that lead to the minimum latency. Our evaluation using a Facebook workload trace demonstrates the effectiveness of our prediction model. This study is expected to provide a guidance for users to choose the best platform to run different applications with different characteristics in the environment that provides both remote and local storage, such as HPC cluster and cloud environment. [ABSTRACT FROM PUBLISHER]
Published: 2017
Full Text: View/download PDF

2. An Exploration of Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements.

Author: Li, Zhuozhao, Shen, Haiying, Ligon, Walter, and Denton, Jeffrey
Subjects: *DATA transmission systems, *COMPUTER architecture, *RANDOM access memory, *DATABASES
Abstract: Scale-up machines perform better for jobs with small and median (KB, MB) data sizes, while scale-out machines perform better for jobs with large (GB, TB) data size. Since a workload usually consists of jobs with different data size levels, we propose building a hybrid Hadoop architecture that includes both scale-up and scale-out machines, which however is not trivial. The first challenge is workload data storage. Thousands of small data size jobs in a workload may overload the limited local disks of scale-up machines. Jobs from scale-up and scale-out machines may both request the same set of data, which leads to data transmission between the machines. The second challenge is to automatically schedule jobs to either scale-up or scale-out cluster to achieve the best performance. We conduct a thorough performance measurement of different applications on scale-up and scale-out clusters, configured with Hadoop Distributed File System (HDFS) and a remote file system (i.e., OFS), respectively. We find that using OFS rather than HDFS can solve the data storage challenge. Also, we identify the factors that determine the performance differences on the scale-up and scale-out clusters and their cross points to make the choice. Accordingly, we design and implement the hybrid scale-up/out Hadoop architecture. Our trace-driven experimental results show that our hybrid architecture outperforms both the traditional Hadoop architecture with HDFS and with OFS in terms of job completion time, throughput and job failure rate. [ABSTRACT FROM PUBLISHER]
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Li, Zhuozhao"'

1. Measuring Scale-Up and Scale-Out Hadoop with Remote and Local File Systems and Selecting the Best Platform.

2. An Exploration of Designing a Hybrid Scale-Up/Out Hadoop Architecture Based on Performance Measurements.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

2 results on '"Li, Zhuozhao"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources