An Early Functional and Performance Experiment of the MarFS Hybrid Storage EcoSystem

Authors :
Gary Grider
David Montoya
Hsing-bung Chen
Source :
IC2E
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

Many computing sites, LANL being one of them, have a requirement for long-term retention of mostly cold data. Although the main function of this storage tier is capacity, it also has a bandwidth requirement. For many years, tape was the most economical solution for this requirement. However, over time, data sets have grown faster than tape bandwidth has improved. We have now entered a regime in which disk is the more economically efficient medium for this storage tier. Data also increasingly dominates the computing world. There is a "sea" of data out there in many different formats, such as file, object, and key-value, that needs to be efficiently managed and effectively used. In this paper, we introduce a new hybrid storage system named MarFS. MarFS is a near-POSIX file system that uses scale-out commercial/cloud storage for data and many POSIX file systems for metadata services. MarFS is an approach to supporting a data lake for HPC: it sits on industry-based commodity storage hardware and is a software layer that provides a global namespace and near-POSIX semantics. MarFS can serve as an umbrella over a variety of underlying storage layers. We present the system architecture of the proposed MarFS near-POSIX file system, conduct early functional and performance test cases on MarFS's software components, and finally address the current deployment status and future development work on MarFS.

Details

Database :
OpenAIRE
Journal :
2017 IEEE International Conference on Cloud Engineering (IC2E)
Accession number :
edsair.doi...........d412eb11bff287a43a43744060100a7f