Start Over

Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage

Authors :: Dominic Davies-Tagg
Ashiq Anjum
Ali Zahir
Lu Liu
Muhammad Usman Yaseen
Nick Antonopoulos
Source :: Big Data Mining and Analytics, Vol 7, Iss 2, Pp 371-398 (2024)
Publication Year :: 2024
Publisher :: Tsinghua University Press, 2024.
Abstract: Data temperature is a response to the ever-growing amount of data. These data have to be stored, but they have been observed that only a small portion of the data are accessed more frequently at any one time. This leads to the concept of hot and cold data. Cold data can be migrated away from high-performance nodes to free up performance for higher priority data. Existing studies classify hot and cold data primarily on the basis of data age and usage frequency. We present this as a limitation in the current implementation of data temperature. This is due to the fact that age automatically assumes that all new data have priority and that usage is purely reactive. We propose new variables and conditions that influence smarter decision-making on what are hot or cold data and allow greater user control over data location and their movement. We identify new metadata variables and user-defined variables to extend the current data temperature value. We further establish rules and conditions for limiting unnecessary movement of the data, which helps to prevent wasted input output (I/O) costs. We also propose a hybrid algorithm that combines existing variables and new variables and conditions into a single data temperature. The proposed system provides higher accuracy, increases performance, and gives greater user control for optimal positioning of data within multi-tiered storage solutions.

Subjects :: data temperature
hot and cold data
multi-tiered storage
metadata variable
multi-temperature system
Electronic computers. Computer science
QA75.5-76.95

Details

Language :: English
ISSN :: 20960654
Volume :: 7
Issue :: 2
Database :: Directory of Open Access Journals
Journal :: Big Data Mining and Analytics
Publication Type :: Academic Journal
Accession number :: edsdoj.7ef7f431ccd4ec7a4720b8d6425e0a2
Document Type :: article
Full Text :: https://doi.org/10.26599/BDMA.2023.9020039

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources