Back to Search Start Over

Applications of provenance in performance prediction and data storage optimisation

Authors :
Simon Woodman
Paul Watson
Hugo Hiden
Source :
Future Generation Computer Systems. 75:299-309
Publication Year :
2017
Publisher :
Elsevier BV, 2017.

Abstract

Accurate and comprehensive storage of provenance information is a basic requirement for modern scientific computing. A significant effort in recent years has developed robust theories and standards for the representation of these traces across a variety of execution platforms. Whilst these are necessary to enable repeatability they do not exploit the captured information to its full potential. This data is increasingly being captured from applications hosted on Cloud Computing platforms, which offer large scale computing resources without significant up front costs. Medical applications, which generate large datasets are also suited to cloud computing as the practicalities of storing and processing such data locally are becoming increasingly challenging. This paper shows how provenance can be captured from medical applications, stored using a graph database and then used to answer audit questions and enable repeatability. This static provenance will then be combined with performance data to predict future workloads, inform decision makers and reduce latency. Finally, cost models which are based on real world cloud computing costs will be used to determine optimum strategies for data retention over potentially extended periods of time.

Details

ISSN :
0167739X
Volume :
75
Database :
OpenAIRE
Journal :
Future Generation Computer Systems
Accession number :
edsair.doi.dedup.....a09452d21ef4fc74bf9d7b1d8eb0b986