Back to Search
Start Over
Applications of provenance in performance prediction and data storage optimisation
- Source :
- Future Generation Computer Systems. 75:299-309
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- Accurate and comprehensive storage of provenance information is a basic requirement for modern scientific computing. A significant effort in recent years has developed robust theories and standards for the representation of these traces across a variety of execution platforms. Whilst these are necessary to enable repeatability they do not exploit the captured information to its full potential. This data is increasingly being captured from applications hosted on Cloud Computing platforms, which offer large scale computing resources without significant up front costs. Medical applications, which generate large datasets are also suited to cloud computing as the practicalities of storing and processing such data locally are becoming increasingly challenging. This paper shows how provenance can be captured from medical applications, stored using a graph database and then used to answer audit questions and enable repeatability. This static provenance will then be combined with performance data to predict future workloads, inform decision makers and reduce latency. Finally, cost models which are based on real world cloud computing costs will be used to determine optimum strategies for data retention over potentially extended periods of time.
- Subjects :
- 0301 basic medicine
Graph database
Exploit
Database
Computer Networks and Communications
business.industry
Computer science
Cloud computing
02 engineering and technology
computer.software_genre
Data science
03 medical and health sciences
030104 developmental biology
Workflow
Hardware and Architecture
Computer data storage
e-Science
0202 electrical engineering, electronic engineering, information engineering
Performance prediction
020201 artificial intelligence & image processing
business
computer
Software
Subjects
Details
- ISSN :
- 0167739X
- Volume :
- 75
- Database :
- OpenAIRE
- Journal :
- Future Generation Computer Systems
- Accession number :
- edsair.doi.dedup.....a09452d21ef4fc74bf9d7b1d8eb0b986