1. A Learning Architecture for Scheduling Workflow Applications in the Cloud
- Author
-
Enda Barrett, Jim Duggan, Enda Howley, and ~
- Subjects
Workflow management software ,Computer science ,Computational costs ,Distributed computing ,Processor scheduling ,Cloud computing ,Storage sites ,Quality of service requirements ,Optimal scheduling ,Workflow engine ,Workflow technology ,Scheduling (computing) ,Quality of service ,Pay-per-use fashion ,Precedence constraints ,Computational resources ,Cloud workflow scheduling ,Utility-type market model ,Schedules ,Scheduling ,Workflow scheduling ,business.industry ,Markov processes ,Computational processing ,Software architecture ,Storage management ,Environmental state ,Biological cells ,Data transmission costs ,Genetic algorithms ,Scheduling workflow applications ,Workflow execution process ,Bayesian model learning ,Workflow ,Genetic algorithm ,Scientific workflows ,Workflow based applications dependency ,Workflow schedulers ,Minimisation ,Markov decision process ,Learning architecture ,business ,Workflow management system ,Workflow tasks - Abstract
Conference paper The scheduling of workflow applications involves the mapping of individual workflow tasks to computational resources, based on a range of functional and non-functional quality of service requirements. Workflow applications such as scientific workflows often require extensive computational processing and generate significant amounts of experimental data. The emergence of cloud computing has introduced a utility-type market model, where computational resources of varying capacities can be procured on demand, in a pay-per-use fashion. In workflow based applications dependencies exist amongst tasks which requires the generation of schedules in accordance with defined precedence constraints. These constraints pose a difficult planning problem, where tasks must be scheduled for execution only once all their parent tasks have completed. In general the two most important objectives of workflow schedulers are the minimisation of both cost and make span. The cost of workflow execution consists of both computational costs incurred from processing individual tasks, and data transmission costs. With scientific workflows potentially large amounts of data must be transferred between compute and storage sites. This paper proposes a novel cloud workflow scheduling approach which employs a Markov Decision Process to optimally guide the workflow execution process depending on environmental state. In addition the system employs a genetic algorithm to evolve workflow schedules. The overall architecture is presented, and initial results indicate the potential of this approach for developing viable workflow schedules on the Cloud. Science Foundation Ireland non-peer-reviewed
- Published
- 2011