Start Over

Value Function Discovery in Markov Decision Processes With Evolutionary Algorithms.

Authors :: Onderwater, Martijn
Bhulai, Sandjai
van der Mei, Rob
Source :: IEEE Transactions on Systems, Man & Cybernetics. Systems; Sep2016, Vol. 46 Issue 9, p1190-1201, 12p
Publication Year :: 2016
Abstract: In this paper, we introduce a novel method for the discovery of value functions for Markov decision processes (MDPs). This method, which we call value function discovery (VFD), is based on ideas from the evolutionary algorithm field. VFDs key feature is that it discovers descriptions of value functions that are algebraic in nature. This feature is unique, because the descriptions include the model parameters of the MDP. The algebraic expression of the value function discovered by VFD can be used in several scenarios, e.g., conversion to a policy (with one-step policy improvement) or control of systems with time-varying parameters. The work in this paper is a first step toward exploring potential usage scenarios of discovered value functions. We give a detailed description of VFD and illustrate its application on an example MDP. For this MDP, we let VFD discover an algebraic description of a value function that closely resembles the optimal value function. The discovered value function is then used to obtain a policy, which we compare numerically to the optimal policy of the MDP. The resulting policy shows near-optimal performance on a wide range of model parameters. Finally, we identify and discuss future application scenarios of discovered value functions. [ABSTRACT FROM PUBLISHER]

Subjects :: PARTIALLY observable Markov decision processes
EVOLUTIONARY algorithms
CYBERNETICS

Details

Language :: English
ISSN :: 21682216
Volume :: 46
Issue :: 9
Database :: Complementary Index
Journal :: IEEE Transactions on Systems, Man & Cybernetics. Systems
Publication Type :: Academic Journal
Accession number :: 117596611
Full Text :: https://doi.org/10.1109/TSMC.2015.2475716

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Value Function Discovery in Markov Decision Processes With Evolutionary Algorithms.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Value Function Discovery in Markov Decision Processes With Evolutionary Algorithms.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources