Back to Search
Start Over
Robust topological policy iteration for infinite horizon bounded Markov Decision Processes.
- Source :
-
International Journal of Approximate Reasoning . Feb2019, Vol. 105, p287-304. 18p. - Publication Year :
- 2019
-
Abstract
- Abstract Markov Decision Processes (mdp s) are commonly used to solve sequential decision problems. A less restrictive model is the Bounded-parameter mdp (bmdp) that allows: (i) the transition function to be expressed in terms of probability intervals and (ii) reasoning about a robust solution, i.e., the best solution under the worst model. In this paper, we propose the Robust Topological Policy Iteration (rtpi) algorithm which is a new policy iteration algorithm for infinite horizon bmdp s based on a partition of the state space. The empirical results show that the more structured the domain, the better is the performance of rtpi. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 0888613X
- Volume :
- 105
- Database :
- Academic Search Index
- Journal :
- International Journal of Approximate Reasoning
- Publication Type :
- Periodical
- Accession number :
- 134151570
- Full Text :
- https://doi.org/10.1016/j.ijar.2018.12.004