Back to Search
Start Over
Reachability and safety objectives in Markov decision processes on long but finite horizons
- Source :
- Journal of Optimization Theory and Applications, 185(3), 945-965. Springer Verlag
- Publication Year :
- 2019
-
Abstract
- We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. Formally, strategy $\sigma$ overtakes another strategy $\sigma'$, if the probability of reaching the target state within horizon $t$ is larger under $\sigma$ than under $\sigma'$, for all sufficiently large $t\in\NN$. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence. Next we consider safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model. We finally discuss extensions of our results to two-player zero-sum perfect information games.
- Subjects :
- OVERTAKING
Mathematical optimization
Control and Optimization
Safety objective
Existential quantification
0102 computer and information sciences
02 engineering and technology
Management Science and Operations Research
01 natural sciences
Strategy
C73
Reachability
Overtaking
FOS: Mathematics
0202 electrical engineering, electronic engineering, information engineering
Overtaking optimality
OPTIMALITY
Mathematics - Optimization and Control
Mathematics
Structure (mathematical logic)
Perron-Frobenius eigenvalue
Reachability objective
Applied Mathematics
State (functional analysis)
Optimization and Control (math.OC)
010201 computation theory & mathematics
Theory of computation
020201 artificial intelligence & image processing
Markov decision process
Subjects
Details
- Language :
- English
- ISSN :
- 00223239
- Database :
- OpenAIRE
- Journal :
- Journal of Optimization Theory and Applications, 185(3), 945-965. Springer Verlag
- Accession number :
- edsair.doi.dedup.....e77027f67a55a4222347d427ee26438c