Back to Search Start Over

Computation Techniques for Large Scale Undiscounted Markov Decision Processes.

Authors :
FLORIDA UNIV GAINESVILLE DEPT OF INDUSTRIAL AND SYSTEMS ENGINEERING
Hodgson,Thom J
Koehler,Gary J
FLORIDA UNIV GAINESVILLE DEPT OF INDUSTRIAL AND SYSTEMS ENGINEERING
Hodgson,Thom J
Koehler,Gary J
Source :
DTIC AND NTIS
Publication Year :
1978

Abstract

This paper considers computation techniques associated with the optimization of large scale Markov decision processes. Markov decision processes and successive approximation procedures are described. Then a procedure for scaling continuous time and renewal processes so that they are amenable to the second procedure is discussed. The effect of the scale factor value on the convergence rate of the procedure and insights into proper scale factor selection are given. Finally, various methods of achieving computational efficiency during execution of the optimization are considered.

Details

Database :
OAIster
Journal :
DTIC AND NTIS
Notes :
text/html, English
Publication Type :
Electronic Resource
Accession number :
edsoai.ocn831756691
Document Type :
Electronic Resource