Back to Search Start Over

Neuro-Optimal Control for Discrete Stochastic Processes via a Novel Policy Iteration Algorithm.

Authors :
Liang, Mingming
Wang, Ding
Liu, Derong
Source :
IEEE Transactions on Systems, Man & Cybernetics. Systems. Nov2020, Vol. 50 Issue 11, p3972-3985. 14p.
Publication Year :
2020

Abstract

In this paper, a novel policy iteration adaptive dynamic programming (ADP) algorithm is presented which is called “local policy iteration ADP algorithm” to obtain the optimal control for discrete stochastic processes. In the proposed local policy iteration ADP algorithm, the iterative decision rules are updated in a local space of the whole state space. Hence, we can significantly reduce the computational burden for the CPU in comparison with the conventional policy iteration algorithm. By analyzing the convergence properties of the proposed algorithm, it is shown that the iterative value functions are monotonically nonincreasing. Besides, the iterative value functions can converge to the optimum in a local policy space. In addition, this local policy space will be described in detail for the first time. Under a few weak constraints, it is also shown that the iterative value function will converge to the optimal performance index function of the global policy space. Finally, a simulation example is presented to validate the effectiveness of the developed method. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
21682216
Volume :
50
Issue :
11
Database :
Academic Search Index
Journal :
IEEE Transactions on Systems, Man & Cybernetics. Systems
Publication Type :
Academic Journal
Accession number :
146472564
Full Text :
https://doi.org/10.1109/TSMC.2019.2907991