Back to Search Start Over

Thompson Sampling for Stochastic Control: The Continuous Parameter Case.

Authors :
Banjevic, Dragan
Kim, Michael Jong
Source :
IEEE Transactions on Automatic Control; Oct2019, Vol. 64 Issue 10, p4137-4152, 16p
Publication Year :
2019

Abstract

Recently, Thompson sampling has been shown to achieve good theoretical performance guarantees for stochastic control problems with parameter uncertainty when the state, control, and parameter spaces are all finite. Much less is known however about the performance of Thompson sampling when applied to continuous or more general spaces, which constitutes an important class of problems in practice. In this paper, we study Thompson sampling when applied to a broad class of average cost stochastic control problems where the state, control, and parameter spaces are all general measurable spaces. The main contributions of our paper are establishing theoretical performance guarantees for Thompson sampling as measured by: first, expected posterior sampling error; and second, average per period regret. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00189286
Volume :
64
Issue :
10
Database :
Complementary Index
Journal :
IEEE Transactions on Automatic Control
Publication Type :
Periodical
Accession number :
138896397
Full Text :
https://doi.org/10.1109/TAC.2019.2895253