Back to Search
Start Over
Improving state-action space exploration in reinforcement learning using geometric properties
- Source :
- CDC
- Publication Year :
- 2017
- Publisher :
- IEEE, 2017.
-
Abstract
- Learning a model or learning a policy that optimizes some objective function relies on data-sets that describe the behavior of the system. When such sets are unavailable or insufficient, additional data may be generated through new experiments (if feasible) or through simulations (if an accurate model is available). In this paper we describe a third alternative that is based on the availability of a qualitative model of the physical system. In particular, we show how the number of experiments used in reinforcement learning can be reduced by leveraging geometric properties of the system. The geometric properties are independent of any particular instantiation of the qualitative model. As an illustrative example, we apply our approach to a cart-pole system.
- Subjects :
- Computer science
business.industry
010102 general mathematics
Approximation algorithm
Markov process
01 natural sciences
Space exploration
010305 fluids & plasmas
Data modeling
symbols.namesake
0103 physical sciences
Trajectory
symbols
Process control
Reinforcement learning
Artificial intelligence
0101 mathematics
business
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2017 IEEE 56th Annual Conference on Decision and Control (CDC)
- Accession number :
- edsair.doi...........b3d72c4aca995ebcb74ad36af2ff4c67
- Full Text :
- https://doi.org/10.1109/cdc.2017.8264625