Back to Search Start Over

Compulsory Flow Q-Learning: an RL algorithm for robot navigation based on partial-policy and macro-states

Authors :
Anna Helena Reali Costa
Valdinei Freire da Silva
Source :
Journal of the Brazilian Computer Society v.15 n.3 2009, Journal of the Brazilian Computer Society, Sociedade Brasileira de Computação (SBC), instacron:UFRGS, Journal of the Brazilian Computer Society, Volume: 15, Issue: 3, Pages: 65-75, Published: SEP 2009
Publication Year :
2009
Publisher :
Springer Science and Business Media LLC, 2009.

Abstract

Reinforcement Learning is carried out on-line, through trial-and-error interactions of the agent with the environment, which can be very time consuming when considering robots. In this paper we contribute a new learning algorithm, CFQLearning, which uses macro-states, a low-resolution discretisation of the state space, and a partial-policy to get around obstacles, both of them based on the complexity of the environment structure. The use of macro-states avoids convergence of algorithms, but can accelerate the learning process. In the other hand, partial-policies can guarantee that an agent fulfils its task, even through macro-state. Experiments show that the CFQ-Learning performs a good balance between policy quality and learning rate.

Details

ISSN :
01046500
Volume :
15
Database :
OpenAIRE
Journal :
Journal of the Brazilian Computer Society
Accession number :
edsair.doi.dedup.....11abdf8246b6ef601f302f59d6a2b873
Full Text :
https://doi.org/10.1590/s0104-65002009000300007