Back to Search Start Over

Learning robotic manipulation skills with multiple semantic goals by conservative curiosity-motivated exploration.

Authors :
Changlin Han
Zhiyong Peng
Yadong Liu
Jingsheng Tang
Yang Yu
Zongtan Zhou
Source :
Frontiers in Neurorobotics; 3/7/2023, Vol. 17, p1-14, 14p
Publication Year :
2023

Abstract

Reinforcement learning (RL) empowers the agent to learn robotic manipulation skills autonomously. Compared with traditional single-goal RL, semantic-goal-conditioned RL expands the agent capacity to accomplish multiple semantic manipulation instructions. However, due to sparsely distributed semantic goals and sparse-reward agent-environment interactions, the hard exploration problem arises and impedes the agent training process. In traditional RL, curiosity-motivated exploration shows effectiveness in solving the hard exploration problem. However, in semantic-goal-conditioned RL, the performance of previous curiosity-motivated methods deteriorates, which we propose is because of their two defects: uncontrollability and distraction. To solve these defects, we propose a conservative curiosity-motivated method named mutual information motivation with hybrid policy mechanism (MIHM). MIHM mainly contributes two innovations: the decoupled-mutual-information- based intrinsic motivation, which prevents the agent from being motivated to explore dangerous states by uncontrollable curiosity; the precisely trained and automatically switched hybrid policy mechanism, which eliminates the distraction from the curiosity-motivated policy and achieves the optimal utilization of exploration and exploitation. Compared with four state-of-the-art curiosity-motivated methods in the sparse-reward robotic manipulation task with 35 valid semantic goals, including stacks of 2 or 3 objects and pyramids, our MIHM shows the fastest learning speed. Moreover, MIHM achieves the highest 0.9 total success rate, which is up to 0.6 in other methods. Throughout all the baseline methods, our MIHM is the only one that achieves to stack three objects. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
16625218
Volume :
17
Database :
Complementary Index
Journal :
Frontiers in Neurorobotics
Publication Type :
Academic Journal
Accession number :
163675199
Full Text :
https://doi.org/10.3389/fnbot.2023.1089270