Back to Search Start Over

Multi actor hierarchical attention critic with RNN-based feature extraction

Authors :
Chao Xue
Gongju Wang
Chenran Zhao
Huanhuan Yang
Dianxi Shi
Yajie Wang
Hao Jiang
Yongjun Zhang
Shaowu Yang
Source :
Neurocomputing. 471:79-93
Publication Year :
2022
Publisher :
Elsevier BV, 2022.

Abstract

Deep reinforcement learning has made significant progress in multi-agent tasks in recent years. However, most previous studies focus on solving fully cooperative tasks, which do not perform well in mixed tasks. In mixed tasks, the agent needs to comprehensively consider the information provided by its friends and enemies to learn its strategy, and its strategy is sensitive to the received information. Additionally, the input space of the critic network increases rapidly with the number of agents in the actor-critic framework. It’s of great necessity to efficiently learn information representation to obtain important features. To this end, we present an approach that conducts information representation with attention mechanism. Our approach adopts the framework of centralized training and decentralized execution. We apply the multi-head hierarchical attention mechanism to centrally computed critics, so critics can process the received information more accurately and assist actors in choosing better actions. The hierarchical attention critic adopts a bi-level attention structure which is composed of the agent-level and the group-level. They are designed to assign different weights to friends’ and enemies’ information and then summarize them at each timestep. It achieves high efficiency and scalability in mixed tasks. Furthermore, we use the feature extraction based on the recurrent neural network to encode the state-action sequence information of each agent. Experimental results show that our approach is not only applicable to cooperative environments but also better in mixed environments, especially in the predator-prey task, the reward obtained by our method is twice that of the baselines.

Details

ISSN :
09252312
Volume :
471
Database :
OpenAIRE
Journal :
Neurocomputing
Accession number :
edsair.doi...........7caa94213ac7d2bee2ae8b8d7639db5d