Back to Search Start Over

Decentralized control of Partially Observable Markov Decision Processes using belief space macro-actions

Authors :
Christopher Amato
Shayegan Omidshafiei
Ali-akbar Agha-mohammadi
Jonathan P. How
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Omidshafiei, Shayegan
Aghamohammadi, Aliakbar
Amato, Christopher
How, Jonathan P
Massachusetts Institute of Technology. Laboratory for Information and Decision Systems
Agha-mohammadi, Ali-akbar
How, Jonathan P.
Source :
Other repository, ICRA, Omidshafiei
Publication Year :
2015
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2015.

Abstract

The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but representing and solving Dec-POMDPs is often intractable for large problems. To allow for a high-level representation that is natural for multi-robot problems and scalable to large discrete and continuous problems, this paper extends the Dec-POMDP model to the decentralized partially observable semi-Markov decision process (Dec-POSMDP). The Dec-POSMDP formulation allows asynchronous decision-making by the robots, which is crucial in multi-robot domains. We also present an algorithm for solving this Dec-POSMDP which is much more scalable than previous methods since it can incorporate closed-loop belief space macro-actions in planning. These macro-actions are automatically constructed to produce robust solutions. The proposed method's performance is evaluated on a complex multi-robot package delivery problem under uncertainty, showing that our approach can naturally represent multi-robot problems and provide high-quality solutions for large-scale problems.

Details

Database :
OpenAIRE
Journal :
Other repository, ICRA, Omidshafiei
Accession number :
edsair.doi.dedup.....0c06dfbc377ed6ec8ef11bdd4f8b065f