Start Over

Sim-to-Real quadrotor landing via sequential deep Q-Networks and domain randomization

Authors :: Marc Hanheide
Massimiliano Patacchiola
Riccardo Polvara
Gerhard Neumann
Source :: Robotics, 9 (1), 8, Robotics, Volume 9, Issue 1, Robotics, Vol 9, Iss 1, p 8 (2020)
Publication Year :: 2020
Publisher :: MDPI, 2020.
Abstract: The autonomous landing of an Unmanned Aerial Vehicle (UAV) on a marker is one of the most challenging problems in robotics. Many solutions have been proposed, with the best results achieved via customized geometric features and external sensors. This paper discusses for the first time the use of deep reinforcement learning as an end-to-end learning paradigm to find a policy for UAVs autonomous landing. Our method is based on a divide-and-conquer paradigm that splits a task into sequential sub-tasks, each one assigned to a Deep Q-Network (DQN), hence the name Sequential Deep Q-Network (SDQN). Each DQN in an SDQN is activated by an internal trigger, and it represents a component of a high-level control policy, which can navigate the UAV towards the marker. Different technical solutions have been implemented, for example combining vanilla and double DQNs, and the introduction of a partitioned buffer replay to address the problem of sample efficiency. One of the main contributions of this work consists in showing how an SDQN trained in a simulator via domain randomization, can effectively generalize to real-world scenarios of increasing complexity. The performance of SDQNs is comparable with a state-of-the-art algorithm and human pilots while being quantitatively better in noisy conditions.

Details

Language :: English
ISSN :: 22186581
Database :: OpenAIRE
Journal :: Robotics, 9 (1), 8, Robotics, Volume 9, Issue 1, Robotics, Vol 9, Iss 1, p 8 (2020)
Accession number :: edsair.doi.dedup.....53aab5b6b2fcb5393439f6e202ea4d2a

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Sim-to-Real quadrotor landing via sequential deep Q-Networks and domain randomization

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Sim-to-Real quadrotor landing via sequential deep Q-Networks and domain randomization

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources