Start Over

Policy Compression for Intelligent Continuous Control on Low-Power Edge Devices.

Authors :: Avé, Thomas
De Schepper, Tom
Mets, Kevin
Source :: Sensors (14248220). Aug2024, Vol. 24 Issue 15, p4876. 22p.
Publication Year :: 2024
Abstract: Interest in deploying deep reinforcement learning (DRL) models on low-power edge devices, such as Autonomous Mobile Robots (AMRs) and Internet of Things (IoT) devices, has seen a significant rise due to the potential of performing real-time inference by eliminating the latency and reliability issues incurred from wireless communication and the privacy benefits of processing data locally. Deploying such energy-intensive models on power-constrained devices is not always feasible, however, which has led to the development of model compression techniques that can reduce the size and computational complexity of DRL policies. Policy distillation, the most popular of these methods, can be used to first lower the number of network parameters by transferring the behavior of a large teacher network to a smaller student model before deploying these students at the edge. This works well with deterministic policies that operate using discrete actions. However, many real-world tasks that are power constrained, such as in the field of robotics, are formulated using continuous action spaces, which are not supported. In this work, we improve the policy distillation method to support the compression of DRL models designed to solve these continuous control tasks, with an emphasis on maintaining the stochastic nature of continuous DRL algorithms. Experiments show that our methods can be used effectively to compress such policies up to 750% while maintaining or even exceeding their teacher's performance by up to 41% in solving two popular continuous control tasks. [ABSTRACT FROM AUTHOR]

Subjects :: *DEEP reinforcement learning
*REINFORCEMENT learning
*AUTONOMOUS robots
*INTELLIGENT control systems
*MOBILE robots

Details

Language :: English
ISSN :: 14248220
Volume :: 24
Issue :: 15
Database :: Academic Search Index
Journal :: Sensors (14248220)
Publication Type :: Academic Journal
Accession number :: 178949939
Full Text :: https://doi.org/10.3390/s24154876

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Policy Compression for Intelligent Continuous Control on Low-Power Edge Devices.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Policy Compression for Intelligent Continuous Control on Low-Power Edge Devices.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources