Start Over

Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.

Authors :: Akrour, Riad
Tateo, Davide
Peters, Jan
Source :: IEEE Transactions on Pattern Analysis & Machine Intelligence. Oct2022, Vol. 44 Issue 10, p6795-6806. 12p.
Publication Year :: 2022
Abstract: Reinforcement learning (RL) has demonstrated its ability to solve high dimensional tasks by leveraging non-linear function approximators. However, these successes are mostly achieved by ’black-box’ policies in simulated domains. When deploying RL to the real world, several concerns regarding the use of a ’black-box’ policy might be raised. In order to make the learned policies more transparent, we propose in this paper a policy iteration scheme that retains a complex function approximator for its internal value predictions but constrains the policy to have a concise, hierarchical, and human-readable structure, based on a mixture of interpretable experts. Each expert selects a primitive action according to a distance to a prototypical state. A key design decision to keep such experts interpretable is to select the prototypical states from trajectory data. The main technical contribution of the paper is to address the challenges introduced by this non-differentiable prototypical state selection procedure. Experimentally, we show that our proposed algorithm can learn compelling policies on continuous action deep RL benchmarks, matching the performance of neural network based policies, but returning policies that are more amenable to human inspection than neural network or linear-in-feature policies. [ABSTRACT FROM AUTHOR]

Subjects :: *ACTIVE learning
*NONLINEAR functions
*MACHINE learning
*REINFORCEMENT learning
*APPROXIMATION algorithms

Details

Language :: English
ISSN :: 01628828
Volume :: 44
Issue :: 10
Database :: Academic Search Index
Journal :: IEEE Transactions on Pattern Analysis & Machine Intelligence
Publication Type :: Academic Journal
Accession number :: 159210611
Full Text :: https://doi.org/10.1109/TPAMI.2021.3103132

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources