Author: "Mets, Kevin" / Topic: statistics - machine learning - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mets, Kevin"' showing total 3 results

Start Over Author "Mets, Kevin" Topic statistics - machine learning

3 results on '"Mets, Kevin"'

1. HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal Memory

Author: Struye, Jakob, Mets, Kevin, and Latré, Steven
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Building Reinforcement Learning (RL) algorithms which are able to adapt to continuously evolving tasks is an open research challenge. One technology that is known to inherently handle such non-stationary input patterns well is Hierarchical Temporal Memory (HTM), a general and biologically plausible computational model for the human neocortex. As the RL paradigm is inspired by human learning, HTM is a natural framework for an RL algorithm supporting non-stationary environments. In this paper, we present HTMRL, the first strictly HTM-based RL algorithm. We empirically and statistically show that HTMRL scales to many states and actions, and demonstrate that HTM's ability for adapting to changing patterns extends to RL. Specifically, HTMRL performs well on a 10-armed bandit after 750 steps, but only needs a third of that to adapt to the bandit suddenly shuffling its arms. HTMRL is the first iteration of a novel RL approach, with the potential of extending to a capable algorithm for Meta-RL.
Published: 2020

2. Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

Author: Hutsebaut-Buysse, Matthias, Mets, Kevin, and Latré, Steven
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Reinforcement learning (RL) algorithms typically start tabula rasa, without any prior knowledge of the environment, and without any prior skills. This however often leads to low sample efficiency, requiring a large amount of interaction with the environment. This is especially true in a lifelong learning setting, in which the agent needs to continually extend its capabilities. In this paper, we examine how a pre-trained task-independent language model can make a goal-conditional RL agent more sample efficient. We do this by facilitating transfer learning between different related tasks. We experimentally demonstrate our approach on a set of object navigation tasks., Comment: Paper accepted to the ICML 2020 Language in Reinforcement Learning (LaReL) Workshop
Published: 2020

3. Learning to Communicate Using Counterfactual Reasoning

Author: Vanneste, Simon, Vanneste, Astrid, Mets, Kevin, De Schepper, Tom, Anwar, Ali, Mercelis, Siegfried, Latré, Steven, and Hellinckx, Peter
Subjects: Computer Science - Machine Learning, Computer Science - Multiagent Systems, Statistics - Machine Learning
Abstract: Learning to communicate in order to share state information is an active problem in the area of multi-agent reinforcement learning (MARL). The credit assignment problem, the non-stationarity of the communication environment and the creation of influenceable agents are major challenges within this research field which need to be overcome in order to learn a valid communication protocol. This paper introduces the novel multi-agent counterfactual communication learning (MACC) method which adapts counterfactual reasoning in order to overcome the credit assignment problem for communicating agents. Secondly, the non-stationarity of the communication environment while learning the communication Q-function is overcome by creating the communication Q-function using the action policy of the other agents and the Q-function of the action environment. Additionally, a social loss function is introduced in order to create influenceable agents which is required to learn a valid communication protocol. Our experiments show that MACC is able to outperform the state-of-the-art baselines in four different scenarios in the Particle environment., Comment: Accepted at Adaptive and Learning Agents Workshop (ALA 2022) https://ala2022.github.io/
Published: 2020

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Mets, Kevin"'

1. HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal Memory

2. Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

3. Learning to Communicate Using Counterfactual Reasoning

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

3 results on '"Mets, Kevin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources