Search

Your search for author "Laroche, Romain" returned 131 results.

Search Results

1. Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning

2. Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

3. Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

4. Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting

5. Think Before You Act: Decision Transformers with Working Memory

6. Behavior Prior Representation learning for Offline Reinforcement Learning

7. Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning

8. Contrastive Multimodal Learning for Emergence of Graphical Sensory-Motor Communication

9. Using Representation Expressiveness and Learnability to Evaluate Self-Supervised Learning Methods

10. Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning

11. When does return-conditioned supervised learning work for offline reinforcement learning?

12. Non-Markovian policies occupancy measures

13. One-Shot Learning from a Demonstration with Hierarchical Latent Language

14. Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms

15. On the Convergence of SARSA with Linear Function Approximation

16. Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch

17. Batched Bandits with Crowd Externalities

18. Dr Jekyll and Mr Hyde: the Strange Case of Off-Policy Policy Updates

19. The Emergence of the Shape Bias Results from Communicative Efficiency

20. Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs

22. A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

23. Reinforcement Learning Framework for Deep Brain Stimulation Study

24. Learning Dynamic Belief Graphs to Generalize on Text-Based Games

25. Building Dynamic Knowledge Graphs from Text-based Games

26. Safe Policy Improvement with an Estimated Baseline Policy

27. Safe Policy Improvement with Soft Baseline Bootstrapping

28. Budgeted Reinforcement Learning in Continuous State Space

29. Decentralized Exploration in Multi-Armed Bandits -- Extended version

30. Counting to Explore and Generalize in Text-based Games

31. Safe Policy Improvement with Baseline Bootstrapping

32. The Complex Negotiation Dialogue Game

33. Hybrid Reward Architecture for Reinforcement Learning

34. Multi-Advisor Reinforcement Learning

35. Reinforcement Learning Algorithm Selection

36. Separation of Concerns in Reinforcement Learning

37. Safe Policy Improvement with Soft Baseline Bootstrapping

44. Think Before You Act: Decision Transformers with Internal Working Memory

45. Batched Bandits with Crowd Externalities

46. Emergence of Shared Sensory-motor Graphical Language from Visual Input

47. Contextual Bandit for Active Learning: Active Thompson Sampling

48. Massive Multi-Player Multi-Armed Bandits for IoT Networks: An Application on LoRa Networks

49. Reward Shaping for Statistical Optimisation of Dialogue Management
