Your search for "Lazaric, Alessandro" returned 27 results

Search Constraints

Author: "Lazaric, Alessandro"
Language: undetermined

Search Results

1. Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

2. Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies

3. Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

4. Contextual bandits with concave rewards, and an application to fair ranking

5. On the Complexity of Representation Learning in Contextual Linear Bandits

6. Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

7. Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times

8. Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

9. A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

10. Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

11. A general sample complexity analysis of vanilla policy gradient

12. Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

13. Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

14. Meta-learning with Stochastic Linear Bandits

15. A Provably Efficient Sample Collection Strategy for Reinforcement Learning

16. Learning Near Optimal Policies with Low Inherent Bellman Error

17. Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

18. Concentration Inequalities for Multinoulli Random Variables

19. Active Model Estimation in Markov Decision Processes

20. Improved Analysis of UCRL2 with Empirical Bernstein Inequality

21. A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning

22. Frequentist Regret Bounds for Randomized Least-Squares Value Iteration

23. Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret

24. Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes

25. Thompson Sampling for Linear-Quadratic Control Problems

26. Reinforcement Learning of POMDPs using Spectral Methods

27. Analysis of Kelner and Levin graph sparsification algorithm for a streaming setting
