Author search for "Lazaric, Alessandro": 319 results in total

Search Results

1. System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

2. Simple Ingredients for Offline Reinforcement Learning

3. Reinforcement Learning with Options and State Representation

4. Layered State Discovery for Incremental Autonomous Exploration

5. Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

6. On the Complexity of Representation Learning in Contextual Linear Bandits

7. Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler

8. Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

9. Contextual bandits with concave rewards, and an application to fair ranking

10. Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

11. Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies

12. Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL

13. Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

14. Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times

15. Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

16. Differentially Private Exploration in Reinforcement Learning with Linear Representation

17. Adaptive Multi-Goal Exploration

18. Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

19. Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

20. A general sample complexity analysis of vanilla policy gradient

21. Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

22. A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

23. A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning

24. Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

25. Leveraging Good Representations in Linear Contextual Bandits

26. Reinforcement Learning with Prototypical Representations

27. Improved Sample Complexity for Incremental Autonomous Exploration in MDPs

28. An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

29. Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

30. Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation

31. A Provably Efficient Sample Collection Strategy for Reinforcement Learning

32. Improved Analysis of UCRL2 with Empirical Bernstein Inequality

33. Sketched Newton-Raphson

34. A Novel Confidence-Based Algorithm for Structured Bandits

35. Meta-learning with Stochastic Linear Bandits

36. Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

37. Active Model Estimation in Markov Decision Processes

38. Learning Near Optimal Policies with Low Inherent Bellman Error

39. Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

40. Adversarial Attacks on Linear Contextual Bandits

41. Improved Algorithms for Conservative Exploration in Bandits

42. Conservative Exploration in Reinforcement Learning

43. Concentration Inequalities for Multinoulli Random Variables

44. No-Regret Exploration in Goal-Oriented Reinforcement Learning

45. Frequentist Regret Bounds for Randomized Least-Squares Value Iteration

46. A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning

47. Word-order biases in deep-agent emergent communication

48. Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret

49. Active Exploration in Markov Decision Processes

50. Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes
