Search

Your search keyword '"Ghavamzadeh, Mohammad"' showing total 35 results

Search Constraints

Start Over You searched for: Author "Ghavamzadeh, Mohammad" Remove constraint Author: "Ghavamzadeh, Mohammad" Topic machine learning (stat.ml) Remove constraint Topic: machine learning (stat.ml)
35 results on '"Ghavamzadeh, Mohammad"'

Search Results

1. A Convex Relaxation Approach to Bayesian Regret Minimization in Offline Bandits

2. Robust Reinforcement Learning using Offline Data

3. Meta-Learning for Simple Regret Minimization

4. Deep Hierarchy in Bandits

5. Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

6. Operator Splitting Value Iteration

7. Soft-Robust Algorithms for Batch Reinforcement Learning

8. Control-Aware Representations for Model-based Reinforcement Learning

9. Stochastic Bandits with Linear Constraints

10. Finite-Sample Analysis of Proximal Gradient TD Algorithms

11. Neural Lyapunov Redesign

12. Mirror Descent Policy Optimization

13. Active Model Estimation in Markov Decision Processes

14. Predictive Coding for Locally-Linear Control

15. Online Planning with Lookahead Policies

16. Lyapunov-based Safe Policy Optimization for Continuous Control

17. Randomized Exploration in Generalized Linear Bandits

18. Benchmarking Batch Deep Reinforcement Learning Algorithms

19. Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

20. Risk-Sensitive Generative Adversarial Imitation Learning

21. A Lyapunov-based Approach to Safe Reinforcement Learning

22. A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

23. Path Consistency Learning in Tsallis Entropy Regularized MDPs

24. Optimizing over a Restricted Policy Class in Markov Decision Processes

25. Active Learning for Accurate Estimation of Linear Models

26. Online Learning to Rank in Stochastic Click Models

27. Bottleneck Conditional Density Estimation

28. Graphical Model Sketch

29. Conservative Contextual Linear Bandits

30. Bayesian Reinforcement Learning: A Survey

31. Safe Policy Improvement by Minimizing Robust Baseline Regret

32. Classification-based Approximate Policy Iteration: Experiments and Extended Discussions

33. Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs

34. A Generalized Kernel Approach to Structured Output Learning

35. A Dantzig Selector Approach to Temporal Difference Learning

Catalog

Books, media, physical & digital resources