Search

Your search keyword '"Zimmert, Julian"' showing total 29 results

Search Constraints

Start Over You searched for: Author "Zimmert, Julian" Remove constraint Author: "Zimmert, Julian"
29 results on '"Zimmert, Julian"'

Search Results

1. Incentive-compatible Bandits: Importance Weighting No More

2. Optimal cross-learning for contextual bandits with unknown context distributions

3. Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

4. Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits

5. A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays

6. A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

7. Best of Both Worlds Policy Optimization

8. Refined Regret for Adversarial MDPs with Linear Function Approximation

9. A Unified Algorithm for Stochastic Path Problems

10. A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

11. A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback

12. Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

13. Pushing the Efficiency-Regret Pareto Frontier for Online Learning of Portfolios and Quantum States

14. The Pareto Frontier of model selection for general Contextual Bandits

15. A Model Selection Approach for Corruption Robust Reinforcement Learning

16. Efficient Methods for Online Multiclass Logistic Regression

17. Adapting to Misspecification in Contextual Bandits

18. Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

19. Model Selection in Contextual Stochastic Bandit Problems

20. Online Learning for Active Cache Synchronization

21. An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays

22. Connections Between Mirror Descent, Thompson Sampling and the Information Ratio

23. Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously

24. Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits

25. Factored Bandits

26. Distributed Optimization of Multi-Class SVMs

27. Connections Between Mirror Descent, Thompson Sampling and the Information Ratio

28. Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits.

Catalog

Books, media, physical & digital resources