Search

Your search keyword '"Mannor, Shie"' showing total 1,003 results

Search Constraints

Start Over You searched for: Author "Mannor, Shie" Remove constraint Author: "Mannor, Shie"
1,003 results on '"Mannor, Shie"'

Search Results

1. Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization

2. Improved Sample Complexity for Global Convergence of Actor-Critic Algorithms

3. Dual Pricing to Prioritize Renewable Energy and Consumer Preferences in Electricity Markets

4. Efficient Fairness-Performance Pareto Front Computation

5. From Glucose Patterns to Health Outcomes: A Generalizable Foundation Model for Continuous Glucose Monitor Data Analysis

6. PlaMo: Plan and Move in Rich 3D Physical Environments

7. RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation

8. On Bits and Bandits: Quantifying the Regret-Information Trade-off

9. Tree Search-Based Policy Optimization under Stochastic Execution Delay

10. On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

11. Conservative DDPG -- Pessimistic RL without Ensemble

12. Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization

13. Improving Token-Based World Models with Parallel Observation Prediction

14. SQT -- std $Q$-target

15. MinMaxMin $Q$-learning

16. Prospective Side Information for Latent MDPs

17. Optimization or Architecture: How to Hack Kalman Filtering

18. Solving Non-Rectangular Reward-Robust MDPs via Frequency Regularization

19. Sobolev Space Regularised Pre Density Models

20. Individualized Dosing Dynamics via Neural Eigen Decomposition

21. Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel

22. Representation-Driven Reinforcement Learning

23. CALM: Conditional Adversarial Latent Models for Directable Virtual Characters

24. Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization

25. An Efficient Solution to s-Rectangular Robust Markov Decision Processes

26. Policy Gradient for Rectangular Robust Markov Decision Processes

27. SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

28. Train Hard, Fight Easy: Robust Meta Reinforcement Learning

29. Towards Deployable RL -- What's Broken with RL Research and a Potential Fix

30. DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

31. Reward-Mixing MDPs with a Few Latent Contexts are Learnable

32. Tractable Optimality in Episodic Latent MABs

33. Policy Gradient for Reinforcement Learning with General Utilities

34. SoftTreeMax: Policy Gradient with Tree Search

35. Actor-Critic based Improper Reinforcement Learning

36. Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

37. Analysis of Stochastic Processes through Replay Buffers

38. Reinforcement Learning with a Terminator

39. Efficient Policy Iteration for Robust Markov Decision Processes via Regularization

40. Efficient Risk-Averse Reinforcement Learning

41. Optimizing Tensor Network Contraction Using Reinforcement Learning

42. Learning Hidden Markov Models When the Locations of Missing Observations are Unknown

43. Learning to reason about and to act on physical cascading events

44. Continuous Forecasting via Neural Eigen Decomposition

45. The Geometry of Robust Value Functions

46. Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms

47. Planning and Learning with Adaptive Lookahead

48. On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

49. Twice regularized MDPs and the equivalence between robustness and regularization

50. Query-Reward Tradeoffs in Multi-Armed Bandits

Catalog

Books, media, physical & digital resources