Search

Your search keyword '"Restelli, Marcello"' showing total 446 results

Search Constraints

Start Over You searched for: Author "Restelli, Marcello" Remove constraint Author: "Restelli, Marcello"
446 results on '"Restelli, Marcello"'

Search Results

1. Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs

2. Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach

3. Exploiting Risk-Aversion and Size-dependent fees in FX Trading with Fitted Natural Actor-Critic

4. Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting

5. Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting

6. State and Action Factorization in Power Grids

7. A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning

8. The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

9. Interpetable Target-Feature Aggregation for Multi-Task Learning based on Bias-Variance Analysis

10. Optimal Multi-Fidelity Best-Arm Identification

11. How to Explore with Belief: State Entropy Maximization in POMDPs

12. Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs

13. Policy Gradient with Active Importance Sampling

15. Information Capacity Regret Bounds for Bandits with Mediator Feedback

16. Inverse Reinforcement Learning with Sub-optimal Experts

17. Parameterized Projected Bellman Operator

19. Causal Feature Selection via Transfer Entropy

20. Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

21. Pure Exploration under Mediators' Feedback

22. Nonlinear Feature Aggregation: Two Algorithms driven by Theory

23. Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes

24. An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes

25. Truncating Trajectories in Monte Carlo Reinforcement Learning

26. Towards Theoretical Understanding of Inverse Reinforcement Learning

27. A Tale of Sampling and Estimation in Discounted Reinforcement Learning

28. Interpretable Linear Dimensionality Reduction based on Bias-Variance Analysis

29. Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice

30. Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control

31. Best Arm Identification for Stochastic Rising Bandits

32. Autoregressive Bandits

33. Tight Performance Guarantees of Imitator Policies with Continuous Actions

34. Stochastic Rising Bandits

35. Building Surrogate Models Using Trajectories of Agents Trained by Reinforcement Learning

36. Simultaneously Updating All Persistence Values in Reinforcement Learning

37. Dynamic Pricing with Volume Discounts in Online Settings

38. Dynamical Linear Bandits

39. Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

40. Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management

41. Analysis, Characterization, Prediction and Attribution of Extreme Atmospheric Events with Machine Learning: a Review

42. Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts

43. ARLO: A Framework for Automated Reinforcement Learning

44. Delayed Reinforcement Learning by Imitation

45. Reward-Free Policy Space Compression for Reinforcement Learning

46. Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization

47. The Importance of Non-Markovianity in Maximum State Entropy Exploration

48. Challenging Common Assumptions in Convex Reinforcement Learning

49. Unsupervised Reinforcement Learning in Multiple Environments

50. Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization

Catalog

Books, media, physical & digital resources