Search

Your search keyword '"Restelli, Marcello"' showing total 359 results

Search Constraints

Start Over You searched for: Author "Restelli, Marcello" Remove constraint Author: "Restelli, Marcello"
359 results on '"Restelli, Marcello"'

Search Results

1. A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning

2. The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

3. Interpetable Target-Feature Aggregation for Multi-Task Learning based on Bias-Variance Analysis

4. Optimal Multi-Fidelity Best-Arm Identification

5. How to Explore with Belief: State Entropy Maximization in POMDPs

6. Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs

7. Policy Gradient with Active Importance Sampling

8. Information Capacity Regret Bounds for Bandits with Mediator Feedback

9. Inverse Reinforcement Learning with Sub-optimal Experts

12. Parameterized Projected Bellman Operator

13. Causal Feature Selection via Transfer Entropy

14. Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

15. Pure Exploration under Mediators' Feedback

16. Nonlinear Feature Aggregation: Two Algorithms driven by Theory

17. Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes

18. An Option-Dependent Analysis of Regret Minimization Algorithms in Finite-Horizon Semi-Markov Decision Processes

19. Truncating Trajectories in Monte Carlo Reinforcement Learning

20. Towards Theoretical Understanding of Inverse Reinforcement Learning

21. A Tale of Sampling and Estimation in Discounted Reinforcement Learning

22. Interpretable Linear Dimensionality Reduction based on Bias-Variance Analysis

23. Information-Theoretic Regret Bounds for Bandits with Fixed Expert Advice

24. Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control

25. Best Arm Identification for Stochastic Rising Bandits

26. Autoregressive Bandits

27. Tight Performance Guarantees of Imitator Policies with Continuous Actions

28. Stochastic Rising Bandits

29. Simultaneously Updating All Persistence Values in Reinforcement Learning

30. Dynamic Pricing with Volume Discounts in Online Settings

31. Dynamical Linear Bandits

32. Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

33. Storehouse: a Reinforcement Learning Environment for Optimizing Warehouse Management

34. Analysis, Characterization, Prediction and Attribution of Extreme Atmospheric Events with Machine Learning: a Review

35. Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts

36. ARLO: A Framework for Automated Reinforcement Learning

37. Delayed Reinforcement Learning by Imitation

38. Reward-Free Policy Space Compression for Reinforcement Learning

39. Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization

40. The Importance of Non-Markovianity in Maximum State Entropy Exploration

41. Challenging Common Assumptions in Convex Reinforcement Learning

42. Unsupervised Reinforcement Learning in Multiple Environments

43. Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization

44. Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

45. Quantum Compiling by Deep Reinforcement Learning

46. Meta-Reinforcement Learning by Tracking Task Non-stationarity

47. Leveraging Good Representations in Linear Contextual Bandits

48. Towards an AI-Based Framework for Autonomous Design and Construction: Learning from Reinforcement Learning Success in RTS Games

50. A Practical Guide to Multi-Objective Reinforcement Learning and Planning

Catalog

Books, media, physical & digital resources