Search

Your search keyword '"Restelli, Marcello"' showing total 359 results

Search Constraints

Start Over You searched for: Author "Restelli, Marcello" Remove constraint Author: "Restelli, Marcello"
359 results on '"Restelli, Marcello"'

Search Results

51. Policy Optimization as Online Learning with Mediator Feedback

52. Option Hedging with Risk Averse Reinforcement Learning

53. An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

54. Inverse Reinforcement Learning from a Gradient-based Learner

55. Newton Optimization on Helmholtz Decomposition for Continuous Games

56. Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate

57. Sequential Transfer in Reinforcement Learning with a Generative Model

58. Time-Variant Variational Transfer for Value Functions

59. A Novel Confidence-Based Algorithm for Structured Bandits

60. Online Joint Bid/Daily Budget Optimization of Internet Advertising Campaigns

61. Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning

62. MushroomRL: Simplifying Reinforcement Learning Research

63. Risk-Averse Trust Region Optimization for Reward-Volatility Reduction

67. Policy Space Identification in Configurable Environments

68. Gradient-Aware Model-based Policy Search

69. Feature Selection via Mutual Information: New Theoretical Insights

70. An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies

71. Smoothing Policies and Safe Policy Gradients

72. Coherent Transport of Quantum States by Deep Reinforcement Learning

75. Policy Optimization via Importance Sampling

76. Stochastic Variance-Reduced Policy Gradient

77. Configurable Markov Decision Processes

78. Importance Weighted Transfer of Samples in Reinforcement Learning

81. Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent

83. Conservative Online Convex Optimization

84. Exploiting History Data for Nonstationary Multi-armed Bandit

85. Unimodal Thompson Sampling for Graph-Structured Arms

88. A practical guide to multi-objective reinforcement learning and planning

89. Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material

92. Efficient evolutionary dynamics with extensive-form games

93. Transfer from Multiple MDPs

100. The EU-funded I3LUNG Project: Integrative Science, Intelligent Data Platform for Individualized LUNG Cancer Care With Immunotherapy

Catalog

Books, media, physical & digital resources