Search

Your search for "Stefano, V." returned 1,249 results

Search Constraints

You searched for: Author "Stefano, V."
Search Limiters: Full Text

Search Results

1. Quantum groups of Borcherds-Cartan type and Khovanov-Lauda-Rouquier algebras

2. Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning

3. HyperMARL: Adaptive Hypernetworks for Multi-Agent RL

4. Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

5. Highway Graph to Accelerate Reinforcement Learning

6. Multi-Agent Reinforcement Learning for Energy Networks: Computational Challenges, Progress and Open Problems

7. LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots

8. Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras

9. People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior: Insights from Cognitive Science for Explainable AI

10. Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review

11. DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design

12. lpNTK: Better Generalisation with Less Data via Sample Interaction During Learning

13. Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning

14. Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning

15. How the level sampling process impacts zero-shot generalisation in deep reinforcement learning

16. Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

18. Conditional Mutual Information for Disentangled Representations in Reinforcement Learning

19. SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning

20. Using Offline Data to Speed Up Reinforcement Learning in Procedurally Generated Environments

21. Revisiting the Gumbel-Softmax in MADDPG

22. Causal Explanations for Sequential Decision-Making in Multi-Agent Systems

23. Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition

24. Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

25. Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

26. Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

27. DiPA: Probabilistic Multi-Modal Interactive Prediction for Autonomous Driving

28. A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning

29. Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

30. Deep Reinforcement Learning for Multi-Agent Interaction

31. Perspectives on the System-level Design of a Safe Autonomous Driving Stack

32. Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity

33. Few-Shot Teamwork

34. Cooperative Marine Operations via Ad Hoc Teams

35. Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning

36. Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

37. Verifiable Goal Recognition for Autonomous Driving with Occlusions

38. Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning

39. A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning

40. MIDGARD: A Simulation Platform for Autonomous Navigation in Unstructured Environments

41. Flash: Fast and Light Motion Prediction for Autonomous Driving with Bayesian Inverse Planning and Learned Motion Profiles

42. A Survey of Ad Hoc Teamwork Research

43. Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning

44. Learning Temporally-Consistent Representations for Data-Efficient Reinforcement Learning

45. Hydrogen reionisation ends by $z=5.3$: Lyman-$\alpha$ optical depth measured by the XQR-30 sample

46. Interpretable Goal Recognition in the Presence of Occluded Factors for Autonomous Vehicles

47. Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration

48. Expressivity of Emergent Language is a Trade-off between Contextual Complexity and Unpredictability

49. GRIT: Fast, Interpretable, and Verifiable Goal Recognition with Learned Decision Trees for Autonomous Driving

50. Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
