Search

Author search for "Wang, Zhaoran", limited to the topic "fos: computer and information sciences": 105 results in total

Search Results

1. Contextual Dynamic Pricing with Strategic Buyers

2. A General Framework for Sequential Decision-Making under Adaptivity Constraints

3. What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

4. One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration

5. Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

6. Dynamic Datasets and Market Environments for Financial Reinforcement Learning

7. Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

8. Differentiable Arbitrating in Zero-sum Markov Games

9. Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

10. A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

11. Achieving Hierarchy-Free Approximation for Bilevel Programs With Equilibrium Constraints

12. Wardrop Equilibrium Can Be Boundedly Rational: A New Behavioral Theory of Route Choice

13. An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

14. Policy learning 'without' overlap: Pessimism and generalized empirical Bernstein's inequality

15. Latent Variable Representation for Reinforcement Learning

16. GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond

17. A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

18. Differentiable Bilevel Programming for Stackelberg Congestion Games

19. Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

20. Federated Offline Reinforcement Learning

21. RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

22. Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency

23. Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

24. Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning

25. Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

26. Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions

27. Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach

28. Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

29. Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency

30. Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

31. Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes

32. Offline Policy Optimization in RL with Variance Regularization

33. Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

34. Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

35. Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

36. Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

37. Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

38. Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

39. Dynamic Bottleneck for Robust Self-Supervised Exploration

40. On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

41. Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs

42. Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation

43. Towards General Function Approximation in Zero-Sum Markov Games

44. A Unified Off-Policy Evaluation Approach for General Value Function

45. Gap-Dependent Bounds for Two-Player Markov Games

46. Verification in the Loop: Correct-by-Construction Control Learning with Reach-avoid Guarantees

47. Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

48. ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning

49. FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance

50. Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
