Search

Your search keyword '"Xiao, Chenjun"' showing total 33 results

Search Constraints

Start Over You searched for: Author "Xiao, Chenjun" Remove constraint Author: "Xiao, Chenjun"
33 results on '"Xiao, Chenjun"'

Search Results

1. Kimi k1.5: Scaling Reinforcement Learning with LLMs

2. $\beta$-DQN: Improving Deep Q-Learning By Evolving the Behavior

3. Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

4. Diffusion Spectral Representation for Reinforcement Learning

5. Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation

6. An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models

7. Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning

8. Rethinking Decision Transformer via Hierarchical Reinforcement Learning

9. HarmonyDream: Task Harmonization Inside World Models

10. Iteratively Refined Behavior Regularization for Offline Reinforcement Learning

11. Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

12. The In-Sample Softmax for Offline Reinforcement Learning

13. Latent Variable Representation for Reinforcement Learning

14. Understanding the Effect of Stochasticity in Policy Optimization

15. The Curse of Passive Data Collection in Batch Reinforcement Learning

16. On the Optimality of Batch Policy Optimization Algorithms

17. On the Global Convergence Rates of Softmax Policy Gradient Methods

18. Learning to Combat Compounding-Error in Model-Based Reinforcement Learning

19. Integrating Factorization Ranked Features in MCTS: An Experimental Study

20. Efficient Reinforcement Learning from Partial Observability

21. In-Sample Policy Iteration for Offline Reinforcement Learning

22. Advances in Simulation-Based Search and Batch Reinforcement Learning

28. Hash table in Chinese Chess

32. Hash table in Chinese Chess.

Catalog

Books, media, physical & digital resources