Search

Your search for author "Lin, Runji" returned 18 results

Search Results

1. Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

2. Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence

3. Qwen2 Technical Report

4. LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

5. Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

6. Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

7. Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

8. Qwen Technical Report

9. #InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

10. Large Sequence Models for Sequential Decision-Making: A Survey

11. Learning Robust Communication by Adversarial Training in Networked System Control

12. Contextual Transformer for Offline Meta Reinforcement Learning

13. Scalable Model-based Policy Optimization for Decentralized Networked Systems

14. Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

17. Increasing the Data Rate for Reflected Optical Camera Communication Using Uniform LED Light
