
Your search for author "Dai, Juntao" returned 21 results.


Search Results

1. Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

2. Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback

3. Aligner: Efficient Alignment by Learning to Correct

4. AI Alignment: A Comprehensive Survey

5. Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

6. Baichuan 2: Open Large-scale Language Models

7. BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

8. OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research

9. Constrained Update Projection Approach to Safe Policy Optimization

10. CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning

19. Laparoscopic cholecystectomy for acute cholecystitis: clinical analysis of 216 cases
