Author: "Dai, Juntao" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Dai, Juntao"' showing total 21 results

1. Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation

Author: Dai, Juntao, Yang, Yaodong, Zheng, Qian, and Pan, Gang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: A key aspect of Safe Reinforcement Learning (Safe RL) involves estimating the constraint condition for the next policy, which is crucial for guiding the optimization of safe policy updates. However, the existing Advantage-based Estimation (ABE) method relies on the infinite-horizon discounted advantage function. This dependence leads to catastrophic errors in finite-horizon scenarios with non-discounted constraints, resulting in safety-violation updates. In response, we propose the first estimation method for finite-horizon non-discounted constraints in deep Safe RL, termed Gradient-based Estimation (GBE), which relies on the analytic gradient derived along trajectories. Our theoretical and empirical analyses demonstrate that GBE can effectively estimate constraint changes over a finite horizon. Constructing a surrogate optimization problem with GBE, we developed a novel Safe RL algorithm called Constrained Gradient-based Policy Optimization (CGPO). CGPO identifies feasible optimal policies by iteratively resolving sub-problems within trust regions. Our empirical results reveal that CGPO, unlike baseline algorithms, successfully estimates the constraint functions of subsequent policies, thereby ensuring the efficiency and feasibility of each update.
Published: 2024

2. Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback

Author: Zhou, Jiayi, Ji, Jiaming, Dai, Juntao, and Yang, Yaodong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Aligning the behavior of large language models (LLMs) with human intentions and values remains a critical challenge. Reinforcement learning from human feedback (RLHF) aligns LLMs by training a reward model (RM) on human preferences and fine-tuning the LLMs to maximize RM feedback. Despite its effectiveness and popularity, RLHF is prone to biased local optimization. It means RM fails to provide feedback that accurately aligns with human preference, causing LLMs to explore unexpected generalizations, and failing to achieve alignment objectives. To mitigate this issue, we propose a novel sequence-to-sequence (seq2seq) reward modeling method. Its key insight is that learning from language feedback rather than scalar feedback improves RLHF without additional annotations. We replaced the reward modeling target from binary maximum likelihood estimation (MLE) with sequence MLE. This method enables richer and fine-grained language feedback without additional annotations, models, or training stages. Our experiments demonstrated its effectiveness, specifically, reducing the refusal-to-response paradigm in single-turn safety dialogues and the long-response bias in text summarization tasks. We provide further analysis that seq2seq RM improves RLHF performance across 2B and 7B LLMs on 3 NLP tasks, achieving an average win rate of 76.9%. We further show that seq2seq RM can still improve the performance of RLHF under out-of-distribution prompts.
Comment: 7 pages
Published: 2024

3. Aligner: Efficient Alignment by Learning to Correct

Author: Ji, Jiaming, Chen, Boyuan, Lou, Hantao, Hong, Donghai, Zhang, Borong, Pan, Xuehai, Dai, Juntao, Qiu, Tianyi, and Yang, Yaodong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: With the rapid development of large language models (LLMs) and ever-evolving practical requirements, finding an efficient and effective alignment method has never been more critical. However, the tension between the complexity of current alignment methods and the need for rapid iteration in deployment scenarios necessitates the development of a model-agnostic alignment approach that can operate under these constraints. In this paper, we introduce Aligner, a novel and simple alignment paradigm that learns the correctional residuals between preferred and dispreferred answers using a small model. Designed as a model-agnostic, plug-and-play module, Aligner can be directly applied to various open-source and API-based models with only one-off training, making it suitable for rapid iteration. Notably, Aligner can be applied to any powerful, large-scale upstream models. Moreover, it can even iteratively bootstrap the upstream models using corrected responses as synthetic human preference data, breaking through the model's performance ceiling. Our experiments demonstrate performance improvements by deploying the same Aligner model across 11 different LLMs, evaluated on the 3H dimensions (helpfulness, harmlessness, and honesty). Specifically, Aligner-7B has achieved an average improvement of 68.9% in helpfulness and 23.8% in harmlessness across the tested LLMs while also effectively reducing hallucination. In the Alpaca-Eval leaderboard, stacking Aligner-2B on GPT-4 Turbo improved its LC Win Rate from 55.0% to 58.3%, surpassing GPT-4 Omni's 57.5% Win Rate (community report).
Comment: Accepted by NeurIPS 2024 Oral Presentation
Published: 2024

4. AI Alignment: A Comprehensive Survey

Author: Ji, Jiaming, Qiu, Tianyi, Chen, Boyuan, Zhang, Borong, Lou, Hantao, Wang, Kaile, Duan, Yawen, He, Zhonghao, Zhou, Jiayi, Zhang, Zhaowei, Zeng, Fanzhi, Ng, Kwan Yee, Dai, Juntao, Pan, Xuehai, O'Gara, Aidan, Lei, Yingshan, Xu, Hua, Tse, Brian, Fu, Jie, McAleer, Stephen, Yang, Yaodong, Wang, Yizhou, Zhu, Song-Chun, Guo, Yike, and Gao, Wen
Subjects: Computer Science - Artificial Intelligence
Abstract: AI alignment aims to make AI systems behave in line with human intentions and values. As AI systems grow more capable, so do risks from misalignment. To provide a comprehensive and up-to-date overview of the alignment field, in this survey, we delve into the core concepts, methodology, and practice of alignment. First, we identify four principles as the key objectives of AI alignment: Robustness, Interpretability, Controllability, and Ethicality (RICE). Guided by these four principles, we outline the landscape of current alignment research and decompose them into two key components: forward alignment and backward alignment. The former aims to make AI systems aligned via alignment training, while the latter aims to gain evidence about the systems' alignment and govern them appropriately to avoid exacerbating misalignment risks. On forward alignment, we discuss techniques for learning from feedback and learning under distribution shift. On backward alignment, we discuss assurance techniques and governance practices. We also release and continually update the website (www.alignmentsurvey.com) which features tutorials, collections of papers, blog posts, and other resources.
Comment: Continually updated, including weak-to-strong generalization and socio-technical thinking. 58 pages (excluding bibliography), 801 references
Published: 2023

5. Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark

Author: Ji, Jiaming, Zhang, Borong, Zhou, Jiayi, Pan, Xuehai, Huang, Weidong, Sun, Ruiyang, Geng, Yiran, Zhong, Yifan, Dai, Juntao, and Yang, Yaodong
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Artificial intelligence (AI) systems possess significant potential to drive societal progress. However, their deployment often faces obstacles due to substantial safety concerns. Safe reinforcement learning (SafeRL) emerges as a solution to optimize policies while simultaneously adhering to multiple constraints, thereby addressing the challenge of integrating reinforcement learning in safety-critical scenarios. In this paper, we present an environment suite called Safety-Gymnasium, which encompasses safety-critical tasks in both single and multi-agent scenarios, accepting vector and vision-only input. Additionally, we offer a library of algorithms named Safe Policy Optimization (SafePO), comprising 16 state-of-the-art SafeRL algorithms. This comprehensive library can serve as a validation tool for the research community. By introducing this benchmark, we aim to facilitate the evaluation and comparison of safety performance, thus fostering the development of reinforcement learning for safer, more reliable, and responsible real-world applications. The website of this project can be accessed at https://sites.google.com/view/safety-gymnasium.
Comment: Published at NeurIPS 2023
Published: 2023

6. Baichuan 2: Open Large-scale Language Models

Author: Yang, Aiyuan, Xiao, Bin, Wang, Bingning, Zhang, Borong, Bian, Ce, Yin, Chao, Lv, Chenxu, Pan, Da, Wang, Dian, Yan, Dong, Yang, Fan, Deng, Fei, Wang, Feng, Liu, Feng, Ai, Guangwei, Dong, Guosheng, Zhao, Haizhou, Xu, Hang, Sun, Haoze, Zhang, Hongda, Liu, Hui, Ji, Jiaming, Xie, Jian, Dai, JunTao, Fang, Kun, Su, Lei, Song, Liang, Liu, Lifeng, Ru, Liyun, Ma, Luyao, Wang, Mang, Liu, Mickel, Lin, MingAn, Nie, Nuolan, Guo, Peidong, Sun, Ruiyang, Zhang, Tao, Li, Tianpeng, Li, Tianyu, Cheng, Wei, Chen, Weipeng, Zeng, Xiangrong, Wang, Xiaochuan, Chen, Xiaoxi, Men, Xin, Yu, Xin, Pan, Xuehai, Shen, Yanjun, Wang, Yiding, Li, Yiyu, Jiang, Youxin, Gao, Yuchen, Zhang, Yupeng, Zhou, Zenan, and Wu, Zhiying
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most powerful LLMs are closed-source or limited in their capability for languages other than English. In this technical report, we present Baichuan 2, a series of large-scale multilingual language models containing 7 billion and 13 billion parameters, trained from scratch, on 2.6 trillion tokens. Baichuan 2 matches or outperforms other open-source models of similar size on public benchmarks like MMLU, CMMLU, GSM8K, and HumanEval. Furthermore, Baichuan 2 excels in vertical domains such as medicine and law. We will release all pre-training model checkpoints to benefit the research community in better understanding the training dynamics of Baichuan 2.
Comment: Baichuan 2 technical report. Github: https://github.com/baichuan-inc/Baichuan2
Published: 2023

7. BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset

Author: Ji, Jiaming, Liu, Mickel, Dai, Juntao, Pan, Xuehai, Zhang, Chi, Bian, Ce, Sun, Ruiyang, Wang, Yizhou, and Yang, Yaodong
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we introduce the BeaverTails dataset, aimed at fostering research on safety alignment in large language models (LLMs). This dataset uniquely separates annotations of helpfulness and harmlessness for question-answering pairs, thus offering distinct perspectives on these crucial attributes. In total, we have gathered safety meta-labels for 333,963 question-answer (QA) pairs and 361,903 pairs of expert comparison data for both the helpfulness and harmlessness metrics. We further showcase applications of BeaverTails in content moderation and reinforcement learning with human feedback (RLHF), emphasizing its potential for practical safety measures in LLMs. We believe this dataset provides vital resources for the community, contributing towards the safe development and deployment of LLMs. Our project page is available at the following URL: https://sites.google.com/view/pku-beavertails.
Comment: Published at NeurIPS 2023
Published: 2023

8. OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research

Author: Ji, Jiaming, Zhou, Jiayi, Zhang, Borong, Dai, Juntao, Pan, Xuehai, Sun, Ruiyang, Huang, Weidong, Geng, Yiran, Liu, Mickel, and Yang, Yaodong
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: AI systems empowered by reinforcement learning (RL) algorithms harbor the immense potential to catalyze societal advancement, yet their deployment is often impeded by significant safety concerns. Particularly in safety-critical applications, researchers have raised concerns about unintended harms or unsafe behaviors of unaligned RL agents. The philosophy of safe reinforcement learning (SafeRL) is to align RL agents with harmless intentions and safe behavioral patterns. In SafeRL, agents learn to develop optimal policies by receiving feedback from the environment, while also fulfilling the requirement of minimizing the risk of unintended harm or unsafe behavior. However, due to the intricate nature of SafeRL algorithm implementation, combining methodologies across various domains presents a formidable challenge. This had led to an absence of a cohesive and efficacious learning framework within the contemporary SafeRL research milieu. In this work, we introduce a foundational framework designed to expedite SafeRL research endeavors. Our comprehensive framework encompasses an array of algorithms spanning different RL domains and places heavy emphasis on safety elements. Our efforts are to make the SafeRL-related research process more streamlined and efficient, therefore facilitating further research in AI safety. Our project is released at: https://github.com/PKU-Alignment/omnisafe.
Published: 2023

9. Constrained Update Projection Approach to Safe Policy Optimization

Author: Yang, Long, Ji, Jiaming, Dai, Juntao, Zhang, Linrui, Zhou, Binbin, Li, Pengfei, Yang, Yaodong, and Pan, Gang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Safe reinforcement learning (RL) studies problems where an intelligent agent has to not only maximize reward but also avoid exploring unsafe areas. In this study, we propose CUP, a novel policy optimization method based on Constrained Update Projection framework that enjoys rigorous safety guarantee. Central to our CUP development is the newly proposed surrogate functions along with the performance bound. Compared to previous safe RL methods, CUP enjoys the benefits of 1) CUP generalizes the surrogate functions to generalized advantage estimator (GAE), leading to strong empirical performance; 2) CUP unifies performance bounds, providing a better understanding and interpretability for some existing algorithms; 3) CUP provides a non-convex implementation via only first-order optimizers, which does not require any strong approximation on the convexity of the objectives. To validate our CUP method, we compared CUP against a comprehensive list of safe RL baselines on a wide range of tasks. Experiments show the effectiveness of CUP both in terms of reward and safety constraint satisfaction. We have opened the source code of CUP at this link https://github.com/zmsn-2077/CUP-safe-rl.
Comment: Accepted by NeurIPS2022. arXiv admin note: substantial text overlap with arXiv:2202.07565
Published: 2022

10. CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning

Author: Yang, Long, Ji, Jiaming, Dai, Juntao, Zhang, Yu, Li, Pengfei, and Pan, Gang
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Safe reinforcement learning (RL) is still very challenging since it requires the agent to consider both return maximization and safe exploration. In this paper, we propose CUP, a Conservative Update Policy algorithm with a theoretical safety guarantee. We derive the CUP based on the new proposed performance bounds and surrogate functions. Although using bounds as surrogate functions to design safe RL algorithms have appeared in some existing works, we develop them at least three aspects: (i) We provide a rigorous theoretical analysis to extend the surrogate functions to generalized advantage estimator (GAE). GAE significantly reduces variance empirically while maintaining a tolerable level of bias, which is an efficient step for us to design CUP; (ii) The proposed bounds are tighter than existing works, i.e., using the proposed bounds as surrogate functions are better local approximations to the objective and safety constraints. (iii) The proposed CUP provides a non-convex implementation via first-order optimizers, which does not depend on any convex approximation. Finally, extensive experiments show the effectiveness of CUP where the agent satisfies safe constraints. We have opened the source code of CUP at https://github.com/RL-boxes/Safe-RL.
Published: 2022

11. Flow pattern transition and void fraction prediction of gas–liquid flow in helically coiled tubes

Author: Liu, Li, Zhang, Jiarong, Hu, Bing, Dai, Juntao, Wang, Ke, and Gu, Hanyang
Published: 2022
Full Text: View/download PDF

12. Smart manufacturing of nonferrous metallurgical processes: Review and perspectives

Author: Sun, Bei, Dai, Juntao, Huang, Keke, Yang, Chunhua, and Gui, Weihua
Published: 2022
Full Text: View/download PDF

13. Reducing postoperative complications and improving clinical outcome: Enhanced recovery after surgery in pancreaticoduodenectomy – A retrospective cohort study

Author: Dai, Juntao, Jiang, Yongjian, and Fu, Deliang
Published: 2017
Full Text: View/download PDF

14. Magnetic mesoporous nanospheres anchored with LyP-1 as an efficient pancreatic cancer probe

Author: Jiang, Yongjian, Liu, Shaojun, Zhang, Yu, Li, Hengchao, He, Hang, Dai, Juntao, Jiang, Tao, Ji, Weihang, Geng, Daoying, Elzatahry, Ahmed A., Alghamdi, Abdulaziz, Fu, Deliang, Deng, Yonghui, and Zhao, Dongyuan
Published: 2017
Full Text: View/download PDF

15. Characterization of Interfacial Wave in Annular Flow of Newtonian and Non-Newtonian Fluids Using Image Processing Technology

Author: Wang, Ke, Dai, Juntao, Liu, Li, Lin, Ruinan, and Shi, Fangjun
Published: 2023
Full Text: View/download PDF

16. Improving the wet adhesive bonding of bamboo urea‐formaldehyde adhesive using styrene acrylate by controlling monomer ratios

Author: Wu, Sai, Liang, Lulu, Chen, Furong, Yang, Zheng, Zheng, Yu, Wu, Yitian, Li, Lanze, Lou, Gaobo, Dai, Juntao, Pang, Yajun, Chen, Hao, Fang, Qun, and Shen, Zhehong
Published: 2022
Full Text: View/download PDF

17. Erasing and dehumanizing Natives to protect positive national identity: The Native mascot example

Author: Dai, Juntao Doris, Lopez, Julisa J., Brady, Laura M., Eason, Arianne E., and Fryberg, Stephanie A.
Published: 2021
Full Text: View/download PDF

18. Capture and recognition of interfacial waves in annular flow based on image analysis technology

Author: Liu, Li, Wang, Ke, Lin, Ruinan, and Dai, Juntao
Published: 2021
Full Text: View/download PDF

19. Laparoscopic cholecystectomy for acute cholecystitis: clinical analysis of 216 cases

Author: DAI Juntao
Subjects: acute, lcsh:Diseases of the digestive system. Gastroenterology, cholecystectomy, laparoscopic, cholecystitis, lcsh:RC799-869
Abstract: Objective: To investigate the clinical experience of laparoscopic cholecystectomy (LC) for acute cholecystitis. Methods: A retrospective analysis was performed on the clinical records of 216 patients with acute cholecystitis who underwent LC in Qingpu Branch of Zhongshan Hospital, Fudan University from January 2010 to January 2013. LC was performed under intubation general anaesthesia, with three holes conventionally and four holes if necessary. After operation, the drainage tube was placed for 1-3 d, and antibiotics were administered for 3-5 d. The time of operation, length of postoperative hospital stay, and incidence of postoperative complications were determined. All patients were followed up for at least 0.5 year after operation. Results: LC was successfully performed in 188 (87.0%) of all patients; 28 (13.0%) of all patients were converted to open surgery. The mean time of operation was 62.00±11.27 min; the mean length of hospital stay was 4.60±2.16 d; the incidence of postoperative complications was 2.3% (5/216). All patients were cured and discharged. During follow-up, no patients developed other complications and all recovered well. Conclusion: LC is safe and feasible in the treatment of acute cholecystitis. Correct manipulation of the Calot's triangle and proper abdominal drainage are the key to successful operation.
Published: 2014

20. Research of the Penetration Process for Concrete Target Based on the Finite Element Method

Author: Wang, Feng, Dai, Juntao, Gao, Binsen, and Ma, Yihong
Published: 2018
Full Text: View/download PDF

21. Sonic Hedgehog Expression Correlates With Distant Metastasis in Pancreatic Adenocarcinoma

Author: Dai, Juntao, Ai, Kaixing, Du, Yilong, and Chen, Guorong
Published: 2011
Full Text: View/download PDF
