Author: "Shen, Lixing" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Shen, Lixing"' showing total 10 results

Start Over Author "Shen, Lixing" Search Limiters Full Text

10 results on '"Shen, Lixing"'

1. Secrets of RLHF in Large Language Models Part II: Reward Modeling

Author: Wang, Binghai, Zheng, Rui, Chen, Lu, Liu, Yan, Dou, Shihan, Huang, Caishuang, Shen, Wei, Jin, Senjie, Zhou, Enyu, Shi, Chenyu, Gao, Songyang, Xu, Nuo, Zhou, Yuhao, Fan, Xiaoran, Xi, Zhiheng, Zhao, Jun, Wang, Xiao, Ji, Tao, Yan, Hang, Shen, Lixing, Chen, Zhan, Gui, Tao, Zhang, Qi, Qiu, Xipeng, Huang, Xuanjing, Wu, Zuxuan, and Jiang, Yu-Gang
Subjects: Computer Science - Artificial Intelligence
Abstract: Reinforcement Learning from Human Feedback (RLHF) has become a crucial technology for aligning language models with human values and intentions, enabling models to produce more helpful and harmless responses. Reward models are trained as proxies for human preferences to drive reinforcement learning optimization. While reward models are often considered central to achieving high performance, they face the following challenges in practical applications: (1) Incorrect and ambiguous preference pairs in the dataset may hinder the reward model from accurately capturing human intent. (2) Reward models trained on data from a specific distribution often struggle to generalize to examples outside that distribution and are not suitable for iterative RLHF training. In this report, we attempt to address these two issues. (1) From a data perspective, we propose a method to measure the strength of preferences within the data, based on a voting mechanism of multiple reward models. Experimental results confirm that data with varying preference strengths have different impacts on reward model performance. We introduce a series of novel methods to mitigate the influence of incorrect and ambiguous preferences in the dataset and fully leverage high-quality preference data. (2) From an algorithmic standpoint, we introduce contrastive learning to enhance the ability of reward models to distinguish between chosen and rejected responses, thereby improving model generalization. Furthermore, we employ meta-learning to enable the reward model to maintain the ability to differentiate subtle differences in out-of-distribution samples, and this approach can be utilized for iterative RLHF optimization.
Published: 2024

2. Mn(I)-catalyzed sigmatropic rearrangement of β, γ-unsaturated alcohols

Author: Yang, Can, Zhou, Xiaoyu, Shen, Lixing, Ke, Zhuofeng, Jiang, Huanfeng, and Zeng, Wei
Published: 2023
Full Text: View/download PDF

3. Two Step Joint Model for Drug Drug Interaction Extraction

Author: Tang, Siliang, Zhang, Qi, Zheng, Tianpeng, Zhou, Mengdi, Chen, Zhan, Shen, Lixing, Ren, Xiang, Zhuang, Yueting, Pu, Shiliang, and Wu, Fei
Subjects: Computer Science - Computation and Language
Abstract: When patients need to take medicine, particularly taking more than one kind of drug simultaneously, they should be alarmed that there possibly exists drug-drug interaction. Interaction between drugs may have a negative impact on patients or even cause death. Generally, drugs that conflict with a specific drug (or label drug) are usually described in its drug label or package insert. Since more and more new drug products come into the market, it is difficult to collect such information by manual. We take part in the Drug-Drug Interaction (DDI) Extraction from Drug Labels challenge of Text Analysis Conference (TAC) 2018, choosing task1 and task2 to automatically extract DDI related mentions and DDI relations respectively. Instead of regarding task1 as named entity recognition (NER) task and regarding task2 as relation extraction (RE) task then solving it in a pipeline, we propose a two step joint model to detect DDI and it's related mentions jointly. A sequence tagging system (CNN-GRU encoder-decoder) finds precipitants first and search its fine-grained Trigger and determine the DDI for each precipitant in the second step. Moreover, a rule based model is built to determine the sub-type for pharmacokinetic interation. Our system achieved best result in both task1 and task2. F-measure reaches 0.46 in task1 and 0.40 in task2., Comment: 8 pages, 6 figures
Published: 2020

4. Influence of aerosol properties and surface albedo on radiative forcing efficiency of key aerosol types using global AERONET data

Author: Chen, Annan, Zhao, Chuanfeng, Shen, Lixing, and Fan, Tianyi
Published: 2023
Full Text: View/download PDF

5. A new perspective on surface wind speed variation with respect to the contribution of sea-land breezes

Author: Shen, Lixing, Zhao, Chuanfeng, and Yang, Xingchuan
Published: 2022
Full Text: View/download PDF

6. Rhodium(III)-Catalyzed Oxidative 1,3-Aryl Migration of α‑Aryl Allylic Alcohols.

Author: Yang, Can, Tao, Jiale, Xuan, Yanshuo, Shen, Lixing, Jiang, Huanfeng, and Zeng, Wei
Published: 2024
Full Text: View/download PDF

7. Observed slump of sea land breeze in Brisbane under the effect of aerosols from remote transport during 2019 Australian mega fire events

Author: Shen, Lixing, primary, Zhao, Chuanfeng, additional, Yang, Xingchuan, additional, Yang, Yikun, additional, and Zhou, Ping, additional
Published: 2022
Full Text: View/download PDF

8. Health risk and disease burden attributable to long-term global fine-mode particles

Author: Yang, Xingchuan, primary, Wang, Yuan, additional, Zhao, Chuanfeng, additional, Fan, Hao, additional, Yang, Yikun, additional, Chi, Yulei, additional, Shen, Lixing, additional, and Yan, Xing, additional
Published: 2022
Full Text: View/download PDF

9. Climate‐Driven Characteristics of Sea‐Land Breezes Over the Globe

Author: Shen, Lixing, primary, Zhao, Chuanfeng, additional, and Yang, Xingchuan, additional
Published: 2021
Full Text: View/download PDF

10. Observed decrease of summer sea-land breeze in Shanghai from 1994 to 2014 and its association with urbanization.

Author: Shen, Lixing, Zhao, Chuanfeng, Ma, Zhanshan, Li, Zhanqing, Li, Jianping, and Wang, Kaicun
Abstract: Sea-land breeze (SLB) is widely known as a common wind circulation in coastal cities, which plays an important role to heat transport and pollution diffusion. Using 21-year observation data, a continuous decrease of SLB is discovered in a Chinese metropolis city - Shanghai. In general, the thermodynamic difference between sea and land accounts for the magnitude of SLB. However, other factors associated with urbanization could also affect the SLB significantly, such as anthropogenic aerosol loading, surface roughness, heat release, and so on. This study statistically analyzes the influence of these factors on SLB under the background of global warming. As expected, the decrease of SLB is a combined effect of global warming and great urbanization in Shanghai. The latter, which affects the circulation in both thermodynamic and dynamic ways, is consisted of three parts: temperature increase with urbanization, change of atmospheric radiative forcing, and variation of the underlying surface. At night, the urbanization effect and global warming effect play the similar role by decreasing the temperature gap between sea and land, which would cause the decrease of land wind. At day, decreasing downwelling solar radiation partially offsets the temperature increase caused by both global warming and urbanization although the increasing trend of temperature remains. Different from the expected increasing trend of sea breeze based on the increasing temperature in Shanghai, sea breeze demonstrates a decreasing trend, which could be caused by the dynamic effect from increased surface roughness in Shanghai along with the cooling contribution from radiation-influential factors such as anthropogenic aerosols. • Both the SLB days and magnitude show decreasing trends over Shanghai from 1994 to 2014. • The effects of thermodynamic and dynamic influential factors to SLB circulation are analyzed. • Urbanization likely contributes more to the decreasing trends of land winds than marine winds. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Shen, Lixing"'

1. Secrets of RLHF in Large Language Models Part II: Reward Modeling

2. Mn(I)-catalyzed sigmatropic rearrangement of β, γ-unsaturated alcohols

3. Two Step Joint Model for Drug Drug Interaction Extraction

4. Influence of aerosol properties and surface albedo on radiative forcing efficiency of key aerosol types using global AERONET data

5. A new perspective on surface wind speed variation with respect to the contribution of sea-land breezes

6. Rhodium(III)-Catalyzed Oxidative 1,3-Aryl Migration of α‑Aryl Allylic Alcohols.

7. Observed slump of sea land breeze in Brisbane under the effect of aerosols from remote transport during 2019 Australian mega fire events

8. Health risk and disease burden attributable to long-term global fine-mode particles

9. Climate‐Driven Characteristics of Sea‐Land Breezes Over the Globe

10. Observed decrease of summer sea-land breeze in Shanghai from 1994 to 2014 and its association with urbanization.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

10 results on '"Shen, Lixing"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources