Author: "Xu, Benfeng" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xu, Benfeng"' showing total 22 results

Start Over Author "Xu, Benfeng" Publication Year Range Last 10 years

22 results on '"Xu, Benfeng"'

1. Benchmarking Large Language Models on Controllable Generation under Diversified Instructions

Author: Chen, Yihan, Xu, Benfeng, Wang, Quan, Liu, Yi, and Mao, Zhendong
Subjects: Computer Science - Computation and Language
Abstract: While large language models (LLMs) have exhibited impressive instruction-following capabilities, it is still unclear whether and to what extent they can respond to explicit constraints that might be entailed in various instructions. As a significant aspect of LLM alignment, it is thus important to formulate such a specialized set of instructions as well as investigate the resulting behavior of LLMs. To address this vacancy, we propose a new benchmark CoDI-Eval to systematically and comprehensively evaluate LLMs' responses to instructions with various constraints. We construct a large collection of constraints-attributed instructions as a test suite focused on both generalization and coverage. Specifically, we advocate an instruction diversification process to synthesize diverse forms of constraint expression and also deliberate the candidate task taxonomy with even finer-grained sub-categories. Finally, we automate the entire evaluation process to facilitate further developments. Different from existing studies on controllable text generation, CoDI-Eval extends the scope to the prevalent instruction-following paradigm for the first time. We provide extensive evaluations of representative LLMs (e.g., ChatGPT, Vicuna) on CoDI-Eval, revealing their limitations in following instructions with specific constraints and there is still a significant gap between open-source and commercial closed-source LLMs. We believe this benchmark will facilitate research into improving the controllability of LLMs' responses to instructions. Our data and code are available at https://github.com/Xt-cyh/CoDI-Eval., Comment: Accepted to AAAI 2024
Published: 2024

2. On the Calibration of Large Language Models and Alignment

Author: Zhu, Chiwei, Xu, Benfeng, Wang, Quan, Zhang, Yongdong, and Mao, Zhendong
Subjects: Computer Science - Computation and Language
Abstract: As large language models attract increasing attention and find widespread application, concurrent challenges of reliability also arise at the same time. Confidence calibration, an effective analysis method for gauging the reliability of deep models, serves as a crucial tool for assessing and improving their reliability. However, such investigation has been comparatively underexplored. In this work, we conduct a systematic examination of the calibration of aligned language models throughout the entire construction process, including pretraining and alignment training. At each stage, we investigate how different training settings, such as parameter scales and training data, affect model calibration. To thoroughly assess model calibration, we evaluate models on three most concerned aspects: generation, factuality and understanding. Our work sheds light on whether popular LLMs are well-calibrated and how the training process influences model calibration., Comment: to be published in findings of EMNLP-2023
Published: 2023

3. Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

Author: Wu, Shengguang, Lu, Keming, Xu, Benfeng, Lin, Junyang, Su, Qi, and Zhou, Chang
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Enhancing the instruction-following ability of Large Language Models (LLMs) primarily demands substantial instruction-tuning datasets. However, the sheer volume of these imposes a considerable computational burden and annotation cost. To investigate a label-efficient instruction tuning method that allows the model itself to actively sample subsets that are equally or even more effective, we introduce a self-evolving mechanism DiverseEvol. In this process, a model iteratively augments its training subset to refine its own performance, without requiring any intervention from humans or more advanced LLMs. The key to our data sampling technique lies in the enhancement of diversity in the chosen subsets, as the model selects new data points most distinct from any existing ones according to its current embedding space. Extensive experiments across three datasets and benchmarks demonstrate the effectiveness of DiverseEvol. Our models, trained on less than 8% of the original dataset, maintain or improve performance compared with finetuning on full data. We also provide empirical evidence to analyze the importance of diversity in instruction data and the iterative scheme as opposed to one-time sampling. Our code is publicly available at https://github.com/OFA-Sys/DiverseEvol.git.
Published: 2023

4. Qwen Technical Report

Author: Bai, Jinze, Bai, Shuai, Chu, Yunfei, Cui, Zeyu, Dang, Kai, Deng, Xiaodong, Fan, Yang, Ge, Wenbin, Han, Yu, Huang, Fei, Hui, Binyuan, Ji, Luo, Li, Mei, Lin, Junyang, Lin, Runji, Liu, Dayiheng, Liu, Gao, Lu, Chengqiang, Lu, Keming, Ma, Jianxin, Men, Rui, Ren, Xingzhang, Ren, Xuancheng, Tan, Chuanqi, Tan, Sinan, Tu, Jianhong, Wang, Peng, Wang, Shijie, Wang, Wei, Wu, Shengguang, Xu, Benfeng, Xu, Jin, Yang, An, Yang, Hao, Yang, Jian, Yang, Shusheng, Yao, Yang, Yu, Bowen, Yuan, Hongyi, Yuan, Zheng, Zhang, Jianwei, Zhang, Xingxuan, Zhang, Yichang, Zhang, Zhenru, Zhou, Chang, Zhou, Jingren, Zhou, Xiaohuan, and Zhu, Tianhang
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have revolutionized the field of artificial intelligence, enabling natural language processing tasks that were previously thought to be exclusive to humans. In this work, we introduce Qwen, the first installment of our large language model series. Qwen is a comprehensive language model series that encompasses distinct models with varying parameter counts. It includes Qwen, the base pretrained language models, and Qwen-Chat, the chat models finetuned with human alignment techniques. The base language models consistently demonstrate superior performance across a multitude of downstream tasks, and the chat models, particularly those trained using Reinforcement Learning from Human Feedback (RLHF), are highly competitive. The chat models possess advanced tool-use and planning capabilities for creating agent applications, showcasing impressive performance even when compared to bigger models on complex tasks like utilizing a code interpreter. Furthermore, we have developed coding-specialized models, Code-Qwen and Code-Qwen-Chat, as well as mathematics-focused models, Math-Qwen-Chat, which are built upon base language models. These models demonstrate significantly improved performance in comparison with open-source models, and slightly fall behind the proprietary models., Comment: 59 pages, 5 figures
Published: 2023

5. ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

Author: Xu, Benfeng, Yang, An, Lin, Junyang, Wang, Quan, Zhou, Chang, Zhang, Yongdong, and Mao, Zhendong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The answering quality of an aligned large language model (LLM) can be drastically improved if treated with proper crafting of prompts. In this paper, we propose ExpertPrompting to elicit the potential of LLMs to answer as distinguished experts. We first utilize In-Context Learning to automatically synthesize detailed and customized descriptions of the expert identity for each specific instruction, and then ask LLMs to provide answer conditioned on such agent background. Based on this augmented prompting strategy, we produce a new set of instruction-following data using GPT-3.5, and train a competitive open-source chat assistant called ExpertLLaMA. We employ GPT4-based evaluation to show that 1) the expert data is of significantly higher quality than vanilla answers, and 2) ExpertLLaMA outperforms existing open-source opponents and achieves 96\% of the original ChatGPT's capability. All data and the ExpertLLaMA model will be made publicly available at \url{https://github.com/OFA-Sys/ExpertLLaMA}.
Published: 2023

6. $k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference

Author: Xu, Benfeng, Wang, Quan, Mao, Zhendong, Lyu, Yajuan, She, Qiaoqiao, and Zhang, Yongdong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In-Context Learning (ICL), which formulates target tasks as prompt completion conditioned on in-context demonstrations, has become the prevailing utilization of LLMs. In this paper, we first disclose an actual predicament for this typical usage that it can not scale up with training data due to context length restriction. Besides, existing works have shown that ICL also suffers from various biases and requires delicate calibration treatment. To address both challenges, we advocate a simple and effective solution, $k$NN Prompting, which first queries LLM with training data for distributed representations, then predicts test instances by simply referring to nearest neighbors. We conduct comprehensive experiments to demonstrate its two-fold superiority: 1) Calibration-Free: $k$NN Prompting does not directly align LLM output distribution with task-specific label space, instead leverages such distribution to align test and training instances. It significantly outperforms state-of-the-art calibration-based methods under comparable few-shot scenario. 2) Beyond-Context: $k$NN Prompting can further scale up effectively with as many training data as are available, continually bringing substantial improvements. The scaling trend holds across 10 orders of magnitude ranging from 2 shots to 1024 shots as well as different LLMs scales ranging from 0.8B to 30B. It successfully bridges data scaling into model scaling, and brings new potentials for the gradient-free paradigm of LLM deployment. Code is publicly available., Comment: ICLR 2023. Code is available at https://github.com/BenfengXu/KNNPrompting
Published: 2023

7. UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction

Author: Tang, Wei, Xu, Benfeng, Zhao, Yuyue, Mao, Zhendong, Liu, Yifeng, Liao, Yong, and Xie, Haiyong
Subjects: Computer Science - Computation and Language
Abstract: Relational triple extraction is challenging for its difficulty in capturing rich correlations between entities and relations. Existing works suffer from 1) heterogeneous representations of entities and relations, and 2) heterogeneous modeling of entity-entity interactions and entity-relation interactions. Therefore, the rich correlations are not fully exploited by existing works. In this paper, we propose UniRel to address these challenges. Specifically, we unify the representations of entities and relations by jointly encoding them within a concatenated natural language sequence, and unify the modeling of interactions with a proposed Interaction Map, which is built upon the off-the-shelf self-attention mechanism within any Transformer block. With comprehensive experiments on two popular relational triple extraction datasets, we demonstrate that UniRel is more effective and computationally efficient. The source code is available at https://github.com/wtangdev/UniRel., Comment: Accepted at EMNLP 2022. Camera-ready version
Published: 2022

8. Building Chinese Biomedical Language Models via Multi-Level Text Discrimination

Author: Wang, Quan, Dai, Songtai, Xu, Benfeng, Lyu, Yajuan, Zhu, Yong, Wu, Hua, and Wang, Haifeng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Pre-trained language models (PLMs), such as BERT and GPT, have revolutionized the field of NLP, not only in the general domain but also in the biomedical domain. Most prior efforts in building biomedical PLMs have resorted simply to domain adaptation and focused mainly on English. In this work we introduce eHealth, a Chinese biomedical PLM built from scratch with a new pre-training framework. This new framework pre-trains eHealth as a discriminator through both token- and sequence-level discrimination. The former is to detect input tokens corrupted by a generator and recover their original identities from plausible candidates, while the latter is to further distinguish corruptions of a same original sequence from those of others. As such, eHealth can learn language semantics at both token and sequence levels. Extensive experiments on 11 Chinese biomedical language understanding tasks of various forms verify the effectiveness and superiority of our approach. We release the pre-trained model at \url{https://github.com/PaddlePaddle/Research/tree/master/KG/eHealth} and will also release the code later.
Published: 2021

9. Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

Author: Xu, Benfeng, Wang, Quan, Lyu, Yajuan, Zhu, Yong, and Mao, Zhendong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Entities, as the essential elements in relation extraction tasks, exhibit certain structure. In this work, we formulate such structure as distinctive dependencies between mention pairs. We then propose SSAN, which incorporates these structural dependencies within the standard self-attention mechanism and throughout the overall encoding stage. Specifically, we design two alternative transformation modules inside each self-attention building block to produce attentive biases so as to adaptively regularize its attention flow. Our experiments demonstrate the usefulness of the proposed entity structure and the effectiveness of SSAN. It significantly outperforms competitive baselines, achieving new state-of-the-art results on three popular document-level relation extraction datasets. We further provide ablation and visualization to show how the entity structure guides the model for better relation extraction. Our code is publicly available., Comment: Accepted to AAAI 2021
Published: 2021

10. Endoscopic versus minimally invasive surgical approach for infected necrotizing pancreatitis: a systematic review and meta-analysis of randomized controlled trials

Author: Tang, Penghao, primary, Ali, Kamran, additional, Khizar, Hayat, additional, Ni, Yuanzhi, additional, Cheng, Zhiwen, additional, Xu, Benfeng, additional, Qin, Zhiwen, additional, and Zhang, Wu, additional
Published: 2023
Full Text: View/download PDF

11. Modaldrop: Modality-Aware Regularization for Temporal-Spectral Fusion in Human Activity Recognition

Author: Zeng, Xin, primary, Chen, Yiqiang, additional, Xu, Benfeng, additional, and Zhang, Tengxiang, additional
Published: 2023
Full Text: View/download PDF

12. On the Calibration of Large Language Models and Alignment

Author: Zhu, Chiwei, primary, Xu, Benfeng, additional, Wang, Quan, additional, Zhang, Yongdong, additional, and Mao, Zhendong, additional
Published: 2023
Full Text: View/download PDF

13. S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction

Author: Xu, Benfeng, primary, Wang, Quan, additional, Lyu, Yajuan, additional, Dai, Dai, additional, Zhang, Yongdong, additional, and Mao, Zhendong, additional
Published: 2023
Full Text: View/download PDF

14. Retrieval-Augmented Domain Adaptation of Language Models

Author: Xu, Benfeng, primary, Zhao, Chunxu, additional, Jiang, Wenbin, additional, Zhu, PengFei, additional, Dai, Songtai, additional, Pang, Chao, additional, Sun, Zhuo, additional, Wang, Shuohuan, additional, and Sun, Yu, additional
Published: 2023
Full Text: View/download PDF

15. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn2+ Homeostasis

Author: Xie, Shiyi, primary, Xu, Benfeng, additional, Tang, Rui, additional, Chen, Siyu, additional, Lei, Chunyang, additional, and Nie, Zhou, additional
Published: 2022
Full Text: View/download PDF

16. UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction

Author: Tang, Wei, primary, Xu, Benfeng, additional, Zhao, Yuyue, additional, Mao, Zhendong, additional, Liu, Yifeng, additional, Liao, Yong, additional, and Xie, Haiyong, additional
Published: 2022
Full Text: View/download PDF

17. EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction

Author: Xu, Benfeng, primary, Wang, Quan, additional, Lyu, Yajuan, additional, Shi, Yabing, additional, Zhu, Yong, additional, Gao, Jie, additional, and Mao, Zhendong, additional
Published: 2022
Full Text: View/download PDF

18. Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

Author: Xu, Benfeng, primary, Wang, Quan, additional, Lyu, Yajuan, additional, Zhu, Yong, additional, and Mao, Zhendong, additional
Published: 2021
Full Text: View/download PDF

19. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn2+ Homeostasis.

Author: Xie, Shiyi, Xu, Benfeng, Tang, Rui, Chen, Siyu, Lei, Chunyang, and Nie, Zhou
Published: 2022
Full Text: View/download PDF

20. Review and Arrange: Curriculum Learning for Natural Language Understanding

Author: Zhang, Licheng, primary, Mao, Zhendong, additional, Xu, Benfeng, additional, Wang, Quan, additional, and Zhang, Yongdong, additional
Published: 2021
Full Text: View/download PDF

21. Curriculum Learning for Natural Language Understanding

Author: Xu, Benfeng, primary, Zhang, Licheng, additional, Mao, Zhendong, additional, Wang, Quan, additional, Xie, Hongtao, additional, and Zhang, Yongdong, additional
Published: 2020
Full Text: View/download PDF

22. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn 2+ Homeostasis.

Author: Xie S, Xu B, Tang R, Chen S, Lei C, and Nie Z
Subjects: Homeostasis, Kinetics, RNA, Guide, CRISPR-Cas Systems genetics, CRISPR-Associated Proteins genetics, CRISPR-Cas Systems genetics
Abstract: The CRISPR/Cas12a system has been repurposed as a versatile nuclei acid bio-imaging tool, but its utility in sensing non-nucleic acid analytes in living cells has been less exploited. Herein, we demonstrated the ability of Mn 2+ to accelerate cleavage kinetics of Cas12a and deployed for live-cell Mn 2+ sensing by leveraging the accelerated trans-cleavage for signal reporting. In this work, we found that Mn 2+ could significantly boost both the cis-cleavage and trans-cleavage activities of Cas12a. On the basis of this phenomenon, we harnessed CRISPR-Cas12a as a direct sensing system for Mn 2+ , which achieved robust Mn 2+ detection in the concentration range of 0.5-700 μM within 15 min in complex biological samples. Furthermore, we also demonstrated the versatility of this system to sense Mn 2+ in the cytoplasm of living cells. With the usage of a conditional guide RNA, this Cas12a-based sensing method was applied to study the cytotoxicity of Mn 2+ in living nerve cells, offering a valuable tool to reveal the cellular response of nerve cells to Mn 2+ disorder and homeostasis.
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

22 results on '"Xu, Benfeng"'

1. Benchmarking Large Language Models on Controllable Generation under Diversified Instructions

2. On the Calibration of Large Language Models and Alignment

3. Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

4. Qwen Technical Report

5. ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

6. $k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference

7. UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction

8. Building Chinese Biomedical Language Models via Multi-Level Text Discrimination

9. Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

10. Endoscopic versus minimally invasive surgical approach for infected necrotizing pancreatitis: a systematic review and meta-analysis of randomized controlled trials

11. Modaldrop: Modality-Aware Regularization for Temporal-Spectral Fusion in Human Activity Recognition

12. On the Calibration of Large Language Models and Alignment

13. S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction

14. Retrieval-Augmented Domain Adaptation of Language Models

15. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn2+ Homeostasis

16. UniRel: Unified Representation and Interaction for Joint Relational Triple Extraction

17. EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction

18. Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

19. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn2+ Homeostasis.

20. Review and Arrange: Curriculum Learning for Natural Language Understanding

21. Curriculum Learning for Natural Language Understanding

22. Kinetics Accelerated CRISPR-Cas12a Enabling Live-Cell Monitoring of Mn 2+ Homeostasis.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

22 results on '"Xu, Benfeng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources