Author: "Wu, Shujin" / Publication Year Range: Last 3 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wu, Shujin"' showing total 11 results

Start Over Author "Wu, Shujin" Publication Year Range Last 3 years

11 results on '"Wu, Shujin"'

1. Aligning LLMs with Individual Preferences via Interaction

Author: Wu, Shujin, Fung, May, Qian, Cheng, Kim, Jeonghwan, Hakkani-Tur, Dilek, and Ji, Heng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: As large language models (LLMs) demonstrate increasingly advanced capabilities, aligning their behaviors with human values and preferences becomes crucial for their wide adoption. While previous research focuses on general alignment to principles such as helpfulness, harmlessness, and honesty, the need to account for individual and diverse preferences has been largely overlooked, potentially undermining customized human experiences. To address this gap, we train LLMs that can ''interact to align'', essentially cultivating the meta-skill of LLMs to implicitly infer the unspoken personalized preferences of the current user through multi-turn conversations, and then dynamically align their following behaviors and responses to these inferred preferences. Our approach involves establishing a diverse pool of 3,310 distinct user personas by initially creating seed examples, which are then expanded through iterative self-generation and filtering. Guided by distinct user personas, we leverage multi-LLM collaboration to develop a multi-turn preference dataset containing 3K+ multi-turn conversations in tree structures. Finally, we apply supervised fine-tuning and reinforcement learning to enhance LLMs using this dataset. For evaluation, we establish the ALOE (ALign With CustOmized PrEferences) benchmark, consisting of 100 carefully selected examples and well-designed metrics to measure the customized alignment performance during conversations. Experimental results demonstrate the effectiveness of our method in enabling dynamic, personalized alignment via interaction., Comment: The code and dataset are made public at https://github.com/ShujinWu-0814/ALOE
Published: 2024

2. MACAROON: Training Vision-Language Models To Be Your Engaged Partners

Author: Wu, Shujin, Fung, Yi R., Li, Sha, Wan, Yixin, Chang, Kai-Wei, and Ji, Heng
Subjects: Computer Science - Computation and Language
Abstract: Large vision-language models (LVLMs), while proficient in following instructions and responding to diverse questions, invariably generate detailed responses even when questions are ambiguous or unanswerable, leading to hallucinations and bias issues. Thus, it is essential for LVLMs to proactively engage with humans to ask for clarifications or additional information for better responses. In this study, we aim to shift LVLMs from passive answer providers to proactive engaged partners. We begin by establishing a three-tiered hierarchy for questions of invalid, ambiguous, and personalizable nature to measure the proactive engagement capabilities of LVLMs. Utilizing this hierarchy, we create PIE, (ProactIve Engagement Evaluation) through GPT-4o and human annotators, consisting of 853 questions across six distinct, fine-grained question types that are verified by human annotators and accompanied with well-defined metrics. Our evaluations on \benchmark indicate poor performance of existing LVLMs, with the best-performing open-weights model only achieving an Aggregate Align Rate (AAR) of 0.28. In response, we introduce MACAROON, self-iMaginAtion for ContrAstive pReference OptimizatiON, which instructs LVLMs to autonomously generate contrastive response pairs for unlabeled questions given the task description and human-crafted criteria. Then, the self-imagined data is formatted for conditional reinforcement learning. Experimental results show MACAROON effectively improves LVLMs' capabilities to be proactively engaged (0.84 AAR) while maintaining comparable performance on general tasks., Comment: The code will be made public at https://github.com/ShujinWu-0814/MACAROON
Published: 2024

3. SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Author: Xu, Tianyang, Wu, Shujin, Diao, Shizhe, Liu, Xiaoze, Wang, Xingyao, Chen, Yangyi, and Gao, Jing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) often generate inaccurate or fabricated information and generally fail to indicate their confidence, which limits their broader applications. Previous work elicits confidence from LLMs by direct or self-consistency prompting, or constructing specific datasets for supervised finetuning. The prompting-based approaches have inferior performance, and the training-based approaches are limited to binary or inaccurate group-level confidence estimates. In this work, we present the advanced SaySelf, a training framework that teaches LLMs to express more accurate fine-grained confidence estimates. In addition, beyond the confidence scores, SaySelf initiates the process of directing LLMs to produce self-reflective rationales that clearly identify gaps in their parametric knowledge and explain their uncertainty. This is achieved by using an LLM to automatically summarize the uncertainties in specific knowledge via natural language. The summarization is based on the analysis of the inconsistency in multiple sampled reasoning chains, and the resulting data is utilized for supervised fine-tuning. Moreover, we utilize reinforcement learning with a meticulously crafted reward function to calibrate the confidence estimates, motivating LLMs to deliver accurate, high-confidence predictions and to penalize overconfidence in erroneous outputs. Experimental results in both in-distribution and out-of-distribution datasets demonstrate the effectiveness of SaySelf in reducing the confidence calibration error and maintaining the task performance. We show that the generated self-reflective rationales are reasonable and can further contribute to the calibration. The code is made public at https://github.com/xu1868/SaySelf., Comment: EMNLP 2024 Main
Published: 2024

4. Review on flexible radiation-protective clothing materials

Author: Wu, Shujin, Bao, Jingwen, Gao, Yantao, Hu, Wenfeng, and Lu, Zan
Published: 2024
Full Text: View/download PDF

5. Research progress on the role of endoplasmic reticulum stress in osteoarthritis

Author: JIN Tao, YANG Qingshan, WU Shujin, ZHU Xiaoyan, SHI Yucong, NIU Jianxiong, LIU Lin
Subjects: osteoarthritis, chondrocyte apoptosis, endoplasmic reticulum stress, unfolded protein response, Medicine
Abstract: Osteoarthritis is a common degenerative disease in which endoplasmic reticulum stress(ER stress) plays an important role in the pathogenesis of chondrocyte apoptosis. To limit the adverse effects of endoplasmic reticulum stress, cells activate the unfolded protein response (UPR). However, when endoplasmic reticulum stress persists beyond the maximum tolerance of UPR, cells may trigger endoplasmic reticulum dysfunction through three pathways of UPR leading to chondrocyte apoptosis.
Published: 2023
Full Text: View/download PDF

6. Ultrahigh water permeance of a composite reduced graphene oxide/graphene oxide membrane for efficient rejection of dyes.

Author: Liang, Shanshan, Yang, Rujie, Di, Yingjie, Liu, Guangxiao, and Wu, Shujin
Subjects: COMPOSITE membranes (Chemistry), GRAPHENE oxide, WATER purification, LAMINATED materials, POPULARITY
Abstract: Graphene oxide (GO) laminate membranes for water purification have surged in popularity due to their hydrophilicity, high throughput and excellent separation abilities. However, concerns about swelling and stability in water persist. Herein, we prepared high stability, composite reduced graphene oxide (rGO)/graphene oxide (GO) membranes. The composite membranes (i.e. rGO/GO composite membranes) displayed excellent rejection performance for methylene blue (MB) of up to 99.0%, together with ultrahigh water permeance (201.7 L m−2 h−1 bar−1) compared to pristine GO membranes (54.8 L m−2 h−1 bar−1). This study broadens the applications of graphene-based membranes and enhances their performance in water treatment. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. Autoregressive moving average model for matrix time series

Author: Wu, Shujin, primary and Bi, Ping, additional
Published: 2023
Full Text: View/download PDF

8. Roots originating from different shoot parts are functionally different in running bamboo, Phyllostachys glauca

Author: Wang, Guangru, primary, Yu, Fen, additional, Wu, Hongyan, additional, Hu, Shuzhen, additional, Wu, Shujin, additional, Pei, Nancai, additional, Shi, Jianmin, additional, and Lambers, Hans, additional
Published: 2023
Full Text: View/download PDF

9. Analysis of Stock Splits Based on Risk Theory: Empirical Evidence from the Chinese Stock Markets

Author: WU, Shujin, primary and XU, Tong, additional
Published: 2022
Full Text: View/download PDF

10. Advanced Glycation End Products Induced Mitochondrial Dysfunction of Chondrocytes through Repression of AMPKα-SIRT1-PGC-1α Pathway

Author: Yang, Qingshan, primary, Shi, Yucong, additional, Jin, Tao, additional, Duan, Bowen, additional, and Wu, Shujin, additional
Published: 2022
Full Text: View/download PDF

11. Poisson-Gamma mixture processes and applications to premium calculation.

Author: Wu, Shujin
Subjects: *WAITING rooms, *POISSON processes
Abstract: In the paper, Poisson-Gamma mixture process is first brought forward, which is dynamically expanded from the well-known Poisson-Gamma mixture model. Some properties on Poisson-Gamma mixture process are presented, including the distribution of increment, Markov property, infinitesimal generator, joint density function of jump/waiting times, and the limit distribution of compound Poisson-Gamma mixture process, etc., which provide a thorough grounding in application of Poisson-Gamma mixture process. At last, some premium calculation principles are presented to show the application of Poisson-Gamma mixture process, which include expected value premium, stop-loss premium, mean-variance premium, and exponential premium. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

11 results on '"Wu, Shujin"'

1. Aligning LLMs with Individual Preferences via Interaction

2. MACAROON: Training Vision-Language Models To Be Your Engaged Partners

3. SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

4. Review on flexible radiation-protective clothing materials

5. Research progress on the role of endoplasmic reticulum stress in osteoarthritis

6. Ultrahigh water permeance of a composite reduced graphene oxide/graphene oxide membrane for efficient rejection of dyes.

7. Autoregressive moving average model for matrix time series

8. Roots originating from different shoot parts are functionally different in running bamboo, Phyllostachys glauca

9. Analysis of Stock Splits Based on Risk Theory: Empirical Evidence from the Chinese Stock Markets

10. Advanced Glycation End Products Induced Mitochondrial Dysfunction of Chondrocytes through Repression of AMPKα-SIRT1-PGC-1α Pathway

11. Poisson-Gamma mixture processes and applications to premium calculation.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

11 results on '"Wu, Shujin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources