Author: "Yu, Fangxu" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yu, Fangxu"' showing total 5 results

Start Over Author "Yu, Fangxu"

5 results on '"Yu, Fangxu"'

1. Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples

Author: Yu, Fangxu, Jiang, Lai, Kang, Haoqiang, Hao, Shibo, and Qin, Lianhui
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: The ability to generate diverse solutions to a given problem is a hallmark of human creativity. This divergent reasoning is also crucial for machines, enhancing their robustness and enabling them to assist humans in many applications such as scientific discovery. However, existing approaches to multi-step reasoning with large language models (LLMs) have mostly focused only on reasoning accuracy, without further discovering more diverse valid solutions. For example, supervised fine-tuning can improve LLM reasoning quality, but requires extensive supervised data to capture the full range of possible solutions. Reinforcement learning aims to find limited highest-reward solutions while neglecting the solution diversity. To fill this gap, we propose Flow of Reasoning (FoR), an efficient diversity-seeking LLM finetuning method aimed at improving reasoning quality and diversity with minimal data. FoR formulates multi-step LLM reasoning as a Markovian flow on a DAG-structured reasoning graph. This formulation allows us to incorporate and adapt principled GFlowNet approaches, for finetuning LLMs to sample diverse reasoning paths with probabilities proportional to the (unnormalized) reward of target problems. Extensive experiments show that, with limited training examples (e.g., 15 examples), FoR enables the discovery of diverse, creative, high-quality solutions, greatly outperforming a wide range of existing inference and training methods across five challenging puzzle-solving tasks, including BlocksWorld (embodied reasoning), Game24 (math puzzle solving), Rubik's Cube (spatial reasoning), 1D-ARC (abstraction reasoning), and PrOntoQA (logical reasoning). Code is available at https://github.com/Yu-Fangxu/FoR.
Published: 2024

2. Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation

Author: Yu, Fangxu, Guo, Junjie, Wu, Zhen, and Dai, Xinyu
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Emotion Recognition in Conversation (ERC) involves detecting the underlying emotion behind each utterance within a conversation. Effectively generating representations for utterances remains a significant challenge in this task. Recent works propose various models to address this issue, but they still struggle with differentiating similar emotions such as excitement and happiness. To alleviate this problem, We propose an Emotion-Anchored Contrastive Learning (EACL) framework that can generate more distinguishable utterance representations for similar emotions. To achieve this, we utilize label encodings as anchors to guide the learning of utterance representations and design an auxiliary loss to ensure the effective separation of anchors for similar emotions. Moreover, an additional adaptation process is proposed to adapt anchors to serve as effective classifiers to improve classification performance. Across extensive experiments, our proposed EACL achieves state-of-the-art emotion recognition performance and exhibits superior performance on similar emotions. Our code is available at https://github.com/Yu-Fangxu/EACL., Comment: Accepted by Findings of NAACL 2024
Published: 2024

3. COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Author: Guo, Xingang, Yu, Fangxu, Zhang, Huan, Qin, Lianhui, and Hu, Bin
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Jailbreaks on large language models (LLMs) have recently received increasing attention. For a comprehensive assessment of LLM safety, it is essential to consider jailbreaks with diverse attributes, such as contextual coherence and sentiment/stylistic variations, and hence it is beneficial to study controllable jailbreaking, i.e. how to enforce control on LLM attacks. In this paper, we formally formulate the controllable attack generation problem, and build a novel connection between this problem and controllable text generation, a well-explored topic of natural language processing. Based on this connection, we adapt the Energy-based Constrained Decoding with Langevin Dynamics (COLD), a state-of-the-art, highly efficient algorithm in controllable text generation, and introduce the COLD-Attack framework which unifies and automates the search of adversarial LLM attacks under a variety of control requirements such as fluency, stealthiness, sentiment, and left-right-coherence. The controllability enabled by COLD-Attack leads to diverse new jailbreak scenarios which not only cover the standard setting of generating fluent (suffix) attack with continuation constraint, but also allow us to address new controllable attack settings such as revising a user query adversarially with paraphrasing constraint, and inserting stealthy attacks in context with position constraint. Our extensive experiments on various LLMs (Llama-2, Mistral, Vicuna, Guanaco, GPT-3.5, and GPT-4) show COLD-Attack's broad applicability, strong controllability, high success rate, and attack transferability. Our code is available at https://github.com/Yu-Fangxu/COLD-Attack., Comment: Accepted to ICML 2024
Published: 2024

4. Bidirectional recurrent gamma belief network for HRRP target recognition

Author: Chen, Wenchao, primary, Chen, Bo, additional, Liu, Yicheng, additional, Peng, Xiaojun, additional, Fan, Haoyang, additional, Yu, Fangxu, additional, and Liu, Hongwei, additional
Published: 2021
Full Text: View/download PDF

5. NTIRE 2021 Challenge on Video Super-Resolution

Author: Son, Sanghyun, primary, Lee, Suyoung, additional, Nah, Seungjun, additional, Timofte, Radu, additional, Lee, Kyoung Mu, additional, Chan, Kelvin C. K., additional, Zhou, Shangchen, additional, Xu, Xiangyu, additional, Loy, Chen Change, additional, Jiang, Boyuan, additional, Lin, Chuming, additional, Dong, Yuchun, additional, Luo, Donghao, additional, Chu, Wenqing, additional, Ji, Xiaozhong, additional, Yang, Siqian, additional, Tai, Ying, additional, Wang, Chengjie, additional, Li, Jilin, additional, Huang, Feiyue, additional, Chen, Chengpeng, additional, Chu, Xiaojie, additional, Zhang, Jie, additional, Lu, Xin, additional, Chen, Liangyu, additional, Lin, Jing, additional, Du, Guodong, additional, Hao, Jia, additional, Zou, Xueyi, additional, Zhang, Qi, additional, Jiang, Lielin, additional, Li, Xin, additional, Zheng, He, additional, Liu, Fanglong, additional, He, Dongliang, additional, Li, Fu, additional, Dang, Qingqing, additional, Yi, Peng, additional, Wang, Zhongyuan, additional, Jiang, Kui, additional, Jiang, Junjun, additional, Ma, Jiayi, additional, Chen, Yuxiang, additional, Wang, Yutong, additional, Liu, Ting, additional, Sun, Qichao, additional, Liang, Huanwei, additional, Li, Yiming, additional, Li, Zekun, additional, Ruan, Zhubo, additional, Shang, Fanjie, additional, Guo, Chen, additional, Li, Haining, additional, Luo, Renjun, additional, Shen, Longjie, additional, Zafirouli, Kassiani, additional, Karageorgos, Konstantinos, additional, Konstantoudakis, Konstantinos, additional, Dimou, Anastasios, additional, Daras, Petros, additional, Song, Xiaowei, additional, Zhuo, Xu, additional, Liu, Hanxi, additional, Guo, Mengxi, additional, Li, Junlin, additional, Li, Yu, additional, Zhu, Ye, additional, Wang, Qing, additional, Zhao, Shijie, additional, Sun, Xiaopeng, additional, Zhan, Gen, additional, Xie, Tangxin, additional, Jia, Yu, additional, Lu, Yunhua, additional, Zhang, Wenhao, additional, Sun, Mengdi, additional, Michelini, Pablo Navarrete, additional, Zhang, Xueheng, additional, Jiang, Hao, additional, Chen, Zhiyu, additional, Chen, Li, additional, Xiong, Zhiwei, additional, Xiao, Zeyu, additional, Xu, Ruikang, additional, Cheng, Zhen, additional, Fu, Xueyang, additional, Song, Fenglong, additional, Luo, Zhipeng, additional, Yao, Yuehan, additional, Dutta, Saikat, additional, Shah, Nisarg A., additional, Dipta Das, Sourya, additional, Zhao, Peng, additional, Shi, Yukai, additional, Liu, Hongying, additional, Shang, Fanhua, additional, Liu, Yuanyuan, additional, Chen, Fei, additional, Yu, Fangxu, additional, Gao, Ruisheng, additional, Bai, Yixin, additional, Heo, Jeonghwan, additional, Yue, Shijie, additional, Li, Chenghua, additional, Li, Jinjing, additional, Zheng, Qian, additional, Gang, Ruipeng, additional, Song, Ruixia, additional, Wee, Seungwoo, additional, Jeong, Jechang, additional, Li, Chen, additional, Wen, Geyingjie, additional, Chai, Xinning, additional, and Song, Li, additional
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

5 results on '"Yu, Fangxu"'

1. Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples

2. Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation

3. COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

4. Bidirectional recurrent gamma belief network for HRRP target recognition

5. NTIRE 2021 Challenge on Video Super-Resolution

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

5 results on '"Yu, Fangxu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources