Author: "Huang, Haojing" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Huang, Haojing"' showing total 17 results

Start Over Author "Huang, Haojing"

17 results on '"Huang, Haojing"'

1. Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness

Author: Li, Jian, Huang, Haojing, Zhang, Yujia, Xu, Pengfei, Chen, Xi, Song, Rui, Shi, Lida, Wang, Jingwen, and Xu, Hao
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Recently, there has been significant interest in replacing the reward model in Reinforcement Learning with Human Feedback (RLHF) methods for Large Language Models (LLMs), such as Direct Preference Optimization (DPO) and its variants. These approaches commonly use a binary cross-entropy mechanism on pairwise samples, i.e., minimizing and maximizing the loss based on preferred or dis-preferred responses, respectively. However, while this training strategy omits the reward model, it also overlooks the varying preference degrees within different responses. We hypothesize that this is a key factor hindering LLMs from sufficiently understanding human preferences. To address this problem, we propose a novel Self-supervised Preference Optimization (SPO) framework, which constructs a self-supervised preference degree loss combined with the alignment loss, thereby helping LLMs improve their ability to understand the degree of preference. Extensive experiments are conducted on two widely used datasets of different tasks. The results demonstrate that SPO can be seamlessly integrated with existing preference optimization methods and significantly boost their performance to achieve state-of-the-art performance. We also conduct detailed analyses to offer comprehensive insights into SPO, which verifies its effectiveness. The code is available at https://github.com/lijian16/SPO., Comment: Accepted at EMNLP 2024 Findings
Published: 2024

2. Do Large Language Model Understand Multi-Intent Spoken Language ?

Author: Yin, Shangjian, Huang, Peijie, Xu, Yuhong, Huang, Haojing, and Chen, Jiatian
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This research signifies a considerable breakthrough in leveraging Large Language Models (LLMs) for multi-intent spoken language understanding (SLU). Our approach re-imagines the use of entity slots in multi-intent SLU applications, making the most of the generative potential of LLMs within the SLU landscape, leading to the development of the EN-LLM series. Furthermore, we introduce the concept of Sub-Intent Instruction (SII) to amplify the analysis and interpretation of complex, multi-intent communications, which further supports the creation of the ENSI-LLM models series. Our novel datasets, identified as LM-MixATIS and LM-MixSNIPS, are synthesized from existing benchmarks. The study evidences that LLMs may match or even surpass the performance of the current best multi-intent SLU models. We also scrutinize the performance of LLMs across a spectrum of intent configurations and dataset distributions. On top of this, we present two revolutionary metrics - Entity Slot Accuracy (ESA) and Combined Semantic Accuracy (CSA) - to facilitate a detailed assessment of LLM competence in this multifaceted field." Our code and datasets are available at \url{https://github.com/SJY8460/SLM}.
Published: 2024

3. Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

Author: Xing, Peng, Li, Yinghui, Ma, Shirong, Liang, Xinnian, Huang, Haojing, Li, Yangning, Zheng, Hai-Tao, Jiang, Wenhao, and Shen, Ying
Subjects: Computer Science - Computation and Language
Abstract: Chinese Spelling Correction (CSC) aims to detect and correct spelling errors in given sentences. Recently, multi-domain CSC has gradually attracted the attention of researchers because it is more practicable. In this paper, we focus on the key flaw of the CSC model when adapting to multi-domain scenarios: the tendency to forget previously acquired knowledge upon learning new domain-specific knowledge (i.e., catastrophic forgetting). To address this, we propose a novel model-agnostic Multi-stage Knowledge Transfer (MKT) framework, which utilizes a continuously evolving teacher model for knowledge transfer in each domain, rather than focusing solely on new domain knowledge. It deserves to be mentioned that we are the first to apply continual learning methods to the multi-domain CSC task. Experiments prove the effectiveness of our proposed method, and further analyses demonstrate the importance of overcoming catastrophic forgetting for improving the model performance.
Published: 2024

4. Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

Author: Li, Yinghui, Qin, Shang, Huang, Haojing, Li, Yangning, Qin, Libo, Hu, Xuming, Jiang, Wenhao, Zheng, Hai-Tao, and Yu, Philip S.
Subjects: Computer Science - Computation and Language
Abstract: Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challenging task focus. To promote the CGEC field to better adapt to the era of LLMs, we rethink the roles of LLMs in the CGEC task so that they can be better utilized and explored in CGEC. Considering the rich grammatical knowledge stored in LLMs and their powerful semantic understanding capabilities, we utilize LLMs as explainers to provide explanation information for the CGEC small models during error correction to enhance performance. We also use LLMs as evaluators to bring more reasonable CGEC evaluations, thus alleviating the troubles caused by the subjectivity of the CGEC task. In particular, our work is also an active exploration of how LLMs and small models better collaborate in downstream tasks. Extensive experiments and detailed analyses on widely used datasets verify the effectiveness of our thinking intuition and the proposed methods.
Published: 2024

5. Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters

Author: Li, Yinghui, Xu, Zishan, Chen, Shaoshen, Huang, Haojing, Li, Yangning, Jiang, Yong, Li, Zhongli, Zhou, Qingyu, Zheng, Hai-Tao, and Shen, Ying
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: Writing assistance is an application closely related to human life and is also a fundamental Natural Language Processing (NLP) research field. Its aim is to improve the correctness and quality of input texts, with character checking being crucial in detecting and correcting wrong characters. From the perspective of the real world where handwriting occupies the vast majority, characters that humans get wrong include faked characters (i.e., untrue characters created due to writing errors) and misspelled characters (i.e., true characters used incorrectly due to spelling errors). However, existing datasets and related studies only focus on misspelled characters mainly caused by phonological or visual confusion, thereby ignoring faked characters which are more common and difficult. To break through this dilemma, we present Visual-C$^3$, a human-annotated Visual Chinese Character Checking dataset with faked and misspelled Chinese characters. To the best of our knowledge, Visual-C$^3$ is the first real-world visual and the largest human-crafted dataset for the Chinese character checking scenario. Additionally, we also propose and evaluate novel baseline methods on Visual-C$^3$. Extensive empirical results and analyses show that Visual-C$^3$ is high-quality yet challenging. The Visual-C$^3$ dataset and the baseline methods will be publicly available to facilitate further research in the community., Comment: Work in progress
Published: 2023

6. A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check

Author: Huang, Haojing, Ye, Jingheng, Zhou, Qingyu, Li, Yinghui, Li, Yangning, Zhou, Feng, and Zheng, Hai-Tao
Subjects: Computer Science - Computation and Language
Abstract: In recent years, Chinese Spelling Check (CSC) has been greatly improved by designing task-specific pre-training methods or introducing auxiliary tasks, which mostly solve this task in an end-to-end fashion. In this paper, we propose to decompose the CSC workflow into detection, reasoning, and searching subtasks so that the rich external knowledge about the Chinese language can be leveraged more directly and efficiently. Specifically, we design a plug-and-play detection-and-reasoning module that is compatible with existing SOTA non-autoregressive CSC models to further boost their performance. We find that the detection-and-reasoning module trained for one model can also benefit other models. We also study the primary interpretability provided by the task decomposition. Extensive experiments and detailed analyses demonstrate the effectiveness and competitiveness of the proposed module., Comment: Accepted for publication in Findings of EMNLP 2023
Published: 2023

7. On the (In)Effectiveness of Large Language Models for Chinese Text Correction

Author: Li, Yinghui, Huang, Haojing, Ma, Shirong, Jiang, Yong, Li, Yangning, Zhou, Feng, Zheng, Hai-Tao, and Zhou, Qingyu
Subjects: Computer Science - Computation and Language
Abstract: Recently, the development and progress of Large Language Models (LLMs) have amazed the entire Artificial Intelligence community. Benefiting from their emergent abilities, LLMs have attracted more and more researchers to study their capabilities and performance on various downstream Natural Language Processing (NLP) tasks. While marveling at LLMs' incredible performance on all kinds of tasks, we notice that they also have excellent multilingual processing capabilities, such as Chinese. To explore the Chinese processing ability of LLMs, we focus on Chinese Text Correction, a fundamental and challenging Chinese NLP task. Specifically, we evaluate various representative LLMs on the Chinese Grammatical Error Correction (CGEC) and Chinese Spelling Check (CSC) tasks, which are two main Chinese Text Correction scenarios. Additionally, we also fine-tune LLMs for Chinese Text Correction to better observe the potential capabilities of LLMs. From extensive analyses and comparisons with previous state-of-the-art small models, we empirically find that the LLMs currently have both amazing performance and unsatisfactory behavior for Chinese Text Correction. We believe our findings will promote the landing and application of LLMs in the Chinese NLP community., Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible
Published: 2023

8. Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction

Author: Li, Yinghui, Ma, Shirong, Chen, Shaoshen, Huang, Haojing, Huang, Shulin, Li, Yangning, Zheng, Hai-Tao, and Shen, Ying
Subjects: Computer Science - Computation and Language
Abstract: Chinese Text Error Correction (CTEC) aims to detect and correct errors in the input text, which benefits human daily life and various downstream tasks. Recent approaches mainly employ Pre-trained Language Models (PLMs) to resolve CTEC. Although PLMs have achieved remarkable success in CTEC, we argue that previous studies still overlook the importance of human thinking patterns. To enhance the development of PLMs for CTEC, inspired by humans' daily error-correcting behavior, we propose a novel model-agnostic progressive learning framework, named ProTEC, which guides PLMs-based CTEC models to learn to correct like humans. During the training process, ProTEC guides the model to learn text error correction by incorporating these sub-tasks into a progressive paradigm. During the inference process, the model completes these sub-tasks in turn to generate the correction results. Extensive experiments and detailed analyses demonstrate the effectiveness and efficiency of our proposed model-agnostic ProTEC framework.
Published: 2023

9. Influence of reverse saturable absorption effect on conventional and dissipative solitons fiber lasers

Author: Wang, Gang, Ma, Yuxuan, Shang, Ce, Huang, Haojing, Lu, Zherui, Wang, Shuaixin, Sun, Jingxuan, Zhang, Chenghong, and Fu, Bo
Published: 2021
Full Text: View/download PDF

10. A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check

Author: Huang, Haojing, primary, Ye, Jingheng, additional, Zhou, Qingyu, additional, Li, Yinghui, additional, Li, Yangning, additional, Zhou, Feng, additional, and Zheng, Hai-Tao, additional
Published: 2023
Full Text: View/download PDF

11. A Graph Attention Interactive Refine Framework with Contextual Regularization for Jointing Intent Detection and Slot Filling

Author: Zhu, Zhanbiao, primary, Huang, Peijie, additional, Huang, Haojing, additional, Liu, Shudong, additional, and Lao, Leyi, additional
Published: 2022
Full Text: View/download PDF

12. CLID: A Chunk-Level Intent Detection Framework for Multiple Intent Spoken Language Understanding

Author: Huang, Haojing, primary, Huang, Peijie, additional, Zhu, Zhanbiao, additional, Li, Jia, additional, and Lin, Piyuan, additional
Published: 2022
Full Text: View/download PDF

13. Constructing Topic Models of Internet of Things for Information Processing

Author: Xin, Jie, primary, Cui, Zhiming, additional, Zhang, Shukui, additional, He, Tianxu, additional, Li, Chunhua, additional, and Huang, Haojing, additional
Published: 2014
Full Text: View/download PDF

14. A Spread Willingness Computing-Based Information Dissemination Model

Author: Huang, Haojing, primary, Cui, Zhiming, additional, and Zhang, Shukui, additional
Published: 2014
Full Text: View/download PDF

15. A Routing Algorithm Based on Dynamic Forecast of Vehicle Speed and Position in VANET

Author: Huang, Haojing, primary and Zhang, Shukui, additional
Published: 2013
Full Text: View/download PDF

16. Constructing topic models of Internet of Things for information processing.

Author: Xin J, Cui Z, Zhang S, He T, Li C, and Huang H
Subjects: Algorithms, Electronic Data Processing, Internet, Models, Theoretical
Abstract: Internet of Things (IoT) is regarded as a remarkable development of the modern information technology. There is abundant digital products data on the IoT, linking with multiple types of objects/entities. Those associated entities carry rich information and usually in the form of query records. Therefore, constructing high quality topic hierarchies that can capture the term distribution of each product record enables us to better understand users' search intent and benefits tasks such as taxonomy construction, recommendation systems, and other communications solutions for the future IoT. In this paper, we propose a novel record entity topic model (RETM) for IoT environment that is associated with a set of entities and records and a Gibbs sampling-based algorithm is proposed to learn the model. We conduct extensive experiments on real-world datasets and compare our approach with existing methods to demonstrate the advantage of our approach.
Published: 2014
Full Text: View/download PDF

17. A spread willingness computing-based information dissemination model.

Author: Huang H, Cui Z, and Zhang S
Subjects: Algorithms, Humans, Information Dissemination, Models, Theoretical, Social Networking
Abstract: This paper constructs a kind of spread willingness computing based on information dissemination model for social network. The model takes into account the impact of node degree and dissemination mechanism, combined with the complex network theory and dynamics of infectious diseases, and further establishes the dynamical evolution equations. Equations characterize the evolutionary relationship between different types of nodes with time. The spread willingness computing contains three factors which have impact on user's spread behavior: strength of the relationship between the nodes, views identity, and frequency of contact. Simulation results show that different degrees of nodes show the same trend in the network, and even if the degree of node is very small, there is likelihood of a large area of information dissemination. The weaker the relationship between nodes, the higher probability of views selection and the higher the frequency of contact with information so that information spreads rapidly and leads to a wide range of dissemination. As the dissemination probability and immune probability change, the speed of information dissemination is also changing accordingly. The studies meet social networking features and can help to master the behavior of users and understand and analyze characteristics of information dissemination in social network.
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

17 results on '"Huang, Haojing"'

1. Self-supervised Preference Optimization: Enhance Your Language Model with Preference Degree Awareness

2. Do Large Language Model Understand Multi-Intent Spoken Language ?

3. Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

4. Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

5. Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters

6. A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check

7. On the (In)Effectiveness of Large Language Models for Chinese Text Correction

8. Correct Like Humans: Progressive Learning Framework for Chinese Text Error Correction

9. Influence of reverse saturable absorption effect on conventional and dissipative solitons fiber lasers

10. A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check

11. A Graph Attention Interactive Refine Framework with Contextual Regularization for Jointing Intent Detection and Slot Filling

12. CLID: A Chunk-Level Intent Detection Framework for Multiple Intent Spoken Language Understanding

13. Constructing Topic Models of Internet of Things for Information Processing

14. A Spread Willingness Computing-Based Information Dissemination Model

15. A Routing Algorithm Based on Dynamic Forecast of Vehicle Speed and Position in VANET

16. Constructing topic models of Internet of Things for information processing.

17. A spread willingness computing-based information dissemination model.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

17 results on '"Huang, Haojing"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources