Author: "Zhang, Sen" - Searchworks@Jio Institute Digital Library Search Results

Your search for "Zhang, Sen" returned 5,863 results

1. Aligning Few-Step Diffusion Models with Dense Reward Difference Learning

Author: Zhang, Ziyi, Shen, Li, Zhang, Sen, Ye, Deheng, Luo, Yong, Shi, Miaojing, Du, Bo, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Aligning diffusion models with downstream objectives is essential for their practical applications. However, standard alignment methods often struggle with step generalization when directly applied to few-step diffusion models, leading to inconsistent performance across different denoising step scenarios. To address this, we introduce Stepwise Diffusion Policy Optimization (SDPO), a novel alignment method tailored for few-step diffusion models. Unlike prior approaches that rely on a single sparse reward from only the final step of each denoising trajectory for trajectory-level optimization, SDPO incorporates dense reward feedback at every intermediate step. By learning the differences in dense rewards between paired samples, SDPO facilitates stepwise optimization of few-step diffusion models, ensuring consistent alignment across all denoising steps. To promote stable and efficient training, SDPO introduces an online reinforcement learning framework featuring several novel strategies designed to effectively exploit the stepwise granularity of dense rewards. Experimental results demonstrate that SDPO consistently outperforms prior methods in reward-based alignment across diverse step configurations, underscoring its robust step generalization capabilities. Code is available at https://github.com/ZiyiZhang27/sdpo.
Published: 2024

2. Deep Learning and Machine Learning -- Natural Language Processing: From Theory to Application

Author: Chen, Keyu, Fei, Cheng, Bi, Ziqian, Liu, Junyu, Peng, Benji, Zhang, Sen, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, Yin, Caitlyn Heqi, Zhang, Yichao, Feng, Pohsun, Wen, Yizhu, Wang, Tianyang, Li, Ming, Ren, Jintao, Niu, Qian, Chen, Silin, Hsieh, Weiche, Yan, Lawrence K. Q., Liang, Chia Xin, Xu, Han, Tseng, Hong-Ming, Song, Xinyuan, and Liu, Ming
Subjects: Computer Science - Computation and Language, Computer Science - Human-Computer Interaction
Abstract: With a focus on natural language processing (NLP) and the role of large language models (LLMs), we explore the intersection of machine learning, deep learning, and artificial intelligence. As artificial intelligence continues to revolutionize fields from healthcare to finance, NLP techniques such as tokenization, text classification, and entity recognition are essential for processing and understanding human language. This paper discusses advanced data preprocessing techniques and the use of frameworks like Hugging Face for implementing transformer-based models. Additionally, it highlights challenges such as handling multilingual data, reducing bias, and ensuring model robustness. By addressing key aspects of data processing and model fine-tuning, this work aims to provide insights into deploying effective and ethically sound AI solutions., Comment: 255 pages
Published: 2024

3. Offline Behavior Distillation

Author: Lei, Shiye, Zhang, Sen, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Massive reinforcement learning (RL) data are typically collected to train policies offline without the need for interactions, but the large data volume can cause training inefficiencies. To tackle this issue, we formulate offline behavior distillation (OBD), which synthesizes limited expert behavioral data from sub-optimal RL data, enabling rapid policy learning. We propose two naive OBD objectives, DBC and PBC, which measure distillation performance via the decision difference between policies trained on distilled data and either offline data or a near-expert policy. Due to intractable bi-level optimization, the OBD objective is difficult to minimize to small values, which deteriorates PBC by its distillation performance guarantee with quadratic discount complexity $\mathcal{O}(1/(1-\gamma)^2)$. We theoretically establish the equivalence between the policy performance and action-value weighted decision difference, and introduce action-value weighted PBC (Av-PBC) as a more effective OBD objective. By optimizing the weighted decision difference, Av-PBC achieves a superior distillation guarantee with linear discount complexity $\mathcal{O}(1/(1-\gamma))$. Extensive experiments on multiple D4RL datasets reveal that Av-PBC offers significant improvements in OBD performance, fast distillation convergence speed, and robust cross-architecture/optimizer generalization., Comment: Accepted by NeurIPS 2024
Published: 2024

4. Deep Learning, Machine Learning -- Digital Signal and Image Processing: From Theory to Application

Author: Hsieh, Weiche, Bi, Ziqian, Liu, Junyu, Peng, Benji, Zhang, Sen, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, Chen, Keyu, Yin, Caitlyn Heqi, Feng, Pohsun, Wen, Yizhu, Wang, Tianyang, Li, Ming, Ren, Jintao, Niu, Qian, Chen, Silin, and Liu, Ming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Electrical Engineering and Systems Science - Image and Video Processing, Electrical Engineering and Systems Science - Signal Processing
Abstract: Digital Signal Processing (DSP) and Digital Image Processing (DIP) with Machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer Vision and related fields. We highlight transformative applications in image enhancement, filtering techniques, and pattern recognition. By integrating frameworks like the Discrete Fourier Transform (DFT), Z-Transform, and Fourier Transform methods, we enable robust data manipulation and feature extraction essential for AI-driven tasks. Using Python, we implement algorithms that optimize real-time data processing, forming a foundation for scalable, high-performance solutions in computer vision. This work illustrates the potential of ML and DL to advance DSP and DIP methodologies, contributing to artificial intelligence, automated feature extraction, and applications across diverse domains., Comment: 293 pages
Published: 2024

5. Deep Learning and Machine Learning -- Python Data Structures and Mathematics Fundamental: From Theory to Practice

Author: Chen, Silin, Bi, Ziqian, Liu, Junyu, Peng, Benji, Zhang, Sen, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, Chen, Keyu, Yin, Caitlyn Heqi, Feng, Pohsun, Wen, Yizhu, Wang, Tianyang, Li, Ming, Ren, Jintao, Niu, Qian, and Liu, Ming
Subjects: Computer Science - Machine Learning, Computer Science - Data Structures and Algorithms, Computer Science - Programming Languages
Abstract: This book provides a comprehensive introduction to the foundational concepts of machine learning (ML) and deep learning (DL). It bridges the gap between theoretical mathematics and practical application, focusing on Python as the primary programming language for implementing key algorithms and data structures. The book covers a wide range of topics, including basic and advanced Python programming, fundamental mathematical operations, matrix operations, linear algebra, and optimization techniques crucial for training ML and DL models. Advanced subjects like neural networks, optimization algorithms, and frequency domain methods are also explored, along with real-world applications of large language models (LLMs) and artificial intelligence (AI) in big data management. Designed for both beginners and advanced learners, the book emphasizes the critical role of mathematical principles in developing scalable AI solutions. Practical examples and Python code are provided throughout, ensuring readers gain hands-on experience in applying theoretical knowledge to solve complex problems in ML, DL, and big data analytics., Comment: 298 pages
Published: 2024

6. Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications

Author: Ren, Jintao, Bi, Ziqian, Niu, Qian, Liu, Junyu, Peng, Benji, Zhang, Sen, Pan, Xuanhe, Wang, Jinlang, Chen, Keyu, Yin, Caitlyn Heqi, Feng, Pohsun, Wen, Yizhu, Wang, Tianyang, Chen, Silin, Li, Ming, Xu, Jiawei, and Liu, Ming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: This book offers an in-depth exploration of object detection and semantic segmentation, combining theoretical foundations with practical applications. It covers state-of-the-art advancements in machine learning and deep learning, with a focus on convolutional neural networks (CNNs), YOLO architectures, and transformer-based approaches like DETR. The book also delves into the integration of artificial intelligence (AI) techniques and large language models for enhanced object detection in complex environments. A thorough discussion of big data analysis is presented, highlighting the importance of data processing, model optimization, and performance evaluation metrics. By bridging the gap between traditional methods and modern deep learning frameworks, this book serves as a comprehensive guide for researchers, data scientists, and engineers aiming to leverage AI-driven methodologies in large-scale object detection tasks., Comment: 167 pages
Published: 2024

7. Mastering AI: Big Data, Deep Learning, and the Evolution of Large Language Models -- Blockchain and Applications

Author: Feng, Pohsun, Bi, Ziqian, Yan, Lawrence K. Q., Wen, Yizhu, Peng, Benji, Liu, Junyu, Yin, Caitlyn Heqi, Wang, Tianyang, Chen, Keyu, Zhang, Sen, Li, Ming, Xu, Jiawei, Liu, Ming, Pan, Xuanhe, Wang, Jinlang, and Niu, Qian
Subjects: Computer Science - Cryptography and Security
Abstract: This article provides a detailed exploration of blockchain technology and its applications across various fields. It begins with an introduction to cryptography fundamentals, including symmetric and asymmetric encryption, and their roles in ensuring security and trust within blockchain systems. The article then delves into the structure and mechanics of Bitcoin and Ethereum, covering topics such as proof-of-work, proof-of-stake, and smart contracts. Additionally, it highlights practical applications of blockchain in industries like decentralized finance (DeFi), supply chain management, and identity authentication. The discussion also extends to consensus mechanisms and scalability challenges in blockchain, offering insights into emerging technologies like Layer 2 solutions and cross-chain interoperability. The article concludes by addressing the current state of academic research on blockchain and its potential future developments., Comment: This book contains 241 pages and 5 figures
Published: 2024

8. Mastering AI: Big Data, Deep Learning, and the Evolution of Large Language Models -- AutoML from Basics to State-of-the-Art Techniques

Author: Feng, Pohsun, Bi, Ziqian, Wen, Yizhu, Peng, Benji, Liu, Junyu, Yin, Caitlyn Heqi, Wang, Tianyang, Chen, Keyu, Zhang, Sen, Li, Ming, Xu, Jiawei, Liu, Ming, Pan, Xuanhe, Wang, Jinlang, and Niu, Qian
Subjects: Computer Science - Machine Learning
Abstract: This manuscript presents a comprehensive guide to Automated Machine Learning (AutoML), covering fundamental principles, practical implementations, and future trends. The paper is structured to assist both beginners and experienced practitioners, with detailed discussions on popular AutoML tools such as TPOT, AutoGluon, and Auto-Keras. It also addresses emerging topics like Neural Architecture Search (NAS) and AutoML's applications in deep learning. We believe this work will contribute to ongoing research and development in the field of AI and machine learning., Comment: This book contains 170 pages and 5 figures
Published: 2024

9. NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models

Author: Ho, Zheng Yi, Liang, Siyuan, Zhang, Sen, Zhan, Yibing, and Tao, Dacheng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Hallucinations in Large Language Models (LLMs) remain a major obstacle, particularly in high-stakes applications where factual accuracy is critical. While representation editing and reading methods have made strides in reducing hallucinations, their heavy reliance on specialised tools and training on in-domain samples makes them difficult to scale and prone to overfitting. This limits their accuracy gains and generalizability to diverse datasets. This paper presents a lightweight method, Norm Voting (NoVo), which harnesses the untapped potential of attention head norms to dramatically enhance factual accuracy in zero-shot multiple-choice questions (MCQs). NoVo begins by automatically selecting truth-correlated head norms with an efficient, inference-only algorithm using only 30 random samples, allowing NoVo to effortlessly scale to diverse datasets. Afterwards, selected head norms are employed in a simple voting algorithm, which yields significant gains in prediction accuracy. On TruthfulQA MC1, NoVo surpasses the current state-of-the-art and all previous methods by an astounding margin -- at least 19 accuracy points. NoVo demonstrates exceptional generalization to 20 diverse datasets, with significant gains in over 90% of them, far exceeding all current representation editing and reading methods. NoVo also reveals promising gains to finetuning strategies and building textual adversarial defence. NoVo's effectiveness with head norms opens new frontiers in LLM interpretability, robustness and reliability.
Published: 2024

10. Deep Learning and Machine Learning with GPGPU and CUDA: Unlocking the Power of Parallel Computing

Author: Li, Ming, Bi, Ziqian, Wang, Tianyang, Wen, Yizhu, Niu, Qian, Liu, Junyu, Peng, Benji, Zhang, Sen, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, Chen, Keyu, Yin, Caitlyn Heqi, Feng, Pohsun, and Liu, Ming
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Hardware Architecture
Abstract: This book presents a comprehensive exploration of GPGPU (General Purpose Graphics Processing Unit) and its applications in deep learning and machine learning. It focuses on how parallel computing, particularly through the use of CUDA (Compute Unified Device Architecture), can unlock unprecedented computational power for complex tasks. The book provides detailed discussions on CPU and GPU architectures, data flow in deep learning, and advanced GPU features like streams, concurrency, and dynamic parallelism. Furthermore, it delves into practical applications of GPGPU in various domains such as scientific computing, machine learning acceleration, real-time rendering, and cryptocurrency mining. The authors also emphasize the importance of selecting the right parallel architecture (e.g., GPU, FPGA, TPU, ASIC) based on specific tasks, offering insights into optimizing algorithms for these platforms. The book also provides practical examples with popular machine learning frameworks like PyTorch, TensorFlow, and XGBoost, demonstrating how to efficiently leverage GPU resources in both training and inference. This resource is valuable for both beginners and advanced readers who are looking to deepen their understanding of GPU-based parallel computing and its significant role in modern machine learning and AI applications., Comment: 106 pages
Published: 2024

11. Deep Learning and Machine Learning: Advancing Big Data Analytics and Management with Design Patterns

Author: Chen, Keyu, Bi, Ziqian, Wang, Tianyang, Wen, Yizhu, Feng, Pohsun, Niu, Qian, Liu, Junyu, Peng, Benji, Zhang, Sen, Li, Ming, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, Yin, Caitlyn Heqi, and Liu, Ming
Subjects: Computer Science - Software Engineering, Computer Science - Machine Learning
Abstract: This book, Design Patterns in Machine Learning and Deep Learning: Advancing Big Data Analytics Management, presents a comprehensive study of essential design patterns tailored for large-scale machine learning and deep learning applications. The book explores the application of classical software engineering patterns, Creational, Structural, Behavioral, and Concurrency Patterns, to optimize the development, maintenance, and scalability of big data analytics systems. Through practical examples and detailed Python implementations, it bridges the gap between traditional object-oriented design patterns and the unique demands of modern data analytics environments. Key design patterns such as Singleton, Factory, Observer, and Strategy are analyzed for their impact on model management, deployment strategies, and team collaboration, providing invaluable insights into the engineering of efficient, reusable, and flexible systems. This volume is an essential resource for developers, researchers, and engineers aiming to enhance their technical expertise in both machine learning and software design., Comment: 138 pages
Published: 2024

12. Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Unveiling AI's Potential Through Tools, Techniques, and Applications

Author: Feng, Pohsun, Bi, Ziqian, Wen, Yizhu, Pan, Xuanhe, Peng, Benji, Liu, Ming, Xu, Jiawei, Chen, Keyu, Liu, Junyu, Yin, Caitlyn Heqi, Zhang, Sen, Wang, Jinlang, Niu, Qian, Li, Ming, and Wang, Tianyang
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: This book serves as an introduction to deep learning and machine learning, focusing on their applications in big data analytics. It covers essential concepts, tools like ChatGPT and Claude, hardware recommendations, and practical guidance on setting up development environments using libraries like PyTorch and TensorFlow. Designed for beginners and advanced users alike, it provides step-by-step instructions, hands-on projects, and insights into AI's future, including AutoML and edge computing., Comment: This book contains 156 pages and 9 figures
Published: 2024

13. Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Object-Oriented Programming

Author: Wang, Tianyang, Bi, Ziqian, Chen, Keyu, Xu, Jiawei, Niu, Qian, Liu, Junyu, Peng, Benji, Li, Ming, Zhang, Sen, Pan, Xuanhe, Wang, Jinlang, Feng, Pohsun, Yin, Caitlyn Heqi, Wen, Yizhu, and Liu, Ming
Subjects: Computer Science - Computation and Language, Computer Science - Software Engineering
Abstract: Object-Oriented Programming (OOP) has become a crucial paradigm for managing the growing complexity of modern software systems, particularly in fields like machine learning, deep learning, large language models (LLM), and data analytics. This work provides a comprehensive introduction to the integration of OOP techniques within these domains, with a focus on improving code modularity, maintainability, and scalability. We begin by outlining the evolution of computing and the rise of OOP, followed by an in-depth discussion of key OOP principles such as encapsulation, inheritance, polymorphism, and abstraction. The practical application of these principles is demonstrated using Python, a widely adopted language in AI and data science. Furthermore, we examine how design patterns and modular programming can be employed to enhance the structure and efficiency of machine learning systems. In subsequent sections, we apply these OOP concepts to real-world AI tasks, including the encapsulation of preprocessing workflows, machine learning model training, and evaluation. Detailed examples illustrate how OOP can be used to build reusable, scalable machine learning systems while maintaining code clarity and reducing redundancy. This work is intended to serve as a bridge for both beginners and experienced developers, equipping them with the necessary knowledge to apply OOP methodologies in AI-driven projects, ultimately fostering the development of more robust and maintainable systems., Comment: 47 pages
Published: 2024

14. Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Handy Appetizer

Author: Peng, Benji, Pan, Xuanhe, Wen, Yizhu, Bi, Ziqian, Chen, Keyu, Li, Ming, Liu, Ming, Niu, Qian, Liu, Junyu, Wang, Jinlang, Zhang, Sen, Xu, Jiawei, and Feng, Pohsun
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: This book explores the role of Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) in driving the progress of big data analytics and management. The book focuses on simplifying the complex mathematical concepts behind deep learning, offering intuitive visualizations and practical case studies to help readers understand how neural networks and technologies like Convolutional Neural Networks (CNNs) work. It introduces several classic models and technologies such as Transformers, GPT, ResNet, BERT, and YOLO, highlighting their applications in fields like natural language processing, image recognition, and autonomous driving. The book also emphasizes the importance of pre-trained models and how they can enhance model performance and accuracy, with instructions on how to apply these models in various real-world scenarios. Additionally, it provides an overview of key big data management technologies like SQL and NoSQL databases, as well as distributed computing frameworks such as Apache Hadoop and Spark, explaining their importance in managing and processing vast amounts of data. Ultimately, the book underscores the value of mastering deep learning and big data management skills as critical tools for the future workforce, making it an essential resource for both beginners and experienced professionals., Comment: This book contains 93 pages and 60 figures
Published: 2024

15. Deep Learning and Machine Learning, Advancing Big Data Analytics and Management: Tensorflow Pretrained Models

Author: Chen, Keyu, Bi, Ziqian, Niu, Qian, Liu, Junyu, Peng, Benji, Zhang, Sen, Liu, Ming, Li, Ming, Pan, Xuanhe, Xu, Jiawei, Wang, Jinlang, and Feng, Pohsun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This book focuses on the application of TensorFlow pre-trained models in deep learning, providing detailed guidance on effectively using these models for tasks such as image classification and object detection. It covers practical implementations of modern architectures like ResNet, MobileNet, and EfficientNet, demonstrating the power of transfer learning through real-world examples and experiments. The book compares linear probing and model fine-tuning, offering visualizations using techniques such as PCA, t-SNE, and UMAP to help readers intuitively understand the impact of different approaches. Designed for beginners to advanced users, this book includes complete example code and step-by-step instructions, enabling readers to quickly master how to leverage pre-trained models to improve performance in practical scenarios. By blending theoretical insights with hands-on practice, this book equips readers with the knowledge to confidently tackle various deep learning challenges., Comment: This book contains 148 pages and 7 figures
Published: 2024

16. Surveying the MLLM Landscape: A Meta-Review of Current Surveys

Author: Li, Ming, Chen, Keyu, Bi, Ziqian, Liu, Ming, Peng, Benji, Niu, Qian, Liu, Junyu, Wang, Jinlang, Zhang, Sen, Pan, Xuanhe, Xu, Jiawei, and Feng, Pohsun
Subjects: Computer Science - Computation and Language
Abstract: The rise of Multimodal Large Language Models (MLLMs) has become a transformative force in the field of artificial intelligence, enabling machines to process and generate content across multiple modalities, such as text, images, audio, and video. These models represent a significant advancement over traditional unimodal systems, opening new frontiers in diverse applications ranging from autonomous agents to medical diagnostics. By integrating multiple modalities, MLLMs achieve a more holistic understanding of information, closely mimicking human perception. As the capabilities of MLLMs expand, the need for comprehensive and accurate performance evaluation has become increasingly critical. This survey aims to provide a systematic review of benchmark tests and evaluation methods for MLLMs, covering key topics such as foundational concepts, applications, evaluation methodologies, ethical concerns, security, efficiency, and domain-specific applications. Through the classification and analysis of existing literature, we summarize the main contributions and methodologies of various surveys, conduct a detailed comparative analysis, and examine their impact within the academic community. Additionally, we identify emerging trends and underexplored areas in MLLM research, proposing potential directions for future studies. This survey is intended to offer researchers and practitioners a comprehensive understanding of the current state of MLLM evaluation, thereby facilitating further progress in this rapidly evolving field., Comment: The article consists of 22 pages, including 2 figures and 108 references. The paper provides a meta-review of surveys on Multimodal Large Language Models (MLLMs), categorizing findings into key areas such as evaluation, applications, security, and future directions
Published: 2024

17. Alignment-Aware Model Extraction Attacks on Large Language Models

Author: Liang, Zi, Ye, Qingqing, Wang, Yanyun, Zhang, Sen, Xiao, Yaxin, Li, Ronghua, Xu, Jianliang, and Hu, Haibo
Subjects: Computer Science - Cryptography and Security, Computer Science - Computation and Language
Abstract: Model extraction attacks (MEAs) on large language models (LLMs) have received increasing research attention lately. Existing attack methods on LLMs inherit the extraction strategies from those designed for deep neural networks (DNNs) yet neglect the inconsistency of training tasks between MEA and LLMs' alignments. As such, they result in poor attack performances. To tackle this issue, we present Locality Reinforced Distillation (LoRD), a novel model extraction attack algorithm specifically for LLMs. In particular, we design a policy-gradient-style training task, which utilizes victim models' responses as a signal to guide the crafting of preference for the local model. Theoretical analysis has shown that i) LoRD's convergence procedure in MEAs is consistent with the alignments of LLMs, and ii) LoRD can reduce query complexity while mitigating watermark protection through exploration-based stealing. Extensive experiments on domain-specific extractions demonstrate the superiority of our method by examining the extraction of various state-of-the-art commercial LLMs., Comment: Source code: https://github.com/liangzid/alignmentExtraction
Published: 2024

18. New global Carleman estimates and null controllability for a stochastic Cahn-Hilliard type equation

Author: Zhang, Sen, Gao, Hang, and Yuan, Ganghua
Subjects: Mathematics - Optimization and Control, Mathematics - Analysis of PDEs
Abstract: In this paper, we study the null controllability for a stochastic semilinear Cahn-Hilliard type equation, whose semilinear term contains first and second order derivatives of solutions. To start with, an improved global Carleman estimate for linear backward stochastic fourth order parabolic equations with $L^2$-valued source terms is derived, which is based on a new fundamental identity for a stochastic fourth order parabolic operator. Based on it, we establish a new global Carleman estimate for linear backward stochastic fourth order parabolic equations with $H^{-2}$-valued source terms, which, together with a fixed point argument, yields the desired null controllability for the stochastic Cahn-Hilliard type equation., Comment: arXiv admin note: substantial text overlap with arXiv:2408.03114
Published: 2024

19. Global null controllability of stochastic semilinear complex Ginzburg-Landau equations

Author: Zhang, Sen, Gao, Hang, and Yuan, Ganghua
Subjects: Mathematics - Optimization and Control
Abstract: In this paper, we study the null controllability of forward and backward stochastic semilinear complex Ginzburg-Landau equations with global Lipschitz nonlinear terms. For this purpose, by deriving improved global Carleman estimates for linear systems, we obtain the controllability results for the stochastic linear systems with an $L^2$-valued source term. Based on it, together with a Banach fixed point argument, the desired null controllability of semilinear systems is derived.
Published: 2024

20. Observation of Heat Pumping Effect by Radiative Shuttling

Author: Li, Yuxuan, Dang, Yongdi, Zhang, Sen, Li, Xinran, Chen, Tianle, Choudhury, Pankaj K., Jin, Yi, Xu, Jianbin, Ben-Abdallah, Philippe, Ju, Bing-Feng, and Ma, Yungui
Subjects: Physics - Applied Physics
Abstract: The heat shuttling phenomenon is characterized by the presence of a non-zero heat flow between two bodies without net thermal bias on average. It was initially predicted in the context of nonlinear heat conduction within atomic lattices coupled to two time-oscillating thermostats. Recent theoretical works revealed an analog of this effect for heat exchanges mediated by thermal photons between two solids having a temperature dependent emissivity. In this paper, we present the experimental proof of this effect using systems made with composite materials based on phase change materials. By periodically modulating the temperature of one of two solids, we report that the system behaves akin to a heat pump with a controllable heat flow direction. Additionally, we demonstrate the effectiveness of a simultaneous modulation of two temperatures to control both the strength and direction of heat shuttling by exploiting the phase delay between these temperatures. These results show that this effect is promising for an active thermal management of solid-state technology, to cool down solids, to insulate them from their background or to amplify heat exchanges.
Published: 2024

21. Transient Measurement of Near-field Thermal Radiation between Macroscopic Objects

Author: Zhang, Sen, Dang, Yongdi, Li, Xinran, Li, Yuxuan, Jin, Yi, Choudhury, Pankaj K, Xu, Jianbing, and Ma, Yungui
Subjects: Physics - Optics, Physics - Data Analysis, Statistics and Probability
Abstract: The involvement of evanescent waves in the near-field regime could greatly enhance the spontaneous thermal radiation, offering a unique opportunity to study nanoscale photon-phonon interaction. However, accurately characterizing this subtle phenomenon is very challenging. This paper proposes a transient all-optical method for rapidly characterizing near-field radiative heat transfer (NFRHT) between macroscopic objects, using the first law of thermodynamics. Significantly, a full measurement at a fixed gap distance is completed within tens of seconds. By simplifying the configuration, the transient all-optical method achieves high measurement accuracy and reliable reproducibility. The proposed method can effectively analyze the NFRHT in various material systems, including SiO2, SiC, and Si, which involve different phonon or plasmon polaritons. Experimental observations demonstrate significant super-Planckian radiation, which arises from the near-field coupling of bounded surface modes. Furthermore, the method achieves excellent agreement with theory, with a minimal discrepancy of less than 2.7% across a wide temperature range. This wireless method could accurately characterize the NFRHT for objects with different sizes or optical properties, enabling the exploration of both fundamental interests and practical applications.
Published: 2024

22. From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

Author: Sohn, Samuel S., Li, Danrui, Zhang, Sen, Chang, Che-Jui, and Kapadia, Mubbasir
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Graphics
Abstract: Digital storytelling, essential in entertainment, education, and marketing, faces challenges in production scalability and flexibility. The StoryAgent framework, introduced in this paper, utilizes Large Language Models and generative tools to automate and refine digital storytelling. Employing a top-down story drafting and bottom-up asset generation approach, StoryAgent tackles key issues such as manual intervention, interactive scene orchestration, and narrative consistency. This framework enables efficient production of interactive and consistent narratives across multiple modalities, democratizing content creation and enhancing engagement. Our results demonstrate the framework's capability to produce coherent digital stories without reference videos, marking a significant advancement in automated digital storytelling. Comment: 16 pages, 13 figures
Published: 2024

23. Effects of CeO2 Content on the Microstructure and Mechanical Properties of ZK60 Mg Alloy

Author: Qiu, Wei, Li, Shuang-Long, Lu, Zhao-Yuan, Zhang, Sen-Mao, Chen, Jian, Chen, Wei, Gan, Lang, Li, Wei, Ren, Yan-Jie, Luo, Jun, Yao, Mao-Hai, and Xie, Wen
Published: 2024
Full Text: View/download PDF

24. The effect of laser cladding coating on the friction and wear resistance of hydraulic pump

Author: Zhang, Sen
Published: 2024
Full Text: View/download PDF

25. CT assessed morphological features can predict higher mitotic index in gastric gastrointestinal stromal tumors

Author: Jia, Xiaoxuan, Xiao, Youping, Zhang, Hui, Li, Jiazheng, Lv, Shiying, Zhang, Yinli, Chai, Fan, Feng, Caizhen, Liu, Yulu, Chen, Haoquan, Ma, Feiyu, Wei, Shengcai, Cheng, Jin, Zhang, Sen, Gao, Zhidong, Hong, Nan, Tang, Lei, and Wang, Yi
Published: 2024
Full Text: View/download PDF

26. Interferon Gamma Inducible Protein 30: from biological functions to potential therapeutic target in cancers

Author: Zhang, Sen, Ren, Liwen, Li, Wan, Zhang, Yizhi, Yang, Yihui, Yang, Hong, Xu, Fang, Cao, Wanxin, Li, Xiaoxue, Zhang, Xu, Du, Guanhua, and Wang, Jinhua
Published: 2024
Full Text: View/download PDF

27. Towards Theoretical Understandings of Self-Consuming Generative Models

Author: Fu, Shi, Zhang, Sen, Wang, Yingjie, Tian, Xinmei, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This paper tackles the emerging challenge of training generative models within a self-consuming loop, wherein successive generations of models are recursively trained on mixtures of real and synthetic data from previous generations. We construct a theoretical framework to rigorously evaluate how this training procedure impacts the data distributions learned by future models, including parametric and non-parametric models. Specifically, we derive bounds on the total variation (TV) distance between the synthetic data distributions produced by future models and the original real data distribution under various mixed training scenarios for diffusion models with a one-hidden-layer neural network score function. Our analysis demonstrates that this distance can be effectively controlled under the condition that mixed training dataset sizes or proportions of real data are large enough. Interestingly, we further unveil a phase transition induced by expanding synthetic data amounts, proving theoretically that while the TV distance exhibits an initial ascent, it declines beyond a threshold point. Finally, we present results for kernel density estimation, delivering nuanced insights such as the impact of mixed data training on error propagation. Comment: Accepted at ICML 2024
Published: 2024

28. InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling

Author: Miao, Yuchun, Zhang, Sen, Ding, Liang, Bao, Rong, Zhang, Lefei, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Despite the success of reinforcement learning from human feedback (RLHF) in aligning language models with human values, reward hacking, also termed reward overoptimization, remains a critical challenge. This issue primarily arises from reward misgeneralization, where reward models (RMs) compute reward using spurious features that are irrelevant to human preferences. In this work, we tackle this problem from an information-theoretic perspective and propose a framework for reward modeling, namely InfoRM, by introducing a variational information bottleneck objective to filter out irrelevant information. Notably, we further identify a correlation between overoptimization and outliers in the IB latent space of InfoRM, establishing it as a promising tool for detecting reward overoptimization. Inspired by this finding, we propose the Cluster Separation Index (CSI), which quantifies deviations in the IB latent space, as an indicator of reward overoptimization to facilitate the development of online mitigation strategies. Extensive experiments on a wide range of settings and RM scales (70M, 440M, 1.4B, and 7B) demonstrate the effectiveness of InfoRM. Further analyses reveal that InfoRM's overoptimization detection mechanism is not only effective but also robust across a broad range of datasets, signifying a notable advancement in the field of RLHF. The code will be released upon acceptance. Comment: The paper has been accepted by NeurIPS 2024
Published: 2024

29. Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

Author: Zhang, Ziyi, Zhang, Sen, Zhan, Yibing, Luo, Yong, Wen, Yonggang, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Bridging the gap between diffusion models and human preferences is crucial for their integration into practical generative workflows. While optimizing downstream reward models has emerged as a promising alignment strategy, concerns arise regarding the risk of excessive optimization with learned reward models, which potentially compromises ground-truth performance. In this work, we confront the reward overoptimization problem in diffusion model alignment through the lenses of both inductive and primacy biases. We first identify a mismatch between current methods and the temporal inductive bias inherent in the multi-step denoising process of diffusion models, as a potential source of reward overoptimization. Then, we surprisingly discover that dormant neurons in our critic model act as a regularization against reward overoptimization while active neurons reflect primacy bias. Motivated by these observations, we propose Temporal Diffusion Policy Optimization with critic active neuron Reset (TDPO-R), a policy gradient algorithm that exploits the temporal inductive bias of diffusion models and mitigates the primacy bias stemming from active neurons. Empirical results demonstrate the superior efficacy of our methods in mitigating reward overoptimization. Code is available at https://github.com/ZiyiZhang27/tdpo. Comment: Accepted to ICML 2024
Published: 2024

30. FreDF: Learning to Forecast in Frequency Domain

Author: Wang, Hao, Pan, Licheng, Chen, Zhichao, Yang, Degui, Zhang, Sen, Yang, Yifei, Liu, Xinggao, Li, Haoxuan, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Applications, Statistics - Machine Learning
Abstract: Time series modeling is uniquely challenged by the presence of autocorrelation in both historical and label sequences. Current research predominantly focuses on handling autocorrelation within the historical sequence but often neglects its presence in the label sequence. Specifically, emerging forecast models mainly conform to the direct forecast (DF) paradigm, generating multi-step forecasts under the assumption of conditional independence within the label sequence. This assumption disregards the inherent autocorrelation in the label sequence, thereby limiting the performance of DF-based models. In response to this gap, we introduce the Frequency-enhanced Direct Forecast (FreDF), which bypasses the complexity of label autocorrelation by learning to forecast in the frequency domain. Our experiments demonstrate that FreDF substantially outperforms existing state-of-the-art methods including iTransformer and is compatible with a variety of forecast models.
Published: 2024

31. Friction performance prediction of automotive pads under operating conditions using attention-based CNN-BiLSTM deep learning framework

Author: Yin, Xiaojing, Zhang, Sen, Zhang, Yu, Pang, Zaixiang, and Zhang, Bangcheng
Published: 2024
Full Text: View/download PDF

32. Effects of indole-3-butyric acid supplementation in diets containing high soybean meal on growth, intestinal inflammation, and intestinal flora of pearl gentian grouper (Epinephelus fuscoguttatus♀ × Epinephelus lanceolatus♂)

Author: Pang, Aobo, Zhang, Sen, Dong, Shaohan, Zhang, Xianxin, Liang, Junhui, Fang, Yanxin, Tan, Beiping, and Zhang, Wei
Published: 2024
Full Text: View/download PDF

33. Oxide-modified HZSM-5 for efficient production of hydrogen via steam reforming of dimethyl ether

Author: Zhang, Sen-Han, Zhao, Yong‑Hua, Zhang, Jia-Kang, Feng, Xiao-Qian, Zhang, Qi‑Jian, and Wang, Huan
Published: 2024
Full Text: View/download PDF

34. S2M: Converting Single-Turn to Multi-Turn Datasets for Conversational Question Answering

Author: Li, Baokui, Zhang, Sen, Zhang, Wangshu, Chen, Yicheng, Yang, Changlin, Hu, Sen, Xu, Teng, Liu, Siye, and Li, Jiwei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Supplying data augmentation to conversational question answering (CQA) can effectively improve model performance. However, there is less improvement from single-turn datasets in CQA due to the distribution gap between single-turn and multi-turn datasets. On the other hand, while numerous single-turn datasets are available, we have not utilized them effectively. To solve this problem, we propose a novel method to convert single-turn datasets to multi-turn datasets. The proposed method consists of three parts, namely, a QA pair Generator, a QA pair Reassembler, and a question Rewriter. Given a sample consisting of context and single-turn QA pairs, the Generator obtains candidate QA pairs and a knowledge graph based on the context. The Reassembler utilizes the knowledge graph to get sequential QA pairs, and the Rewriter rewrites questions from a conversational perspective to obtain a multi-turn dataset S2M. Our experiments show that our method can synthesize effective training resources for CQA. Notably, S2M ranks 1st place on the QuAC leaderboard at the time of submission (Aug 24th, 2022).
Published: 2023

35. Measurement of Near-field Thermal Radiation between Multilayered Metamaterials

Author: Zhang, Sen, Dang, Yongdi, Li, Xinran, Naeem, Iqbal, Jin, Yi, Choudhury, Pankaj K, Antezza, Mauro, Xu, Jianbin, and Ma, Yungui
Subjects: Physics - Optics, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: The near-field radiative heat transfer (NFRHT) between one-dimensional metamaterials comprising phonon dielectric multilayers was investigated experimentally. Large-sized (1 cm × 1 cm) near-field samples were fabricated using SiC, SiO2 and Ge layers at a certain gap distance, and the effect of layer stacking order and phonon resonance quality on the NFRHT was examined. The measured results show good agreement with those obtained theoretically employing the transmission matrix method. Super-Planckian blackbody radiation was observed between the emitters and receivers with identical structures. Measurements demonstrate failure of the effective medium theory (EMT) in predicting the near-field heat flux, especially in the presence of bounded surface modes, such as the epsilon-near-zero (ENZ) mode. Analyses also indicate that, in certain cases, the EMT can provide reasonable physical insight into the underlying coupling process from the perspective of homogenized media. The conditions to apply the EMT in the near-field regime were also touched upon. Comment: 18 pages, 5 figures
Published: 2023
Full Text: View/download PDF

36. Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

Author: Ma, Guozheng, Li, Lu, Zhang, Sen, Liu, Zixuan, Wang, Zhen, Chen, Yixin, Shen, Li, Wang, Xueqian, and Tao, Dacheng
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Plasticity, the ability of a neural network to evolve with new data, is crucial for high-performance and sample-efficient visual reinforcement learning (VRL). Although methods like resetting and regularization can potentially mitigate plasticity loss, the influences of various components within the VRL framework on the agent's plasticity are still poorly understood. In this work, we conduct a systematic empirical exploration focusing on three primary underexplored facets and derive the following insightful conclusions: (1) data augmentation is essential in maintaining plasticity; (2) the critic's plasticity loss serves as the principal bottleneck impeding efficient training; and (3) without timely intervention to recover critic's plasticity in the early stages, its loss becomes catastrophic. These insights suggest a novel strategy to address the high replay ratio (RR) dilemma, where exacerbated plasticity loss hinders the potential improvements of sample efficiency brought by increased reuse frequency. Rather than setting a static RR for the entire training process, we propose Adaptive RR, which dynamically adjusts the RR based on the critic's plasticity level. Extensive evaluations indicate that Adaptive RR not only avoids catastrophic plasticity loss in the early stages but also benefits from more frequent reuse in later phases, resulting in superior sample efficiency. Comment: ICLR 2024 poster
Published: 2023

37. Construction of a Pt‐CeOx Interface for the Electrocatalytic Hydrogen Evolution Reaction

Author: Yu, Shen‐Wei, Kwon, Soonho, Chen, Yizhen, Xie, Zhenhua, Lu, Xiner, He, Kai, Hwang, Sooyeon, Chen, Jingguang G, Goddard, William A, and Zhang, Sen
Subjects: Macromolecular and Materials Chemistry, Chemical Sciences, Physical Chemistry, Engineering, Materials Engineering, Affordable and Clean Energy, electrolysis, hydrogen evolution reaction, metal-metal oxide interface, Pt-CeOx, Physical Sciences, Materials, Chemical sciences, Physical sciences
Abstract: The creation of metal-metal oxide interfaces is an important approach to fine-tuning catalyst properties through strong interfacial interactions. This article presents the work on developing interfaces between Pt and CeOx that improve Pt surface energetics for the hydrogen evolution reaction (HER) within an alkaline electrolyte. The Pt-CeOx interfaces are formed by depositing size-controlled Pt nanoparticles onto a carbon support already coated with ultrathin CeOx nanosheets. This interface structure facilitates substantial electron transfer from Pt to CeOx, resulting in decreased hydrogen binding energies on Pt surfaces, and water dissociation for the HER, as predicted by the density functional theory (DFT) calculations. Electrochemical testing indicates that both Pt specific activity and mass activity are improved by a factor of 2 to 3 following the formation of Pt-CeOx interfaces. This study underscores the significance and potential of harnessing robust interfacial effects to enhance electrocatalytic reactions.
Published: 2024

38. In-depth analysis of lymph node metastasis-related sialylated protein profiling and their clinical and biological significance in colorectal cancer using mass spectrometry and multi-omics technologies

Author: Shao, Yanfei, Yu, Mengqin, Zhang, Luyang, Zhou, Leqi, Yan, Xialin, Feng, Bo, and Zhang, Sen
Published: 2024
Full Text: View/download PDF

39. Identifying genetic targets in clinical subtypes of Parkinson’s disease for optimizing pharmacological treatment strategies

Author: Kong, Dewen, Li, Cao, Ma, LingYan, Du, Lida, Jiang, Nan, Zhao, Xiaoyue, Zhang, Sen, Zhao, Zhigang, Fang, Lianhua, and Du, Guanhua
Published: 2024
Full Text: View/download PDF

40. Author Correction: Prognostic value of pre-treatment neutrophil-to-lymphocyte ratio in patients with brain metastasis from cancer: a meta-analysis

Author: Zhou, Ke, Wan, Jun, Li, Yile, Yuan, Yi, Liu, Qian, Li, Huixuan, Jiang, Xinyi, Yuan, Xiang, Zhang, Sen, and Zhang, Yu
Published: 2024
Full Text: View/download PDF

41. Plant growth-promoting fungi improve tobacco yield and chemical components by reassembling rhizosphere fungal microbiome and recruiting probiotic taxa

Author: Shi, Mingzi, Hao, Shanghua, Wang, Yuhe, Zhang, Sen, Cui, Guangzhou, Zhang, Bin, Zhou, Wang, Chen, Hongge, and Wang, Mingdao
Published: 2024
Full Text: View/download PDF

42. Prognostic value of pre-treatment neutrophil-to-lymphocyte ratio in patients with brain metastasis from cancer: a meta-analysis

Author: Zhou, Ke, Wan, Jun, Li, Yile, Yuan, Yi, Liu, Qian, Li, Huixuan, Jiang, Xinyi, Yuan, Xiang, Zhang, Sen, and Zhang, Yu
Published: 2024
Full Text: View/download PDF

43. Protein complex structure modeling by cross-modal alignment between cryo-EM maps and protein sequences

Author: Chen, Sheng, Zhang, Sen, Fang, Xiaoyu, Lin, Liang, Zhao, Huiying, and Yang, Yuedong
Published: 2024
Full Text: View/download PDF

44. Correction: AAA237, an SKP2 inhibitor, suppresses glioblastoma by inducing BNIP3-dependent autophagy through the mTOR pathway

Author: Zhang, Yizhi, Li, Wan, Yang, Yihui, Zhang, Sen, Yang, Hong, Hao, Yue, Fang, Xu, Du, Guanhua, Shi, Jianyou, Wu, Lianqiu, and Wang, Jinhua
Published: 2024
Full Text: View/download PDF

45. Impact of the triglyceride-glucose index on 28-day mortality in non-diabetic critically Ill patients with sepsis: a retrospective cohort analysis

Author: Zhang, Sen, Fan, Tianhua, Wang, Li, Chen, Nan, and Ma, Liansheng
Published: 2024
Full Text: View/download PDF

46. Single-cell tumor heterogeneity landscape of hepatocellular carcinoma: unraveling the pro-metastatic subtype and its interaction loop with fibroblasts

Author: Guo, De-Zhen, Zhang, Xin, Zhang, Sen-Quan, Zhang, Shi-Yu, Zhang, Xiang-Yu, Yan, Jia-Yan, Dong, San-Yuan, Zhu, Kai, Yang, Xin-Rong, Fan, Jia, Zhou, Jian, and Huang, Ao
Published: 2024
Full Text: View/download PDF

47. Longitudinal study on the change trend of serum alkaline phosphatase and its possible influencing factors in peritoneal dialysis patients

Author: Zhao, Wei, Zhang, Sen, and Zhao, Hai-dan
Published: 2024
Full Text: View/download PDF

48. AAA237, an SKP2 inhibitor, suppresses glioblastoma by inducing BNIP3-dependent autophagy through the mTOR pathway

Author: Zhang, Yizhi, Li, Wan, Yang, Yihui, Zhang, Sen, Yang, Hong, Hao, Yue, Fang, Xu, Du, Guanhua, Shi, Jianyou, Wu, Lianqiu, and Wang, Jinhua
Published: 2024
Full Text: View/download PDF

49. The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents

Author: Chang, Che-Jui, Sohn, Samuel S., Zhang, Sen, Jayashankar, Rajath, Usman, Muhammad, and Kapadia, Mubbasir
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Graphics
Abstract: Previous studies regarding the perception of emotions for embodied virtual agents have shown the effectiveness of using virtual characters in conveying emotions through interactions with humans. However, creating an autonomous embodied conversational agent with expressive behaviors presents two major challenges. The first challenge is the difficulty of synthesizing the conversational behaviors for each modality that are as expressive as real human behaviors. The second challenge is that the affects are modeled independently, which makes it difficult to generate multimodal responses with consistent emotions across all modalities. In this work, we propose a conceptual framework, ACTOR (Affect-Consistent mulTimodal behaviOR generation), that aims to increase the perception of affects by generating multimodal behaviors conditioned on a consistent driving affect. We have conducted a user study with 199 participants to assess how the average person judges the affects perceived from multimodal behaviors that are consistent and inconsistent with respect to a driving affect. The result shows that among all model conditions, our affect-consistent framework receives the highest Likert scores for the perception of driving affects. Our statistical analysis suggests that making a modality affect-inconsistent significantly decreases the perception of driving affects. We also observe that multimodal behaviors conditioned on consistent affects are more expressive compared to behaviors with inconsistent affects. Therefore, we conclude that multimodal emotion conditioning and affect consistency are vital to enhancing the perception of affects for embodied conversational agents.
Published: 2023
Full Text: View/download PDF

50. Are Large Language Models Really Robust to Word-Level Perturbations?

Author: Wang, Haoyu, Ma, Guozheng, Yu, Cong, Gui, Ning, Zhang, Linrui, Huang, Zhiqi, Ma, Suwei, Chang, Yongzhe, Zhang, Sen, Shen, Li, Wang, Xueqian, Zhao, Peilin, and Tao, Dacheng
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The swift advancement in the scales and capabilities of Large Language Models (LLMs) positions them as promising tools for a variety of downstream tasks. In addition to the pursuit of better performance and the avoidance of violent feedback on a certain prompt, to ensure the responsibility of the LLM, much attention is drawn to the robustness of LLMs. However, existing evaluation methods mostly rely on traditional question answering datasets with predefined supervised labels, which do not align with the superior generation capabilities of contemporary LLMs. To address this issue, we propose a novel rational evaluation approach that leverages pre-trained reward models as diagnostic tools to evaluate the longer conversation generated from more challenging open questions by LLMs, which we refer to as the Reward Model for Reasonable Robustness Evaluation (TREvaL). Longer conversations manifest the comprehensive grasp of language models in terms of their proficiency in understanding questions, a capability not entirely encompassed by individual words or letters, which may exhibit oversimplification and inherent biases. Our extensive empirical experiments demonstrate that TREvaL provides an innovative method for evaluating the robustness of an LLM. Furthermore, our results demonstrate that LLMs frequently exhibit vulnerability to word-level perturbations that are commonplace in daily language usage. Notably, we are surprised to discover that robustness tends to decrease as fine-tuning (SFT and RLHF) is conducted. The code of TREvaL is available at https://github.com/Harry-mic/TREvaL.
Published: 2023
