Author: "Liu, Zirui" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Liu, Zirui"' showing total 1,266 results

Start Over Author "Liu, Zirui"

1,266 results on '"Liu, Zirui"'

1. Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization

Author: Chuang, Yu-Neng, Yu, Leisheng, Wang, Guanchu, Zhang, Lizhe, Liu, Zirui, Cai, Xuanting, Sui, Yang, Braverman, Vladimir, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) are increasingly deployed and democratized on edge devices. To improve the efficiency of on-device deployment, small language models (SLMs) are often adopted due to their efficient decoding latency and reduced energy consumption. However, these SLMs often generate inaccurate responses when handling complex queries. One promising solution is uncertainty-based SLM routing, offloading high-stakes queries to stronger LLMs when resulting in low-confidence responses on SLM. This follows the principle of "If you lack confidence, seek stronger support" to enhance reliability. Relying on more powerful LLMs is yet effective but increases invocation costs. Therefore, striking a routing balance between efficiency and efficacy remains a critical challenge. Additionally, efficiently generalizing the routing strategy to new datasets remains under-explored. In this paper, we conduct a comprehensive investigation into benchmarking and generalization of uncertainty-driven routing strategies from SLMs to LLMs over 1500+ settings. Our findings highlight: First, uncertainty-correctness alignment in different uncertainty quantification (UQ) methods significantly impacts routing performance. Second, uncertainty distributions depend more on both the specific SLM and the chosen UQ method, rather than downstream data. Building on the insight, we propose a calibration data construction instruction pipeline and open-source a constructed hold-out set to enhance routing generalization on new downstream scenarios. The experimental results indicate calibration data effectively bootstraps routing performance without any new data.
Published: 2025

2. Massive Values in Self-Attention Modules are the Key to Contextual Knowledge Understanding

Author: Jin, Mingyu, Mei, Kai, Xu, Wujiang, Sun, Mingjie, Tang, Ruixiang, Du, Mengnan, Liu, Zirui, and Zhang, Yongfeng
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have achieved remarkable success in contextual knowledge understanding. In this paper, we show that these concentrated massive values consistently emerge in specific regions of attention queries (Q) and keys (K) while not having such patterns in values (V) in various modern transformer-based LLMs (Q, K, and V mean the representations output by the query, key, and value layers respectively). Through extensive experiments, we further demonstrate that these massive values play a critical role in interpreting contextual knowledge (knowledge obtained from the current context window) rather than in retrieving parametric knowledge stored within the model's parameters. Our further investigation of quantization strategies reveals that ignoring these massive values leads to a pronounced drop in performance on tasks requiring rich contextual understanding, aligning with our analysis. Finally, we trace the emergence of concentrated massive values and find that such concentration is caused by Rotary Positional Encoding (RoPE), which has appeared since the first layers. These findings shed new light on how Q and K operate in LLMs and offer practical insights for model design and optimization. The Code is Available at https://github.com/MingyuJ666/Rope_with_LLM.
Published: 2025

3. LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series Forecasting

Author: Yu, Guoqi, Li, Yaoming, Guo, Xiaoyu, Wang, Dayu, Liu, Zirui, Wang, Shujun, and Yang, Tong
Subjects: Computer Science - Machine Learning
Abstract: Forecasting models are pivotal in a data-driven world with vast volumes of time series data that appear as a compound of vast Linear and Nonlinear patterns. Recent deep time series forecasting models struggle to utilize seasonal and trend decomposition to separate the entangled components. Such a strategy only explicitly extracts simple linear patterns like trends, leaving the other linear modes and vast unexplored nonlinear patterns to the residual. Their flawed linear and nonlinear feature extraction models and shallow-level decomposition limit their adaptation to the diverse patterns present in real-world scenarios. Given this, we innovate Recursive Residual Decomposition by introducing explicit extraction of both linear and nonlinear patterns. This deeper-level decomposition framework, which is named LiNo, captures linear patterns using a Li block which can be a moving average kernel, and models nonlinear patterns using a No block which can be a Transformer encoder. The extraction of these two patterns is performed alternatively and recursively. To achieve the full potential of LiNo, we develop the current simple linear pattern extractor to a general learnable autoregressive model, and design a novel No block that can handle all essential nonlinear patterns. Remarkably, the proposed LiNo achieves state-of-the-art on thirteen real-world benchmarks under univariate and multivariate forecasting scenarios. Experiments show that current forecasting models can deliver more robust and precise results through this advanced Recursive Residual Decomposition. We hope this work could offer insight into designing more effective forecasting models. Code is available at this Repository: https://github.com/Levi-Ackman/LiNo.
Published: 2024

4. Weighted Diversified Sampling for Efficient Data-Driven Single-Cell Gene-Gene Interaction Discovery

Author: Wu, Yifan, Yang, Yuntao, Liu, Zirui, Li, Zhao, Pahwa, Khushbu, Li, Rongbin, Zheng, Wenjin, Hu, Xia, and Xu, Zhaozhuo
Subjects: Computer Science - Artificial Intelligence
Abstract: Gene-gene interactions play a crucial role in the manifestation of complex human diseases. Uncovering significant gene-gene interactions is a challenging task. Here, we present an innovative approach utilizing data-driven computational tools, leveraging an advanced Transformer model, to unearth noteworthy gene-gene interactions. Despite the efficacy of Transformer models, their parameter intensity presents a bottleneck in data ingestion, hindering data efficiency. To mitigate this, we introduce a novel weighted diversified sampling algorithm. This algorithm computes the diversity score of each data sample in just two passes of the dataset, facilitating efficient subset generation for interaction discovery. Our extensive experimentation demonstrates that by sampling a mere 1\% of the single-cell dataset, we achieve performance comparable to that of utilizing the entire dataset.
Published: 2024

5. Gradient Rewiring for Editable Graph Neural Network Training

Author: Jiang, Zhimeng, Liu, Zirui, Han, Xiaotian, Feng, Qizhang, Jin, Hongye, Tan, Qiaoyu, Zhou, Kaixiong, Zou, Na, and Hu, Xia
Subjects: Computer Science - Machine Learning
Abstract: Deep neural networks are ubiquitously adopted in many applications, such as computer vision, natural language processing, and graph analytics. However, well-trained neural networks can make prediction errors after deployment as the world changes. \textit{Model editing} involves updating the base model to correct prediction errors with less accessible training data and computational resources. Despite recent advances in model editors in computer vision and natural language processing, editable training in graph neural networks (GNNs) is rarely explored. The challenge with editable GNN training lies in the inherent information aggregation across neighbors, which can lead model editors to affect the predictions of other nodes unintentionally. In this paper, we first observe the gradient of cross-entropy loss for the target node and training nodes with significant inconsistency, which indicates that directly fine-tuning the base model using the loss on the target node deteriorates the performance on training nodes. Motivated by the gradient inconsistency observation, we propose a simple yet effective \underline{G}radient \underline{R}ewiring method for \underline{E}ditable graph neural network training, named \textbf{GRE}. Specifically, we first store the anchor gradient of the loss on training nodes to preserve the locality. Subsequently, we rewire the gradient of the loss on the target node to preserve performance on the training node using anchor gradient. Experiments demonstrate the effectiveness of GRE on various model architectures and graph datasets in terms of multiple editing situations. The source code is available at \url{https://github.com/zhimengj0326/Gradient_rewiring_editing}, Comment: NeurIPS 2024
Published: 2024

6. Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion

Author: Wang, Guanchu, Chuang, Yu-Neng, Tang, Ruixiang, Zhong, Shaochen, Yuan, Jiayi, Jin, Hongye, Liu, Zirui, Chaudhary, Vipin, Xu, Shuai, Caverlee, James, and Hu, Xia
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Ensuring the security of released large language models (LLMs) poses a significant dilemma, as existing mechanisms either compromise ownership rights or raise data privacy concerns. To address this dilemma, we introduce TaylorMLP to protect the ownership of released LLMs and prevent their abuse. Specifically, TaylorMLP preserves the ownership of LLMs by transforming the weights of LLMs into parameters of Taylor-series. Instead of releasing the original weights, developers can release the Taylor-series parameters with users, thereby ensuring the security of LLMs. Moreover, TaylorMLP can prevent abuse of LLMs by adjusting the generation speed. It can induce low-speed token generation for the protected LLMs by increasing the terms in the Taylor-series. This intentional delay helps LLM developers prevent potential large-scale unauthorized uses of their models. Empirical experiments across five datasets and three LLM architectures demonstrate that TaylorMLP induces over 4x increase in latency, producing the tokens precisely matched with original LLMs. Subsequent defensive experiments further confirm that TaylorMLP effectively prevents users from reconstructing the weight values based on downstream datasets.
Published: 2024

7. Robust Network Learning via Inverse Scale Variational Sparsification

Author: Zhou, Zhiling, Liu, Zirui, Xu, Chengming, Fu, Yanwei, and Sun, Xinwei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: While neural networks have made significant strides in many AI tasks, they remain vulnerable to a range of noise types, including natural corruptions, adversarial noise, and low-resolution artifacts. Many existing approaches focus on enhancing robustness against specific noise types, limiting their adaptability to others. Previous studies have addressed general robustness by adopting a spectral perspective, which tends to blur crucial features like texture and object contours. Our proposed solution, however, introduces an inverse scale variational sparsification framework within a time-continuous inverse scale space formulation. This framework progressively learns finer-scale features by discerning variational differences between pixels, ultimately preserving only large-scale features in the smoothed image. Unlike frequency-based methods, our approach not only removes noise by smoothing small-scale features where corruptions often occur but also retains high-contrast details such as textures and object contours. Moreover, our framework offers simplicity and efficiency in implementation. By integrating this algorithm into neural network training, we guide the model to prioritize learning large-scale features. We show the efficacy of our approach through enhanced robustness against various noise types., Comment: 21 pages, 7 figures
Published: 2024

8. INT-FlashAttention: Enabling Flash Attention for INT8 Quantization

Author: Chen, Shimao, Liu, Zirui, Wu, Zhiying, Zheng, Ce, Cong, Peizhuang, Jiang, Zihan, Wu, Yuhan, Su, Lei, and Yang, Tong
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: As the foundation of large language models (LLMs), self-attention module faces the challenge of quadratic time and memory complexity with respect to sequence length. FlashAttention accelerates attention computation and reduces its memory usage by leveraging the GPU memory hierarchy. A promising research direction is to integrate FlashAttention with quantization methods. This paper introduces INT-FlashAttention, the first INT8 quantization architecture compatible with the forward workflow of FlashAttention, which significantly improves the inference speed of FlashAttention on Ampere GPUs. We implement our INT-FlashAttention prototype with fully INT8 activations and general matrix-multiplication (GEMM) kernels, making it the first attention operator with fully INT8 input. As a general token-level post-training quantization framework, INT-FlashAttention is also compatible with other data formats like INT4, etc. Experimental results show INT-FlashAttention achieves 72% faster inference speed and 82% smaller quantization error compared to standard FlashAttention with FP16 and FP8 data format.
Published: 2024

9. From Commands to Prompts: LLM-based Semantic File System for AIOS

Author: Shi, Zeru, Mei, Kai, Su, Yongye, Zuo, Chaoji, Hua, Wenyue, Xu, Wujiang, Ren, Yujie, Liu, Zirui, Du, Mengnan, Deng, Dong, and Zhang, Yongfeng
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Databases, Computer Science - Machine Learning
Abstract: Large language models (LLMs) have demonstrated significant potential in the development of intelligent applications and systems such as LLM-based agents and agent operating systems (AIOS). However, when these applications and systems interact with the underlying file system, the file system still remains the traditional paradigm: reliant on manual navigation through precise commands. This paradigm poses a bottleneck to the usability of these systems as users are required to navigate complex folder hierarchies and remember cryptic file names. To address this limitation, we propose an LLM-based semantic file system ( LSFS ) for prompt-driven file management. Unlike conventional approaches, LSFS incorporates LLMs to enable users or agents to interact with files through natural language prompts, facilitating semantic file management. At the macro-level, we develop a comprehensive API set to achieve semantic file management functionalities, such as semantic file retrieval, file update monitoring and summarization, and semantic file rollback). At the micro-level, we store files by constructing semantic indexes for them, design and implement syscalls of different semantic operations (e.g., CRUD, group by, join) powered by vector database. Our experiments show that LSFS offers significant improvements over traditional file systems in terms of user convenience, the diversity of supported functions, and the accuracy and efficiency of file operations. Additionally, with the integration of LLM, our system enables more intelligent file management tasks, such as content summarization and version comparison, further enhancing its capabilities.
Published: 2024

10. Assessing and Enhancing Large Language Models in Rare Disease Question-answering

Author: Wang, Guanchu, Ran, Junhao, Tang, Ruixiang, Chang, Chia-Yuan, Chuang, Yu-Neng, Liu, Zirui, Braverman, Vladimir, Liu, Zhandong, and Hu, Xia
Subjects: Computer Science - Computational Engineering, Finance, and Science, Computer Science - Artificial Intelligence
Abstract: Despite the impressive capabilities of Large Language Models (LLMs) in general medical domains, questions remain about their performance in diagnosing rare diseases. To answer this question, we aim to assess the diagnostic performance of LLMs in rare diseases, and explore methods to enhance their effectiveness in this area. In this work, we introduce a rare disease question-answering (ReDis-QA) dataset to evaluate the performance of LLMs in diagnosing rare diseases. Specifically, we collected 1360 high-quality question-answer pairs within the ReDis-QA dataset, covering 205 rare diseases. Additionally, we annotated meta-data for each question, facilitating the extraction of subsets specific to any given disease and its property. Based on the ReDis-QA dataset, we benchmarked several open-source LLMs, revealing that diagnosing rare diseases remains a significant challenge for these models. To facilitate retrieval augmentation generation for rare disease diagnosis, we collect the first rare diseases corpus (ReCOP), sourced from the National Organization for Rare Disorders (NORD) database. Specifically, we split the report of each rare disease into multiple chunks, each representing a different property of the disease, including their overview, symptoms, causes, effects, related disorders, diagnosis, and standard therapies. This structure ensures that the information within each chunk aligns consistently with a question. Experiment results demonstrate that ReCOP can effectively improve the accuracy of LLMs on the ReDis-QA dataset by an average of 8%. Moreover, it significantly guides LLMs to generate trustworthy answers and explanations that can be traced back to existing literature.
Published: 2024

11. Research on Tibetan Tourism Viewpoints information generation system based on LLM

Author: Qi, Jinhu, Yan, Shuai, Zhang, Wentao, Zhang, Yibo, Liu, Zirui, and Wang, Ke
Subjects: Computer Science - Computation and Language
Abstract: Tibet, ensconced within China's territorial expanse, is distinguished by its labyrinthine and heterogeneous topography, a testament to its profound historical heritage, and the cradle of a unique religious ethos. The very essence of these attributes, however, has impeded the advancement of Tibet's tourism service infrastructure, rendering existing smart tourism services inadequate for the region's visitors. This study delves into the ramifications of informational disparities at tourist sites on Tibetan tourism and addresses the challenge of establishing the Large Language Model (LLM) evaluation criteria. It introduces an innovative approach, the DualGen Bridge AI system, employing supervised fine-tuning techniques to bolster model functionality and enhance optimization processes. Furthermore, it pioneers a multi-structured generative results assessment framework. Empirical validation confirms the efficacy of this framework. The study also explores the application of the supervised fine-tuning method within the proprietary DualGen Bridge AI, aimed at refining the generation of tourist site information. The study's findings offer valuable insights for optimizing system performance and provide support and inspiration for the application of LLM technology in Tibet's tourism services and beyond, potentially revolutionizing the smart tourism industry with advanced, tailored information generation capabilities.
Published: 2024

12. KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

Author: Yuan, Jiayi, Liu, Hongyi, Zhong, Shaochen, Chuang, Yu-Neng, Li, Songchen, Wang, Guanchu, Le, Duy, Jin, Hongye, Chaudhary, Vipin, Xu, Zhaozhuo, Liu, Zirui, and Hu, Xia
Subjects: Computer Science - Computation and Language
Abstract: Long context capability is a crucial competency for large language models (LLMs) as it mitigates the human struggle to digest long-form texts. This capability enables complex task-solving scenarios such as book summarization, code assistance, and many more tasks that are traditionally manpower-intensive. However, transformer-based LLMs face significant challenges with long context input due to the growing size of the KV cache and the intrinsic complexity of attending to extended inputs; where multiple schools of efficiency-driven approaches - such as KV cache quantization, token dropping, prompt compression, linear-time sequence models, and hybrid architectures - have been proposed to produce efficient yet long context-capable models. Despite these advancements, no existing work has comprehensively benchmarked these methods in a reasonably aligned environment. In this work, we fill this gap by providing a taxonomy of current methods and evaluating 10+ state-of-the-art approaches across seven categories of long context tasks. Our work reveals numerous previously unknown phenomena and offers insights - as well as a friendly workbench - for the future development of long context-capable LLMs. The source code is available at https://github.com/henryzhongsc/longctx_bench.
Published: 2024

13. Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity

Author: Guo, Wentao, Long, Jikai, Zeng, Yimeng, Liu, Zirui, Yang, Xinyu, Ran, Yide, Gardner, Jacob R., Bastani, Osbert, De Sa, Christopher, Yu, Xiaodong, Chen, Beidi, and Xu, Zhaozhuo
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Zeroth-order optimization (ZO) is a memory-efficient strategy for fine-tuning Large Language Models using only forward passes. However, the application of ZO fine-tuning in memory-constrained settings such as mobile phones and laptops is still challenging since full precision forward passes are infeasible. In this study, we address this limitation by integrating sparsity and quantization into ZO fine-tuning of LLMs. Specifically, we investigate the feasibility of fine-tuning an extremely small subset of LLM parameters using ZO. This approach allows the majority of un-tuned parameters to be quantized to accommodate the constraint of limited device memory. Our findings reveal that the pre-training process can identify a set of "sensitive parameters" that can guide the ZO fine-tuning of LLMs on downstream tasks. Our results demonstrate that fine-tuning 0.1% sensitive parameters in the LLM with ZO can outperform the full ZO fine-tuning performance, while offering wall-clock time speedup. Additionally, we show that ZO fine-tuning targeting these 0.1% sensitive parameters, combined with 4 bit quantization, enables efficient ZO fine-tuning of an Llama2-7B model on a GPU device with less than 8 GiB of memory and notably reduced latency.
Published: 2024

14. Active zero-sequence voltage injection modulation strategy for capacitor current suppression in dual sector PMSM systems

Author: Shao, Yifei, Kong, Wubin, Liu, Zirui, Guo, Haifeng, Wang, Longxiao, Li, Dawei, and Qu, Ronghai
Published: 2025
Full Text: View/download PDF

15. High-resolution Simulation Dataset of Hourly PM2.5 Chemical Composition in China (CAQRA-aerosol) from 2013 to 2020

Author: Kong, Lei, Tang, Xiao, Zhu, Jiang, Wang, Zifa, Liu, Bing, Zhu, Yuanyuan, Zhu, Lili, Chen, Duohong, Hu, Ke, Wu, Huangjian, Wu, Qian, Shen, Jin, Sun, Yele, Liu, Zirui, Xin, Jinyuan, Ji, Dongsheng, and Zheng, Mei
Published: 2025
Full Text: View/download PDF

16. Postdigital Videogames Literacies: Thinking With, Through, and Beyond James Gee’s Learning Principles

Author: Bacalja, Alexander, Nichols, T. Phillip, Robinson, Bradley, Bhatt, Ibrar, Kucharczyk, Stefan, Zomer, Chris, Nash, Brady, Dupont, Bruno, De Cock, Rozane, Zaman, Bieke, Bonenfant, Maude, Grosemans, Eva, Abrams, Sandra Schamroth, Vallis, Carmen, Koutsogiannis, Dimitrios, Dishon, Gideon, Reed, Jack, Byers, Thomas, Fawzy, Rania Magdi, Hsu, Hsiao-Ping, Lowien, Nathan, Barton, Georgina, Callow, Jon, Liu, Zirui, Serafini, Frank, Vermeire, Zowi, deHaan, Jonathan, Croasdale, Alison, Torres-Toukoumidis, Angel, Xu, Xiao, and Schnaider, Karoline
Published: 2024
Full Text: View/download PDF

17. SimiSketch: Efficiently Estimating Similarity of streaming Multisets

Author: Dong, Fenghao, He, Yang, Liang, Yutong, Liu, Zirui, Wu, Yuhan, Chen, Peiqing, and Yang, Tong
Subjects: Computer Science - Data Structures and Algorithms
Abstract: The challenge of estimating similarity between sets has been a significant concern in data science, finding diverse applications across various domains. However, previous approaches, such as MinHash, have predominantly centered around hashing techniques, which are well-suited for sets but less naturally adaptable to multisets, a common occurrence in scenarios like network streams and text data. Moreover, with the increasing prevalence of data arriving in streaming patterns, many existing methods struggle to handle cases where set items are presented in a continuous stream. Consequently, our focus in this paper is on the challenging scenario of multisets with item streams. To address this, we propose SimiSketch, a sketching algorithm designed to tackle this specific problem. The paper begins by presenting two simpler versions that employ intuitive sketches for similarity estimation. Subsequently, we formally introduce SimiSketch and leverage SALSA to enhance accuracy. To validate our algorithms, we conduct extensive testing on synthetic datasets, real-world network traffic, and text articles. Our experiment shows that compared with the state-of-the-art, SimiSketch can improve the accuracy by up to 42 times, and increase the throughput by up to 360 times. The complete source code is open-sourced and available on GitHub for reference.
Published: 2024

18. CuckooGraph: A Scalable and Space-Time Efficient Data Structure for Large-Scale Dynamic Graphs

Author: Fan, Zhuochen, Cai, Yalun, Liu, Zirui, Guo, Jiarui, Fan, Xin, Yang, Tong, and Cui, Bin
Subjects: Computer Science - Databases, Computer Science - Data Structures and Algorithms
Abstract: Graphs play an increasingly important role in various big data applications. However, existing graph data structures cannot simultaneously address the performance bottlenecks caused by the dynamic updates, large scale, and high query complexity of current graphs. This paper proposes a novel data structure for large-scale dynamic graphs called CuckooGraph. It does not require any prior knowledge of the upcoming graphs, and can adaptively resize to the most memory-efficient form while requiring few memory accesses for very fast graph data processing. The key techniques of CuckooGraph include TRANSFORMATION and DENYLIST. TRANSFORMATION fully utilizes the limited memory by designing related data structures that allow flexible space transformations to smoothly expand/tighten the required space depending on the number of incoming items. DENYLIST efficiently handles item insertion failures and further improves processing speed. Our experimental results show that compared with the most competitive solution Spruce, CuckooGraph achieves about $33\times$ higher insertion throughput while requiring only about $68\%$ of the memory space., Comment: 2025 IEEE International Conference on Data Engineering (ICDE)
Published: 2024

19. Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages

Author: Li, Zihao, Shi, Yucheng, Liu, Zirui, Yang, Fan, Payani, Ali, Liu, Ninghao, and Du, Mengnan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The development of Large Language Models (LLMs) relies on extensive text corpora, which are often unevenly distributed across languages. This imbalance results in LLMs performing significantly better on high-resource languages like English, German, and French, while their capabilities in low-resource languages remain inadequate. Currently, there is a lack of quantitative methods to evaluate the performance of LLMs in these low-resource languages. To address this gap, we propose the Language Ranker, an intrinsic metric designed to benchmark and rank languages based on LLM performance using internal representations. By comparing the LLM's internal representation of various languages against a baseline derived from English, we can assess the model's multilingual capabilities in a robust and language-agnostic manner. Our analysis reveals that high-resource languages exhibit higher similarity scores with English, demonstrating superior performance, while low-resource languages show lower similarity scores, underscoring the effectiveness of our metric in assessing language-specific capabilities. Besides, the experiments show that there is a strong correlation between the LLM's performance in different languages and the proportion of those languages in its pre-training corpus. These insights underscore the efficacy of the Language Ranker as a tool for evaluating LLM performance across different languages, particularly those with limited resources., Comment: Accepted by AAAI 2025 (Social Impact Track)
Published: 2024

20. Survey of Computerized Adaptive Testing: A Machine Learning Perspective

Author: Liu, Qi, Zhuang, Yan, Bi, Haoyang, Huang, Zhenya, Huang, Weizhe, Li, Jiatong, Yu, Junhao, Liu, Zirui, Hu, Zirui, Hong, Yuting, Pardos, Zachary A., Ma, Haiping, Zhu, Mengxiao, Wang, Shijin, and Chen, Enhong
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computers and Society, Computer Science - Information Retrieval
Abstract: Computerized Adaptive Testing (CAT) provides an efficient and tailored method for assessing the proficiency of examinees, by dynamically adjusting test questions based on their performance. Widely adopted across diverse fields like education, healthcare, sports, and sociology, CAT has revolutionized testing practices. While traditional methods rely on psychometrics and statistics, the increasing complexity of large-scale testing has spurred the integration of machine learning techniques. This paper aims to provide a machine learning-focused survey on CAT, presenting a fresh perspective on this adaptive testing method. By examining the test question selection algorithm at the heart of CAT's adaptivity, we shed light on its functionality. Furthermore, we delve into cognitive diagnosis models, question bank construction, and test control within CAT, exploring how machine learning can optimize these components. Through an analysis of current methods, strengths, limitations, and challenges, we strive to develop robust, fair, and efficient CAT systems. By bridging psychometric-driven CAT research with machine learning, this survey advocates for a more inclusive and interdisciplinary approach to the future of adaptive testing.
Published: 2024

21. Enabling space-time efficient range queries with REncoder

Author: Fan, Zhuochen, Ye, Bowen, Wang, Ziwei, Zhong, Zheng, Guo, Jiarui, Wu, Yuhan, Li, Haoyu, Yang, Tong, Tu, Yaofeng, Liu, Zirui, and Cui, Bin
Published: 2024
Full Text: View/download PDF

22. Isolated Perfused Hearts for Cardiovascular Research: An Old Dog with New Tricks

Author: Yang, Tianshuo, Liu, Zirui, Shu, Songren, Chen, Zhice, Hua, Xiumeng, and Song, Jiangping
Published: 2024
Full Text: View/download PDF

23. WavingSketch: an unbiased and generic sketch for finding top-k items in data streams

Author: Liu, Zirui, Dong, Fenghao, Liu, Chengwu, Deng, Xiangwei, Yang, Tong, Zhao, Yikai, Li, Jizhou, Cui, Bin, and Zhang, Gong
Published: 2024
Full Text: View/download PDF

24. LoRA-as-an-Attack! Piercing LLM Safety Under The Share-and-Play Scenario

Author: Liu, Hongyi, Liu, Zirui, Tang, Ruixiang, Yuan, Jiayi, Zhong, Shaochen, Chuang, Yu-Neng, Li, Li, Chen, Rui, and Hu, Xia
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Fine-tuning LLMs is crucial to enhancing their task-specific performance and ensuring model behaviors are aligned with human preferences. Among various fine-tuning methods, LoRA is popular for its efficiency and ease to use, allowing end-users to easily post and adopt lightweight LoRA modules on open-source platforms to tailor their model for different customization. However, such a handy share-and-play setting opens up new attack surfaces, that the attacker can render LoRA as an attacker, such as backdoor injection, and widely distribute the adversarial LoRA to the community easily. This can result in detrimental outcomes. Despite the huge potential risks of sharing LoRA modules, this aspect however has not been fully explored. To fill the gap, in this study we thoroughly investigate the attack opportunities enabled in the growing share-and-play scenario. Specifically, we study how to inject backdoor into the LoRA module and dive deeper into LoRA's infection mechanisms. We found that training-free mechanism is possible in LoRA backdoor injection. We also discover the impact of backdoor attacks with the presence of multiple LoRA adaptions concurrently as well as LoRA based backdoor transferability. Our aim is to raise awareness of the potential risks under the emerging share-and-play scenario, so as to proactively prevent potential consequences caused by LoRA-as-an-Attack. Warning: the paper contains potential offensive content generated by models.
Published: 2024

25. Learning to Compress Prompt in Natural Language Formats

Author: Chuang, Yu-Neng, Xing, Tianwei, Chang, Chia-Yuan, Liu, Zirui, Chen, Xun, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) are great at processing multiple natural language processing tasks, but their abilities are constrained by inferior performance with long context, slow inference speed, and the high cost of computing the results. Deploying LLMs with precise and informative context helps users process large-scale datasets more effectively and cost-efficiently. Existing works rely on compressing long prompt contexts into soft prompts. However, soft prompt compression encounters limitations in transferability across different LLMs, especially API-based LLMs. To this end, this work aims to compress lengthy prompts in the form of natural language with LLM transferability. This poses two challenges: (i) Natural Language (NL) prompts are incompatible with back-propagation, and (ii) NL prompts lack flexibility in imposing length constraints. In this work, we propose a Natural Language Prompt Encapsulation (Nano-Capsulator) framework compressing original prompts into NL formatted Capsule Prompt while maintaining the prompt utility and transferability. Specifically, to tackle the first challenge, the Nano-Capsulator is optimized by a reward function that interacts with the proposed semantics preserving loss. To address the second question, the Nano-Capsulator is optimized by a reward function featuring length constraints. Experimental results demonstrate that the Capsule Prompt can reduce 81.4% of the original length, decrease inference latency up to 4.5x, and save 80.1% of budget overheads while providing transferability across diverse LLMs and different datasets.
Published: 2024

26. KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Author: Liu, Zirui, Yuan, Jiayi, Jin, Hongye, Zhong, Shaochen, Xu, Zhaozhuo, Braverman, Vladimir, Chen, Beidi, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Performance
Abstract: Efficiently serving large language models (LLMs) requires batching of many requests to reduce the cost per request. Yet, with larger batch sizes and longer context lengths, the key-value (KV) cache, which stores attention keys and values to avoid re-computations, significantly increases memory demands and becomes the new bottleneck in speed and memory usage. Additionally, the loading of the KV cache causes the computational core to be idle, which limits the inference speed. A straightforward and effective solution to reduce KV cache size is quantization, which decreases the total bytes taken by KV cache. However, there is a lack of in-depth studies that explore the element distribution of KV cache to understand the hardness and limitation of KV cache quantization. To fill the gap, we conducted a comprehensive study on the element distribution in KV cache of popular LLMs. Our findings indicate that the key cache should be quantized per-channel, i.e., group elements along the channel dimension and quantize them together. In contrast, the value cache should be quantized per-token. From this analysis, we developed a tuning-free 2bit KV cache quantization algorithm named KIVI. With hardware-friendly implementation, KIVI can enable Llama, Falcon, and Mistral models to maintain almost the same quality while using $\mathbf{2.6\times}$ less peak memory (including model weight). This reduction in memory usage enables up to $\mathbf{4\times}$ larger batch size, bringing $\mathbf{2.35\times \sim 3.47\times}$ throughput on real LLM inference workload. The source code is available at https://github.com/jy-yuan/KIVI., Comment: ICML2024
Published: 2024
Full Text: View/download PDF

27. FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference

Author: Liu, Zirui, Song, Qingquan, Xiao, Qiang Charles, Selvaraj, Sathiya Keerthi, Mazumder, Rahul, Gupta, Aman, and Hu, Xia
Subjects: Computer Science - Computation and Language
Abstract: The large number of parameters in Pretrained Language Models enhance their performance, but also make them resource-intensive, making it challenging to deploy them on commodity hardware like a single GPU. Due to the memory and power limitations of these devices, model compression techniques are often used to decrease both the model's size and its inference latency. This usually results in a trade-off between model accuracy and efficiency. Therefore, optimizing this balance is essential for effectively deploying LLMs on commodity hardware. A significant portion of the efficiency challenge is the Feed-forward network (FFN) component, which accounts for roughly $\frac{2}{3}$ total parameters and inference latency. In this paper, we first observe that only a few neurons of FFN module have large output norm for any input tokens, a.k.a. heavy hitters, while the others are sparsely triggered by different tokens. Based on this observation, we explicitly split the FFN into two parts according to the heavy hitters. We improve the efficiency-accuracy trade-off of existing compression methods by allocating more resource to FFN parts with heavy hitters. In practice, our method can reduce model size by 43.1\% and bring $1.25\sim1.56\times$ wall clock time speedup on different hardware with negligible accuracy drop.
Published: 2024

28. LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Author: Jin, Hongye, Han, Xiaotian, Yang, Jingfeng, Jiang, Zhimeng, Liu, Zirui, Chang, Chia-Yuan, Chen, Huiyuan, and Hu, Xia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: It is well known that LLMs cannot generalize well to long contexts whose lengths are larger than the training sequence length. This poses challenges when employing LLMs for processing long input sequences during inference. In this work, we argue that LLMs themselves have inherent capabilities to handle long contexts without fine-tuning. To achieve this goal, we propose SelfExtend to extend the context window of LLMs by constructing bi-level attention information: the grouped attention and the neighbor attention. The grouped attention captures the dependencies among tokens that are far apart, while neighbor attention captures dependencies among adjacent tokens within a specified range. The two-level attentions are computed based on the original model's self-attention mechanism during inference. With minor code modification, our SelfExtend can effortlessly extend existing LLMs' context window without any fine-tuning. We conduct comprehensive experiments on multiple benchmarks and the results show that our SelfExtend can effectively extend existing LLMs' context window length. The code can be found at \url{https://github.com/datamllab/LongLM}., Comment: ICML2024 Spotlight
Published: 2024

29. TVE: Learning Meta-attribution for Transferable Vision Explainer

Author: Wang, Guanchu, Chuang, Yu-Neng, Yang, Fan, Du, Mengnan, Chang, Chia-Yuan, Zhong, Shaochen, Liu, Zirui, Xu, Zhaozhuo, Zhou, Kaixiong, Cai, Xuanting, and Hu, Xia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Explainable machine learning significantly improves the transparency of deep neural networks. However, existing work is constrained to explaining the behavior of individual model predictions, and lacks the ability to transfer the explanation across various models and tasks. This limitation results in explaining various tasks being time- and resource-consuming. To address this problem, we introduce a Transferable Vision Explainer (TVE) that can effectively explain various vision models in downstream tasks. Specifically, the transferability of TVE is realized through a pre-training process on large-scale datasets towards learning the meta-attribution. This meta-attribution leverages the versatility of generic backbone encoders to comprehensively encode the attribution knowledge for the input instance, which enables TVE to seamlessly transfer to explain various downstream tasks, without the need for training on task-specific data. Empirical studies involve explaining three different architectures of vision models across three diverse downstream datasets. The experimental results indicate TVE is effective in explaining these tasks without the need for additional training on downstream data.
Published: 2023

30. Chasing Fairness in Graphs: A GNN Architecture Perspective

Author: Jiang, Zhimeng, Han, Xiaotian, Fan, Chao, Liu, Zirui, Zou, Na, Mostafavi, Ali, and Hu, Xia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: There has been significant progress in improving the performance of graph neural networks (GNNs) through enhancements in graph data, model architecture design, and training strategies. For fairness in graphs, recent studies achieve fair representations and predictions through either graph data pre-processing (e.g., node feature masking, and topology rewiring) or fair training strategies (e.g., regularization, adversarial debiasing, and fair contrastive learning). How to achieve fairness in graphs from the model architecture perspective is less explored. More importantly, GNNs exhibit worse fairness performance compared to multilayer perception since their model architecture (i.e., neighbor aggregation) amplifies biases. To this end, we aim to achieve fairness via a new GNN architecture. We propose \textsf{F}air \textsf{M}essage \textsf{P}assing (FMP) designed within a unified optimization framework for GNNs. Notably, FMP \textit{explicitly} renders sensitive attribute usage in \textit{forward propagation} for node classification task using cross-entropy loss without data pre-processing. In FMP, the aggregation is first adopted to utilize neighbors' information and then the bias mitigation step explicitly pushes demographic group node presentation centers together. In this way, FMP scheme can aggregate useful information from neighbors and mitigate bias to achieve better fairness and prediction tradeoff performance. Experiments on node classification tasks demonstrate that the proposed FMP outperforms several baselines in terms of fairness and accuracy on three real-world datasets. The code is available in {\url{https://github.com/zhimengj0326/FMP}}., Comment: Accepted by AAAI Conference on Artificial Intelligence (AAAI) 2024. arXiv admin note: substantial text overlap with arXiv:2202.04187
Published: 2023

31. CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models

Author: Zhang, Hailin, Liu, Zirui, Chen, Boxuan, Zhao, Yikai, Zhao, Tong, Yang, Tong, and Cui, Bin
Subjects: Computer Science - Machine Learning
Abstract: Recently, the growing memory demands of embedding tables in Deep Learning Recommendation Models (DLRMs) pose great challenges for model training and deployment. Existing embedding compression solutions cannot simultaneously meet three key design requirements: memory efficiency, low latency, and adaptability to dynamic data distribution. This paper presents CAFE, a Compact, Adaptive, and Fast Embedding compression framework that addresses the above requirements. The design philosophy of CAFE is to dynamically allocate more memory resources to important features (called hot features), and allocate less memory to unimportant ones. In CAFE, we propose a fast and lightweight sketch data structure, named HotSketch, to capture feature importance and report hot features in real time. For each reported hot feature, we assign it a unique embedding. For the non-hot features, we allow multiple features to share one embedding by using hash embedding technique. Guided by our design philosophy, we further propose a multi-level hash embedding framework to optimize the embedding tables of non-hot features. We theoretically analyze the accuracy of HotSketch, and analyze the model convergence against deviation. Extensive experiments show that CAFE significantly outperforms existing embedding compression methods, yielding 3.92% and 3.68% superior testing AUC on Criteo Kaggle dataset and CriteoTB dataset at a compression ratio of 10000x. The source codes of CAFE are available at GitHub.
Published: 2023

32. Quantitative analysis model of wind energy resources in adjacent area of existing wind farms

Author: Ma, Junpeng, Liu, Feiyan, Wang, Kairan, Xiao, Chenggang, and Liu, Zirui
Published: 2024
Full Text: View/download PDF

33. Gaidai multimodal risk evaluation methodology based on cargo vessel onboard measurements, given structural damage accumulation

Author: Gaidai, Oleg, Ashraf, Alia, Cao, Yu, Sheng, Jinlu, Li, Hongchen, Liu, Zirui, and Zhu, Yan
Published: 2024
Full Text: View/download PDF

34. Identification of diagnostic markers pyrodeath-related genes in non-alcoholic fatty liver disease based on machine learning and experiment validation

Author: Lei, Liping, Li, Jixue, Liu, Zirui, Zhang, Dongdong, Liu, Zihan, Wang, Qing, Gao, Yi, Mo, Biwen, and Li, Jiangfa
Published: 2024
Full Text: View/download PDF

35. Onboard multivariate hazard assessment for UIKKU chemical tanker by Gaidai reliability method

Author: Gaidai, Oleg, Ashraf, Alia, Sheng, Jinlu, Zhu, Yan, Yakimov, Vladimir, Li, Hongchen, Liu, Zirui, and Cao, Yu
Published: 2024
Full Text: View/download PDF

36. Molecular characterization of atmospheric organic aerosols in typical megacities in China

Author: Zhang, Miaomiao, Cai, Dongmei, Lin, Jingxin, Liu, Zirui, Li, Mei, Wang, Yuesi, and Chen, Jianmin
Published: 2024
Full Text: View/download PDF

37. Insight into wet scavenging effects on sulfur and nitrogen containing organic compounds in urban Beijing

Author: Zhang, Chunyan, Wang, Yonghong, Liu, Jun, Chen, Tianzeng, Huang, Wei, Liu, Zirui, Chu, Biwu, Ma, Qingxin, and He, Hong
Published: 2024
Full Text: View/download PDF

38. Relative humidity driven nocturnal HONO formation mechanism in autumn haze events of Beijing

Author: Xuan, Huiying, Liu, Jun, Zhao, Yaqi, Cao, Qing, Chen, Tianzeng, Wang, Yonghong, Liu, Zirui, Sun, Xu, Li, Hao, Zhang, Peng, Chu, Biwu, Ma, Qingxin, and He, Hong
Published: 2024
Full Text: View/download PDF

39. FPSO/LNG hawser system lifetime assessment by Gaidai multivariate risk assessment method

Author: Gaidai, Oleg, Cao, Yu, Ashraf, Alia, Sheng, Jinlu, Zhu, Yan, and Liu, Zirui
Published: 2024
Full Text: View/download PDF

40. Lifetime assessment of semi-submersible wind turbines by Gaidai risk evaluation method

Author: Gaidai, Oleg, Ashraf, Alia, Cao, Yu, Sheng, Jinlu, Zhu, Yan, and Liu, Zirui
Published: 2024
Full Text: View/download PDF

41. The CDE region of feline Calicivirus VP1 protein is a potential candidate subunit vaccine

Author: Li, Lisha, Liu, Zirui, Shi, Jiale, Yang, Mengfang, Yan, Yuanyuan, Fu, Yanan, Shen, Zhou, and Peng, Guiqing
Published: 2024
Full Text: View/download PDF

42. Single-cell RNA sequencing in donor and end-stage heart failure patients identifies NLRP3 as a therapeutic target for arrhythmogenic right ventricular cardiomyopathy

Author: Fu, Mengxia, Hua, Xiumeng, Shu, Songren, Xu, Xinjie, Zhang, Hang, Peng, Zhiming, Mo, Han, Liu, Yanyun, Chen, Xiao, Yang, Yicheng, Zhang, Ningning, Wang, Xiaohu, Liu, Zirui, Yue, Guangxin, Hu, Shengshou, and Song, Jiangping
Published: 2024
Full Text: View/download PDF

43. 4400 TEU cargo ship dynamic analysis by Gaidai reliability method

Author: Gaidai, Oleg, Wang, Fang, Cao, Yu, and Liu, Zirui
Published: 2024
Full Text: View/download PDF

44. Enhanced exclusive enteral nutrition delivery during the first 7 days is associated with decreased 28-day mortality in critically ill patients with normal lactate level: a post hoc analysis of a multicenter randomized trial

Author: Chen, Yizhe, Liu, Zirui, Wang, Qiuhui, Gao, Fei, Xu, Hongyang, Ke, Lu, Lee, Zheng-Yii, Stoppe, Christian, Heyland, Daren K., Liang, Fengming, and Lin, Jiajia
Published: 2024
Full Text: View/download PDF

45. Experimental Analysis of Large-scale Learnable Vector Storage Compression

Author: Zhang, Hailin, Zhao, Penghao, Miao, Xupeng, Shao, Yingxia, Liu, Zirui, Yang, Tong, and Cui, Bin
Subjects: Computer Science - Machine Learning, Computer Science - Databases, Computer Science - Information Retrieval
Abstract: Learnable embedding vector is one of the most important applications in machine learning, and is widely used in various database-related domains. However, the high dimensionality of sparse data in recommendation tasks and the huge volume of corpus in retrieval-related tasks lead to a large memory consumption of the embedding table, which poses a great challenge to the training and deployment of models. Recent research has proposed various methods to compress the embeddings at the cost of a slight decrease in model quality or the introduction of other overheads. Nevertheless, the relative performance of these methods remains unclear. Existing experimental comparisons only cover a subset of these methods and focus on limited metrics. In this paper, we perform a comprehensive comparative analysis and experimental evaluation of embedding compression. We introduce a new taxonomy that categorizes these techniques based on their characteristics and methodologies, and further develop a modular benchmarking framework that integrates 14 representative methods. Under a uniform test environment, our benchmark fairly evaluates each approach, presents their strengths and weaknesses under different memory budgets, and recommends the best method based on the use case. In addition to providing useful guidelines, our study also uncovers the limitations of current methods and suggests potential directions for future research.
Published: 2023

46. Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots

Author: Tang, Ruixiang, Yuan, Jiayi, Li, Yiming, Liu, Zirui, Chen, Rui, and Hu, Xia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: In the field of natural language processing, the prevalent approach involves fine-tuning pretrained language models (PLMs) using local samples. Recent research has exposed the susceptibility of PLMs to backdoor attacks, wherein the adversaries can embed malicious prediction behaviors by manipulating a few training samples. In this study, our objective is to develop a backdoor-resistant tuning procedure that yields a backdoor-free model, no matter whether the fine-tuning dataset contains poisoned samples. To this end, we propose and integrate a honeypot module into the original PLM, specifically designed to absorb backdoor information exclusively. Our design is motivated by the observation that lower-layer representations in PLMs carry sufficient backdoor features while carrying minimal information about the original tasks. Consequently, we can impose penalties on the information acquired by the honeypot module to inhibit backdoor creation during the fine-tuning process of the stem network. Comprehensive experiments conducted on benchmark datasets substantiate the effectiveness and robustness of our defensive strategy. Notably, these results indicate a substantial reduction in the attack success rate ranging from 10\% to 40\% when compared to prior state-of-the-art methods.
Published: 2023

47. Label-free single-vesicle based surface enhanced Raman spectroscopy: A robust approach for investigating the biomolecular composition of small extracellular vesicles

Author: Liu, Zirui, Ng, Martin, Srivastava, Siddharth, Li, Tieyi, Liu, Jun, Phu, Tuan Anh, Mateescu, Bogdan, Wang, Yi-Ting, Tsai, Chia-Feng, Liu, Tao, Raffai, Robert L, and Xie, Ya-Hong
Subjects: Analytical Chemistry, Biological Sciences, Chemical Sciences, Generic health relevance, Spectrum Analysis, Raman, Extracellular Vesicles, Humans, Chromatography, Gel, Machine Learning, Mass Spectrometry, General Science & Technology
Abstract: Small extracellular vesicles (sEVs) are cell-released vesicles ranging from 30-150nm in size. They have garnered increasing attention because of their potential for both the diagnosis and treatment of disease. The diversity of sEVs derives from their biological composition and cargo content. Currently, the isolation of sEV subpopulations is primarily based on bio-physical and affinity-based approaches. Since a standardized definition for sEV subpopulations is yet to be fully established, it is important to further investigate the correlation between the biomolecular composition of sEVs and their physical properties. In this study, we employed a platform combining single-vesicle surface-enhanced Raman spectroscopy (SERS) and machine learning to examine individual sEVs isolated by size-exclusion chromatography (SEC). The biomolecular composition of each vesicle examined was reflected by its corresponding SERS spectral features (biomolecular "fingerprints"), with their roots in the composition of their collective Raman-active bonds. Origins of the SERS spectral features were validated through a comparative analysis between SERS and mass spectrometry (MS). SERS fingerprinting of individual vesicles was effective in overcoming the challenges posed by EV population averaging, allowing for the possibility of analyzing the variations in biomolecular composition between the vesicles of similar and/or different sizes. Using this approach, we uncovered that each of the size-based fractions of sEVs contained particles with predominantly similar SERS spectral features. Indeed, more than 84% of the vesicles residing within a particular group were clearly distinguishable from that of the other EV sub-populations, despite some spectral variations within each sub-population. Our results suggest the possibility that size-based EV fractionation methods produce samples where similarly eluted sEVs are correlated with their respective biochemical contents, as reflected by their SERS spectra. Our findings therefore highlight the possibility that the biogenesis and respective biological functionalities of the various sEV fractions may be inherently different.
Published: 2024

48. Seasonal Changes of Brown Carbon in an Industrial City of Central China: Importance of Water-Insoluble BrC

Author: Yan, Guangxuan, Wang, Ziyi, Hou, Mingyang, Wang, Jieqi, Shang, Sai, Shang, Ziyang, Li, Jiayi, Yu, Hao, Cao, Zhiguo, and Liu, Zirui
Published: 2024
Full Text: View/download PDF

49. Fabrication of red-emitting perovskite LEDs by stabilizing their octahedral structure

Author: Kong, Lingmei, Sun, Yuqi, Zhao, Bin, Ji, Kangyu, Feng, Jie, Dong, Jianchao, Wang, Yuanzhi, Liu, Zirui, Maqbool, Shabnum, Li, Yunguo, Yang, Yingguo, Dai, Linjie, Lee, Wanhee, Cho, Changsoon, Stranks, Samuel D., Friend, Richard H., Wang, Ning, Greenham, Neil C., and Yang, Xuyong
Published: 2024
Full Text: View/download PDF

50. Efficient GNN Explanation via Learning Removal-based Attribution

Author: Rong, Yao, Wang, Guanchu, Feng, Qizhang, Liu, Ninghao, Liu, Zirui, Kasneci, Enkelejda, and Hu, Xia
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: As Graph Neural Networks (GNNs) have been widely used in real-world applications, model explanations are required not only by users but also by legal regulations. However, simultaneously achieving high fidelity and low computational costs in generating explanations has been a challenge for current methods. In this work, we propose a framework of GNN explanation named LeArn Removal-based Attribution (LARA) to address this problem. Specifically, we introduce removal-based attribution and demonstrate its substantiated link to interpretability fidelity theoretically and experimentally. The explainer in LARA learns to generate removal-based attribution which enables providing explanations with high fidelity. A strategy of subgraph sampling is designed in LARA to improve the scalability of the training process. In the deployment, LARA can efficiently generate the explanation through a feed-forward pass. We benchmark our approach with other state-of-the-art GNN explanation methods on six datasets. Results highlight the effectiveness of our framework regarding both efficiency and fidelity. In particular, LARA is 3.5 times faster and achieves higher fidelity than the state-of-the-art method on the large dataset ogbn-arxiv (more than 160K nodes and 1M edges), showing its great potential in real-world applications. Our source code is available at https://anonymous.4open.science/r/LARA-10D8/README.md.
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,266 results on '"Liu, Zirui"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources