60,552 results on '"ZHANG Rui"'
Search Results
102. Development and Implementation of Advanced Beam Diagnostic and Abort Systems in SuperKEKB
- Author
-
Yoshihara, Keisuke, Abe, Tetsuro, Aversano, Michele, Gale, Alexander, Ikeda, Hitomi, Kaji, Hiroshi, Kakuno, Hidekazu, Koga, Taichiro, Iijima, Toru, Kato, Shinnosuke, Kusudo, Ami, Liu, Yuxin, Maeda, Akane, Mitra, Sayan, Mitsuka, Gaku, Miyabayashi, Kenkichi, Nakamura, Isamu, Nakayama, Hiroyuki, Nakazawa, Yu, Nomaru, Riku, Okada, Iori, Shi, Xiao-Dong, Tanaka, Shuji, Uno, Kenta, Ushiroda, Yutaka, Urbschat, Bela, and Zhang, Rui
- Subjects
High Energy Physics - Experiment ,Physics - Instrumentation and Detectors - Abstract
The SuperKEKB/Belle II experiment aims to collect high-statistics data of B meson pairs to explore new physics beyond the Standard Model (SM). SuperKEKB, an upgraded version of the KEKB accelerator, has achieved a world-record luminosity of $4.71 \times 10^{34} \, \mathrm{cm^{-2}s^{-1}}$ in 2022 but continues to strive for higher luminosities. One of the major obstacles is Sudden Beam Loss (SBL) events, which cause substantial beam losses and damage to the Belle~II detector. To find a hint for addressing SBL challenges, advanced beam diagnostic systems and enhanced beam abort systems have been developed. The diagnostic system aims to accurately pinpoint the start of beam losses, while the upgraded abort system quickly disposes of anomalous beams to minimize damage. This paper details the development and implementation of these systems, including high-speed loss monitors, time synchronization with the White Rabbit system, and data acquisition systems. Efforts to understand the mechanisms of SBL events, using acoustic sensors to detect discharges, are also discussed. These measures aim to improve the operational stability and luminosity of SuperKEKB, contributing to the experiment's success., Comment: 17 pages, 25 figures
- Published
- 2024
103. Large Language Models for Disease Diagnosis: A Scoping Review
- Author
-
Zhou, Shuang, Xu, Zidu, Zhang, Mian, Xu, Chunpu, Guo, Yawen, Zhan, Zaifu, Ding, Sirui, Wang, Jiashuo, Xu, Kaishuai, Fang, Yi, Xia, Liqiao, Yeung, Jeremy, Zha, Daochen, Melton, Genevieve B., Lin, Mingquan, and Zhang, Rui
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Automatic disease diagnosis has become increasingly valuable in clinical practice. The advent of large language models (LLMs) has catalyzed a paradigm shift in artificial intelligence, with growing evidence supporting the efficacy of LLMs in diagnostic tasks. Despite the increasing attention in this field, a holistic view is still lacking. Many critical aspects remain unclear, such as the diseases and clinical data to which LLMs have been applied, the LLM techniques employed, and the evaluation methods used. In this article, we perform a comprehensive review of LLM-based methods for disease diagnosis. Our review examines the existing literature across various dimensions, including disease types and associated clinical specialties, clinical data, LLM techniques, and evaluation methods. Additionally, we offer recommendations for applying and evaluating LLMs for diagnostic tasks. Furthermore, we assess the limitations of current research and discuss future directions. To our knowledge, this is the first comprehensive review for LLM-based disease diagnosis., Comment: 69 pages
- Published
- 2024
104. TVG: A Training-free Transition Video Generation Method with Diffusion Models
- Author
-
Zhang, Rui, Chen, Yaosen, Liu, Yuegen, Wang, Wei, Wen, Xuming, and Wang, Hongxia
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Transition videos play a crucial role in media production, enhancing the flow and coherence of visual narratives. Traditional methods like morphing often lack artistic appeal and require specialized skills, limiting their effectiveness. Recent advances in diffusion model-based video generation offer new possibilities for creating transitions but face challenges such as poor inter-frame relationship modeling and abrupt content changes. We propose a novel training-free Transition Video Generation (TVG) approach using video-level diffusion models that addresses these limitations without additional training. Our method leverages Gaussian Process Regression ($\mathcal{GPR}$) to model latent representations, ensuring smooth and dynamic transitions between frames. Additionally, we introduce interpolation-based conditional controls and a Frequency-aware Bidirectional Fusion (FBiF) architecture to enhance temporal control and transition reliability. Evaluations of benchmark datasets and custom image pairs demonstrate the effectiveness of our approach in generating high-quality smooth transition videos. The code are provided in https://sobeymil.github.io/tvg.com.
- Published
- 2024
105. A Recursion-Based SNR Determination Method for Short Packet Transmission: Analysis and Applications
- Author
-
Yin, Chengzhe, Zhang, Rui, Li, Yongzhao, Ruan, Yuhan, Li, Tao, and Lu, Jiaheng
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
The short packet transmission (SPT) has gained much attention in recent years. In SPT, the most significant characteristic is that the finite blocklength code (FBC) is adopted. With FBC, the signal-to-noise ratio (SNR) cannot be expressed as an explicit function with respect to the other transmission parameters. This raises the following two problems for the resource allocation in SPTs: (i) The exact value of the SNR is hard to determine, and (ii) The property of SNR w.r.t. the other parameters is hard to analyze, which hinders the efficient optimization of them. To simultaneously tackle these problems, we have developed a recursion method in our prior work. To emphasize the significance of this method, we further analyze the convergence rate of the recursion method and investigate the property of the recursion function in this paper. Specifically, we first analyze the convergence rate of the recursion method, which indicates it can determine the SNR with low complexity. Then, we analyze the property of the recursion function, which facilitates the optimization of the other parameters during the recursion. Finally, we also enumerate some applications for the recursion method. Simulation results indicate that the recursion method converges faster than the other SNR determination methods. Besides, the results also show that the recursion-based methods can almost achieve the optimal solution of the application cases.
- Published
- 2024
106. Minimizing Movement Delay for Movable Antennas via Trajectory Optimization
- Author
-
Li, Qingliang, Mei, Weidong, Ning, Boyu, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Movable antennas (MAs) have received increasing attention in wireless communications due to their capability of antenna position adjustment to reconfigure wireless channels. However, moving MAs results in non-negligible delay, which may decrease the effective data transmission time. To reduce the movement delay, we study in this paper a new MA trajectory optimization problem. In particular, given the desired destination positions of multiple MAs, we aim to jointly optimize their associations with the initial MA positions and the trajectories for moving them from their respective initial to destination positions within a given two-dimensional (2D) region, such that the delay of antenna movement is minimized, subject to the inter-MA minimum distance constraints in the movement. However, this problem is a continuous-time mixed-integer linear programming (MILP) problem that is challenging to solve. To tackle this challenge, we propose a two-stage optimization framework that sequentially optimizes the MAs' position associations and trajectories, respectively. First, we relax the inter-MA distance constraints and optimally solve the resulted delay minimization problem. Next, we check if the obtained MA association and trajectory solutions satisfy the inter-MA distance constraints. If not satisfied, we then employ a successive convex approximation (SCA) algorithm to adjust the MAs' trajectories until they satisfy the given constraints. Simulation results are provided to show the effectiveness of our proposed trajectory optimization method in reducing the movement delay as well as draw useful insights., Comment: 6 pages,6 figures, submit to GLOBECOM 2024 Workshop - IRAFWCC
- Published
- 2024
107. Near-Field Multiuser Communications Aided by Movable Antennas
- Author
-
Ding, Jingze, Zhu, Lipeng, Zhou, Zijian, Jiao, Bingli, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
This letter investigates movable antenna (MA)-aided downlink (DL) multiuser communication systems under the near-field channel condition, in which both the base station (BS) and the users are equipped with MAs to fully exploit the degrees of freedom (DoFs) in antenna position optimization by leveraging the wireless channel variation in spatial regions of large size. The objective is to minimize the transmit power by jointly optimizing the beamformers and the MA positions while satisfying the minimum-achievable-rate requirement for each user. We propose a two-loop dynamic neighborhood pruning particle swarm optimization (DNPPSO) algorithm that significantly reduces computational complexity while effectively maintaining the performance of the standard particle swarm optimization (PSO) algorithm. Simulation results validate the effectiveness and advantages of the proposed scheme in power-saving for multiuser communications., Comment: 5 pages
- Published
- 2024
108. Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models
- Author
-
Zhao, Yunpu, Zhang, Rui, Xiao, Junbin, Ke, Changxin, Hou, Ruibo, Hao, Yifan, Guo, Qi, and Chen, Yunji
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Large Vision-Language Models (LVLMs) have shown significant capability in vision-language understanding. However, one critical issue that persists in these models is sycophancy, which means models are unduly influenced by leading or deceptive prompts, resulting in biased outputs and hallucinations. Despite the progress in LVLMs, evaluating and mitigating sycophancy is yet much under-explored. In this work, we fill this gap by systematically analyzing sycophancy on various VL benchmarks with curated leading queries and further proposing a text contrastive decoding method for mitigation. While the specific sycophantic behavior varies significantly among models, our analysis reveals the severe deficiency of all LVLMs in resilience of sycophancy across various tasks. For improvement, we propose Leading Query Contrastive Decoding (LQCD), a model-agnostic method focusing on calibrating the LVLMs' over-reliance on leading cues by identifying and suppressing the probabilities of sycophancy tokens at the decoding stage. Extensive experiments show that LQCD effectively mitigate sycophancy, outperforming both prompt engineering methods and common methods for hallucination mitigation. We further demonstrate that LQCD does not hurt but even slightly improves LVLMs' responses to neutral queries, suggesting it being a more effective strategy for general-purpose decoding but not limited to sycophancy.
- Published
- 2024
109. DiffZOO: A Purely Query-Based Black-Box Attack for Red-teaming Text-to-Image Generative Model via Zeroth Order Optimization
- Author
-
Dang, Pucheng, Hu, Xing, Li, Dong, Zhang, Rui, Guo, Qi, and Xu, Kaidi
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Current text-to-image (T2I) synthesis diffusion models raise misuse concerns, particularly in creating prohibited or not-safe-for-work (NSFW) images. To address this, various safety mechanisms and red teaming attack methods are proposed to enhance or expose the T2I model's capability to generate unsuitable content. However, many red teaming attack methods assume knowledge of the text encoders, limiting their practical usage. In this work, we rethink the case of \textit{purely black-box} attacks without prior knowledge of the T2l model. To overcome the unavailability of gradients and the inability to optimize attacks within a discrete prompt space, we propose DiffZOO which applies Zeroth Order Optimization to procure gradient approximations and harnesses both C-PRV and D-PRV to enhance attack prompts within the discrete prompt domain. We evaluated our method across multiple safety mechanisms of the T2I diffusion model and online servers. Experiments on multiple state-of-the-art safety mechanisms show that DiffZOO attains an 8.5% higher average attack success rate than previous works, hence its promise as a practical red teaming tool for T2l models.
- Published
- 2024
110. From Lazy to Prolific: Tackling Missing Labels in Open Vocabulary Extreme Classification by Positive-Unlabeled Sequence Learning
- Author
-
Zhang, Ranran Haoran, Uçar, Bensu, Dey, Soumik, Wu, Hansi, Li, Binbin, and Zhang, Rui
- Subjects
Computer Science - Information Retrieval ,Computer Science - Computation and Language - Abstract
Open-vocabulary Extreme Multi-label Classification (OXMC) extends traditional XMC by allowing prediction beyond an extremely large, predefined label set (typically $10^3$ to $10^{12}$ labels), addressing the dynamic nature of real-world labeling tasks. However, self-selection bias in data annotation leads to significant missing labels in both training and test data, particularly for less popular inputs. This creates two critical challenges: generation models learn to be "lazy'" by under-generating labels, and evaluation becomes unreliable due to insufficient annotation in the test set. In this work, we introduce Positive-Unlabeled Sequence Learning (PUSL), which reframes OXMC as an infinite keyphrase generation task, addressing the generation model's laziness. Additionally, we propose to adopt a suite of evaluation metrics, F1@$\mathcal{O}$ and newly proposed B@$k$, to reliably assess OXMC models with incomplete ground truths. In a highly imbalanced e-commerce dataset with substantial missing labels, PUSL generates 30% more unique labels, and 72% of its predictions align with actual user queries. On the less skewed EURLex-4.3k dataset, PUSL demonstrates superior F1 scores, especially as label counts increase from 15 to 30. Our approach effectively tackles both the modeling and evaluation challenges in OXMC with missing labels.
- Published
- 2024
111. Movable Antenna for Wireless Communications:Prototyping and Experimental Results
- Author
-
Dong, Zhenjun, Zhou, Zhiwen, Xiao, Zhiqiang, Zhang, Chaoyue, Li, Xinrui, Min, Hongqi, Zeng, Yong, Jin, Shi, and Zhang, Rui
- Subjects
Computer Science - Information Theory ,Electrical Engineering and Systems Science - Signal Processing - Abstract
Movable antenna (MA), which can flexibly change the position of antenna in three-dimensional (3D) continuous space, is an emerging technology for achieving full spatial performance gains. In this paper, a prototype of MA communication system with ultra-accurate movement control is presented to verify the performance gain of MA in practical environments. The prototype utilizes the feedback control to ensure that each power measurement is performed after the MA moves to a designated position. The system operates at 3.5 GHz or 27.5 GHz, where the MA moves along a one-dimensional horizontal line with a step size of 0.01{\lambda} and in a two-dimensional square region with a step size of 0.05{\lambda}, respectively, with {\lambda} denoting the signal wavelength. The scenario with mixed line-of-sight (LoS) and non-LoS (NLoS) links is considered. Extensive experimental results are obtained with the designed prototype and compared with the simulation results, which validate the great potential of MA technology in improving wireless communication performance. For example, the maximum variation of measured power reaches over 40 dB and 23 dB at 3.5 GHz and 27.5 GHz, respectively, thanks to the flexible antenna movement. In addition, experimental results indicate that the power gain of MA system relies on the estimated path state information (PSI), including the number of paths, their delays, elevation and azimuth angles of arrival (AoAs), as well as the power ratio of each path.
- Published
- 2024
112. Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding
- Author
-
Huang, Lei, Guo, Jiaming, He, Guanhua, Zhang, Xishan, Zhang, Rui, Peng, Shaohui, Liu, Shaoli, and Chen, Tianshi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Generating long-term texts such as novels using artificial intelligence has always been a challenge. A common approach is to use large language models (LLMs) to construct a hierarchical framework that first plans and then writes. Despite the fact that the generated novels reach a sufficient length, they exhibit poor logical coherence and appeal in their plots and deficiencies in character and event depiction, ultimately compromising the overall narrative quality. In this paper, we propose a method named Extracting Excelsior and Expanding. Ex3 initially extracts structure information from raw novel data. By combining this structure information with the novel data, an instruction-following dataset is meticulously crafted. This dataset is then utilized to fine-tune the LLM, aiming for excelsior generation performance. In the final stage, a tree-like expansion method is deployed to facilitate the generation of arbitrarily long novels. Evaluation against previous methods showcases Ex3's ability to produce higher-quality long-form novels.
- Published
- 2024
113. SparkRA: A Retrieval-Augmented Knowledge Service System Based on Spark Large Language Model
- Author
-
Wu, Dayong, Li, Jiaqi, Wang, Baoxin, Zhao, Honghong, Xue, Siyuan, Yang, Yanjie, Chang, Zhijun, Zhang, Rui, Qian, Li, Wang, Bo, Wang, Shijin, Zhang, Zhixiong, and Hu, Guoping
- Subjects
Computer Science - Computation and Language - Abstract
Large language models (LLMs) have shown remarkable achievements across various language tasks.To enhance the performance of LLMs in scientific literature services, we developed the scientific literature LLM (SciLit-LLM) through pre-training and supervised fine-tuning on scientific literature, building upon the iFLYTEK Spark LLM. Furthermore, we present a knowledge service system Spark Research Assistant (SparkRA) based on our SciLit-LLM. SparkRA is accessible online and provides three primary functions: literature investigation, paper reading, and academic writing. As of July 30, 2024, SparkRA has garnered over 50,000 registered users, with a total usage count exceeding 1.3 million.
- Published
- 2024
114. Systematic Uncertainties from Gribov Copies in Lattice Calculation of Parton Distributions in the Coulomb gauge
- Author
-
Gao, Xiang, He, Jinchen, Zhang, Rui, and Zhao, Yong
- Subjects
High Energy Physics - Lattice ,High Energy Physics - Phenomenology ,Nuclear Theory - Abstract
Recently, it has been proposed to compute parton distributions from boosted correlators fixed in the Coulomb gauge within the framework of Large-Momentum Effective Theory. This method does not involve Wilson lines and could greatly improve the efficiency and precision of lattice QCD calculations. However, there are concerns about whether the systematic uncertainties from Gribov copies, which correspond to the ambiguity in lattice gauge-fixing, are under control. This work gives an assessment of the Gribov copies' effect in the Coulomb-gauge-fixed quark correlators. We utilize different strategies for the Coulomb-gauge fixing, selecting two different groups of Gribov copies based on the lattice gauge configurations. We test the difference in the resulted spatial quark correlators in the vacuum and a pion state. Our findings indicate that the statistical errors of the matrix elements from both Gribov copies, regardless of the correlation range, decrease proportionally to the square root of the number of gauge configurations. The difference between the strategies does not show statistical significance compared to the gauge noise. This demonstrates that the effect of the Gribov copies can be neglected in the practical lattice calculation of the quark parton distributions.
- Published
- 2024
115. Comments on 'Non-local Nucleon Matrix Elements in the Rest Frame'
- Author
-
Gao, Xiang, He, Jinchen, Su, Yushan, Zhang, Rui, and Zhao, Yong
- Subjects
High Energy Physics - Lattice - Abstract
In a recent paper, "Non-local Nucleon Matrix Elements in the Rest Frame" (arXiv: 2407.16577), it was demonstrated that the next-to-leading order perturbative theory can describe, to a few percent accuracy, the lattice QCD static nucleon matrix elements of spatial correlators with separations up to 0.6~fm. We argue that perturbative QCD breaks down at such a distance scale after resumming the associated large logarithms, while the ansatz used in the analysis there did not account for resummation or the leading renormalon, both of which significantly affect the convergence of perturbation theory. Besides, we provide an explanation why the ansatz appears to describe the lattice data despite the breakdown of perturbation theory at large distances., Comment: 3 pages, 3 figures; comment on arXiv:2407.16577
- Published
- 2024
116. Cold plasma with zirconia nanoparticles for lung cancer via TGF-\b{eta} signaling pathway
- Author
-
Huang, Yueye, Zhang, Rui, Chen, Xiao, Cao, Fei, Fang, Qiujie, Xu, Qingnan, Huang, Shicong, Wang, Yufan, Chen, Guojun, and Chen, Zhitong
- Subjects
Physics - Biological Physics ,Physics - Medical Physics - Abstract
Despite advancements in lung cancer therapy, the prognosis for advanced or metastatic patients remains poor, yet many patients eventually develop resistance to standard treatments leading to disease progression and poor survival. Here, we described a combination of CAP and nanoparticles (ZrO2 NPs (zirconium oxide nanoparticle) and 3Y-TZP NPs (3% mol Yttria Tetragonal Zirconia Polycrystal Nanoparticle)) for lung cancer therapy. We found that ZrO2 NPs caused obvious damage to the inside of the lung cancer cells. CAP and ZrO2 NPs mainly affected the mitochondria function, leading to a decrease in mitochondrial membrane potential and ATP levels, and causing endoplasmic reticulum stress and cell nucleus internal DNA damage, etc. CAP combined with ZrO2 NPs (CAP@ZrO2) induced lung cancer cell apoptosis by activating the TGF-\b{eta} pathway. CAP@ZrO2 offers a new therapy for the clinical treatment of lung cancer., Comment: 48 pages
- Published
- 2024
117. Embedding Compression in Recommender Systems: A Survey
- Author
-
Li, Shiwei, Guo, Huifeng, Tang, Xing, Tang, Ruiming, Hou, Lu, Li, Ruixuan, and Zhang, Rui
- Subjects
Computer Science - Information Retrieval - Abstract
To alleviate the problem of information explosion, recommender systems are widely deployed to provide personalized information filtering services. Usually, embedding tables are employed in recommender systems to transform high-dimensional sparse one-hot vectors into dense real-valued embeddings. However, the embedding tables are huge and account for most of the parameters in industrial-scale recommender systems. In order to reduce memory costs and improve efficiency, various approaches are proposed to compress the embedding tables. In this survey, we provide a comprehensive review of embedding compression approaches in recommender systems. We first introduce deep learning recommendation models and the basic concept of embedding compression in recommender systems. Subsequently, we systematically organize existing approaches into three categories, namely low-precision, mixed-dimension, and weight-sharing, respectively. Lastly, we summarize the survey with some general suggestions and provide future prospects for this field., Comment: Accepted by ACM Computing Surveys
- Published
- 2024
- Full Text
- View/download PDF
118. Enhancing Spatial Multiplexing and Interference Suppression for Near- and Far-Field Communications with Sparse MIMO
- Author
-
Wang, Huizhi, Feng, Chao, Zeng, Yong, Jin, Shi, Yuen, Chau, Clerckx, Bruno, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Multiple-input multiple-output has been a key technology for wireless systems for decades. For typical MIMO communication systems, antenna array elements are usually separated by half of the carrier wavelength, thus termed as conventional MIMO. In this paper, we investigate the performance of multi-user MIMO communication, with sparse arrays at both the transmitter and receiver side, i.e., the array elements are separated by more than half wavelength. Given the same number of array elements, the performance of sparse MIMO is compared with conventional MIMO. On one hand, sparse MIMO has a larger aperture, which can achieve narrower main lobe beams that make it easier to resolve densely located users. Besides, increased array aperture also enlarges the near-field communication region, which can enhance the spatial multiplexing gain, thanks to the spherical wavefront property in the near-field region. On the other hand, element spacing larger than half wavelength leads to undesired grating lobes, which, if left unattended, may cause severe inter-user interference. To gain further insights, we first study the spatial multiplexing gain of the basic single-user sparse MIMO communication system, where a closed-form expression of the near-field effective degree of freedom is derived. The result shows that the EDoF increases with the array sparsity for sparse MIMO before reaching its upper bound, which equals to the minimum value between the transmit and receive antenna numbers. Furthermore, the scaling law for the achievable data rate with varying array sparsity is analyzed and an array sparsity-selection strategy is proposed. We then consider the more general multi-user sparse MIMO communication system. It is shown that sparse MIMO is less likely to experience severe IUI than conventional MIMO., Comment: 13 pages
- Published
- 2024
119. Channel Estimation for Movable-Antenna MIMO Systems Via Tensor Decomposition
- Author
-
Zhang, Ruoyu, Cheng, Lei, Zhang, Wei, Guan, Xinrong, Cai, Yueming, Wu, Wen, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
In this letter, we investigate the channel estimation problem for MIMO wireless communication systems with movable antennas (MAs) at both the transmitter (Tx) and receiver (Rx). To achieve high channel estimation accuracy with low pilot training overhead, we propose a tensor decomposition-based method for estimating the parameters of multi-path channel components, including their azimuth and elevation angles, as well as complex gain coefficients, thereby reconstructing the wireless channel between any pair of Tx and Rx MA positions in the Tx and Rx regions. First, we introduce a two-stage Tx-Rx successive antenna movement pattern for pilot training, such that the received pilot signals in both stages can be expressed as a third-order tensor. Then, we obtain the factor matrices of the tensor via the canonical polyadic decomposition, and thereby estimate the angle/gain parameters for enabling the channel reconstruction between arbitrary Tx/Rx MA positions. In addition, we analyze the uniqueness condition of the tensor decomposition, which ensures the complete channel reconstruction between the whole Tx and Rx regions based on the channel measurements at only a finite number of Tx/Rx MA positions. Finally, simulation results are presented to evaluate the proposed tensor decomposition-based method as compared to existing methods, in terms of channel estimation accuracy and pilot overhead., Comment: 5 pages, 3 figures
- Published
- 2024
120. LinguaLinker: Audio-Driven Portraits Animation with Implicit Facial Control Enhancement
- Author
-
Zhang, Rui, Fang, Yixiao, Lu, Zhengnan, Cheng, Pei, Huang, Zebiao, and Fu, Bin
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This study delves into the intricacies of synchronizing facial dynamics with multilingual audio inputs, focusing on the creation of visually compelling, time-synchronized animations through diffusion-based techniques. Diverging from traditional parametric models for facial animation, our approach, termed LinguaLinker, adopts a holistic diffusion-based framework that integrates audio-driven visual synthesis to enhance the synergy between auditory stimuli and visual responses. We process audio features separately and derive the corresponding control gates, which implicitly govern the movements in the mouth, eyes, and head, irrespective of the portrait's origin. The advanced audio-driven visual synthesis mechanism provides nuanced control but keeps the compatibility of output video and input audio, allowing for a more tailored and effective portrayal of distinct personas across different languages. The significant improvements in the fidelity of animated portraits, the accuracy of lip-syncing, and the appropriate motion variations achieved by our method render it a versatile tool for animating any portrait in any language.
- Published
- 2024
121. MoveLight: Enhancing Traffic Signal Control through Movement-Centric Deep Reinforcement Learning
- Author
-
Shao, Junqi, Zheng, Chenhao, Chen, Yuxuan, Huang, Yucheng, and Zhang, Rui
- Subjects
Computer Science - Machine Learning - Abstract
This paper introduces MoveLight, a novel traffic signal control system that enhances urban traffic management through movement-centric deep reinforcement learning. By leveraging detailed real-time data and advanced machine learning techniques, MoveLight overcomes the limitations of traditional traffic signal control methods. It employs a lane-level control approach using the FRAP algorithm to achieve dynamic and adaptive traffic signal control, optimizing traffic flow, reducing congestion, and improving overall efficiency. Our research demonstrates the scalability and effectiveness of MoveLight across single intersections, arterial roads, and network levels. Experimental results using real-world datasets from Cologne and Hangzhou show significant improvements in metrics such as queue length, delay, and throughput compared to existing methods. This study highlights the transformative potential of deep reinforcement learning in intelligent traffic signal control, setting a new standard for sustainable and efficient urban transportation systems.
- Published
- 2024
122. Near-Field Integrated Sensing and Communication with Extremely Large-Scale Antenna Array
- Author
-
Hua, Haocheng, Xu, Jie, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
This paper studies a near-field integrated sensing and communication (ISAC) system with extremely large-scale antenna array (ELAA), in which a base station (BS) deployed with enormous number of antennas transmits wireless signals to communicate with multiple communication users (CUs) and simultaneously uses the echo signals to localize multiple point targets in the three-dimension (3D) space. To balance the performance tradeoff between communication and target localization, we design the transmit covariance matrix at the BS to optimize the localization performance while ensuring the signal-to-interference-plus-noise ratio (SINR) constraints at individual CUs. In particular, we formulate three design problems by considering different 3D localization performance metrics, including minimizing the sum Cram\'er-Rao bound (CRB), maximizing the minimum target illumination power, and maximizing the minimum target echo signal power. Although the three design problems are non-convex in general, we obtain their global optimal solutions via the technique of semi-definite relaxation (SDR). It is shown that the three problems have low-rank solution structures depending on the sensing and communication channel matrices, helping reduce the complexity of the SDR-based solutions. Interestingly, we find that in the special case with a single collocated target/CU present towards the middle of a symmetric uniform planar array (UPA), the optimal solutions to the three problems become identical to the SINR-maximization design and have a closed form, while in other cases they can be different in general. Besides, when the target/CU moves away from the transmitter/receiver, the CRB may first decrease and then increase. These two phenomena differ from those in the far-field. Numerical results show the benefits of the proposed designs for near-field ISAC, by exploiting the beam focusing capabilities of ELAA., Comment: 13 pages (14 pages for Arxiv..), 31 figures, submitted for journal publication
- Published
- 2024
123. Joint Active and Passive Beamforming Design for IRS-aided MIMO ISAC Based on Sensing Mutual Information
- Author
-
Li, Jin, Zhou, Gui, Gong, Tantao, Liu, Nan, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
In this paper, we investigate the intelligent reflecting surface (IRS)/reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) system based on sensing mutual information (MI). Specifically, the base station (BS) perceives the sensing target via the reflected sensing signal by the IRS, while communicating with the users simultaneously. Our aim is to maximize the sensing MI, subject to the quality of service (QoS) constraints for all communication users, the transmit power constraint at the BS, and the unit-modulus constraint on the IRS's passive reflection. We solve this problem under two cases: one simplified case assuming a line-of-sight (LoS) channel between the BS and IRS and no clutter interference to sensing, and the other generalized case considering the Rician fading channel of the BS-IRS link and the presence of clutter interference to sensing. For the first case, we show that the dedicated sensing beamformer cannot enhance the sensing MI if the BS-user direct links are blocked, and develop a low-complexity iterative algorithm to jointly optimize the BS and IRS active/passive beamformers. In contrast, for the second case, we propose an alternative iterative algorithm, which can also be applied to the first case, to solve the beamforming design problem under the general setup. Numerical results are provided to validate the performance of the proposed algorithms, as compared to various benchmark schemes.
- Published
- 2024
124. Backdoor Attacks against Hybrid Classical-Quantum Neural Networks
- Author
-
Guo, Ji, Jiang, Wenbo, Zhang, Rui, Fan, Wenshu, Li, Jiachen, and Lu, Guoming
- Subjects
Computer Science - Cryptography and Security - Abstract
Hybrid Quantum Neural Networks (HQNNs) represent a promising advancement in Quantum Machine Learning (QML), yet their security has been rarely explored. In this paper, we present the first systematic study of backdoor attacks on HQNNs. We begin by proposing an attack framework and providing a theoretical analysis of the generalization bounds and minimum perturbation requirements for backdoor attacks on HQNNs. Next, we employ two classic backdoor attack methods on HQNNs and Convolutional Neural Networks (CNNs) to further investigate the robustness of HQNNs. Our experimental results demonstrate that HQNNs are more robust than CNNs, requiring more significant image modifications for successful attacks. Additionally, we introduce the Qcolor backdoor, which utilizes color shifts as triggers and employs the Non-dominated Sorting Genetic Algorithm II (NSGA-II) to optimize hyperparameters. Through extensive experiments, we demonstrate the effectiveness, stealthiness, and robustness of the Qcolor backdoor.
- Published
- 2024
125. vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
- Author
-
Xu, Jiale, Zhang, Rui, Guo, Cong, Hu, Weiming, Liu, Zihan, Wu, Feiyang, Feng, Yu, Sun, Shixuan, Shao, Changxu, Guo, Yuhong, Zhao, Junping, Zhang, Ke, Guo, Minyi, and Leng, Jingwen
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning - Abstract
Large Language Models (LLMs) are widely used across various domains, processing millions of daily requests. This surge in demand poses significant challenges in optimizing throughput and latency while keeping costs manageable. The Key-Value (KV) cache, a standard method for retaining previous computations, makes LLM inference highly bounded by memory. While batching strategies can enhance performance, they frequently lead to significant memory fragmentation. Even though cutting-edge systems like vLLM mitigate KV cache fragmentation using paged Attention mechanisms, they still suffer from inefficient memory and computational operations due to the tightly coupled page management and computation kernels. This study introduces the vTensor, an innovative tensor structure for LLM inference based on GPU virtual memory management (VMM). vTensor addresses existing limitations by decoupling computation from memory defragmentation and offering dynamic extensibility. Our framework employs a CPU-GPU heterogeneous approach, ensuring efficient, fragmentation-free memory management while accommodating various computation kernels across different LLM architectures. Experimental results indicate that vTensor achieves an average speedup of 1.86x across different models, with up to 2.42x in multi-turn chat scenarios. Additionally, vTensor provides average speedups of 2.12x and 3.15x in kernel evaluation, reaching up to 3.92x and 3.27x compared to SGLang Triton prefix-prefilling kernels and vLLM paged Attention kernel, respectively. Furthermore, it frees approximately 71.25% (57GB) of memory on the NVIDIA A100 GPU compared to vLLM, enabling more memory-intensive workloads., Comment: 16 pages, 12 figures
- Published
- 2024
126. Power Measurement Enabled Channel Autocorrelation Matrix Estimation for IRS-Assisted Wireless Communication
- Author
-
Yan, Ge, Zhu, Lipeng, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
By reconfiguring wireless channels via passive signal reflection, intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-involved links. Prior works on IRS CSI acquisition mainly estimate IRS-cascaded channels based on the extra pilot signals received at the users/base station (BS) with time-varying IRS reflections, which, however, needs to modify the existing channel training/estimation protocols of wireless systems. To address this issue, we propose in this paper a new channel estimation scheme for IRS-assisted communication systems based on the received signal power measured at the user terminal, which is practically attainable without the need of changing the current protocol. Due to the lack of signal phase information in measured power, the autocorrelation matrix of the BS-IRS-user cascaded channel is estimated by solving an equivalent rank-minimization problem. To this end, a low-rank-approaching (LRA) algorithm is proposed by employing the fractional programming and alternating optimization techniques. To reduce computational complexity, an approximate LRA (ALRA) algorithm is also developed. Furthermore, these two algorithms are extended to be robust against the receiver noise and quantization error in power measurement. Simulation results are provided to verify the effectiveness of the proposed channel estimation algorithms as well as the IRS passive reflection design based on the estimated channel autocorrelation matrix., Comment: 17 pages, 16 figures, part of this work was presented at the IEEE Global Communications Conference Workshops 2023, Kuala Lumpur, Malaysia. arXiv admin note: text overlap with arXiv:2310.11038
- Published
- 2024
127. Fine-grained Knowledge Graph-driven Video-Language Learning for Action Recognition
- Author
-
Zhang, Rui, Lu, Yafen, Ji, Pengli, Xue, Junxiao, and Yan, Xiaoran
- Subjects
Computer Science - Multimedia - Abstract
Recent work has explored video action recognition as a video-text matching problem and several effective methods have been proposed based on large-scale pre-trained vision-language models. However, these approaches primarily operate at a coarse-grained level without the detailed and semantic understanding of action concepts by exploiting fine-grained semantic connections between actions and body movements. To address this gap, we propose a contrastive video-language learning framework guided by a knowledge graph, termed KG-CLIP, which incorporates structured information into the CLIP model in the video domain. Specifically, we construct a multi-modal knowledge graph composed of multi-grained concepts by parsing actions based on compositional learning. By implementing a triplet encoder and deviation compensation to adaptively optimize the margin in the entity distance function, our model aims to improve alignment of entities in the knowledge graph to better suit complex relationship learning. This allows for enhanced video action recognition capabilities by accommodating nuanced associations between graph components. We comprehensively evaluate KG-CLIP on Kinetics-TPS, a large-scale action parsing dataset, demonstrating its effectiveness compared to competitive baselines. Especially, our method excels at action recognition with few sample frames or limited training data, which exhibits excellent data utilization and learning capabilities.
- Published
- 2024
128. Group Movable Antenna With Flexible Sparsity: Joint Array Position and Sparsity Optimization
- Author
-
Lu, Haiquan, Zeng, Yong, Jin, Shi, and Zhang, Rui
- Subjects
Computer Science - Information Theory ,Electrical Engineering and Systems Science - Signal Processing - Abstract
Movable antenna (MA) is a promising technology to exploit the spatial variation of wireless channel for performance enhancement, by dynamically varying the antenna position within a certain region. However, for multi-antenna communication systems, moving each antenna independently not only requires prohibitive complexity to find the optimal antenna positions, but also incurs sophisticated movement control in practice. To address this issue, this letter proposes a new MA architecture termed group MA (GMA), enabling the group movement of all elements collectively in a continuous manner, and simultaneously achieving flexible array architecture by antenna selection (AS). In this letter, we focus on the uniform sparse array based GMA, where equally spaced antenna elements are selected to achieve desired array sparsity. The array position and sparsity level are jointly optimized to maximize the sum rate of the multi-user communication system. Numerical results verify the necessity to optimize the position and sparsity of GMA, and considerable performance gain is achieved as compared to the conventional fixed-position antenna (FPA)., Comment: 5 pages, 5 figures
- Published
- 2024
129. Dimension-reduced Reconstruction Map Learning for Parameter Estimation in Likelihood-Free Inference Problems
- Author
-
Zhang, Rui, Chkrebtii, Oksana A., and Xiu, Dongbin
- Subjects
Statistics - Methodology ,Statistics - Computation ,Statistics - Machine Learning - Abstract
Many application areas rely on models that can be readily simulated but lack a closed-form likelihood, or an accurate approximation under arbitrary parameter values. Existing parameter estimation approaches in this setting are generally approximate. Recent work on using neural network models to reconstruct the mapping from the data space to the parameters from a set of synthetic parameter-data pairs suffers from the curse of dimensionality, resulting in inaccurate estimation as the data size grows. We propose a dimension-reduced approach to likelihood-free estimation which combines the ideas of reconstruction map estimation with dimension-reduction approaches based on subject-specific knowledge. We examine the properties of reconstruction map estimation with and without dimension reduction and explore the trade-off between approximation error due to information loss from reducing the data dimension and approximation error. Numerical examples show that the proposed approach compares favorably with reconstruction map estimation, approximate Bayesian computation, and synthetic likelihood estimation.
- Published
- 2024
130. KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration
- Author
-
Yan, Youfu, Hou, Yu, Xiao, Yongkang, Zhang, Rui, and Wang, Qianwen
- Subjects
Computer Science - Human-Computer Interaction - Abstract
The increasing reliance on Large Language Models (LLMs) for health information seeking can pose severe risks due to the potential for misinformation and the complexity of these topics. This paper introduces KNOWNET a visualization system that integrates LLMs with Knowledge Graphs (KG) to provide enhanced accuracy and structured exploration. Specifically, for enhanced accuracy, KNOWNET extracts triples (e.g., entities and their relations) from LLM outputs and maps them into the validated information and supported evidence in external KGs. For structured exploration, KNOWNET provides next-step recommendations based on the neighborhood of the currently explored entities in KGs, aiming to guide a comprehensive understanding without overlooking critical aspects. To enable reasoning with both the structured data in KGs and the unstructured outputs from LLMs, KNOWNET conceptualizes the understanding of a subject as the gradual construction of graph visualization. A progressive graph visualization is introduced to monitor past inquiries, and bridge the current query with the exploration history and next-step recommendations. We demonstrate the effectiveness of our system via use cases and expert interviews., Comment: 9 pages, 9 figures, accepted by IEEE VIS 2024
- Published
- 2024
- Full Text
- View/download PDF
131. Backdoor Attacks against Image-to-Image Networks
- Author
-
Jiang, Wenbo, Li, Hongwei, He, Jiaming, Zhang, Rui, Xu, Guowen, Zhang, Tianwei, and Lu, Rongxing
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Recently, deep learning-based Image-to-Image (I2I) networks have become the predominant choice for I2I tasks such as image super-resolution and denoising. Despite their remarkable performance, the backdoor vulnerability of I2I networks has not been explored. To fill this research gap, we conduct a comprehensive investigation on the susceptibility of I2I networks to backdoor attacks. Specifically, we propose a novel backdoor attack technique, where the compromised I2I network behaves normally on clean input images, yet outputs a predefined image of the adversary for malicious input images containing the trigger. To achieve this I2I backdoor attack, we propose a targeted universal adversarial perturbation (UAP) generation algorithm for I2I networks, where the generated UAP is used as the backdoor trigger. Additionally, in the backdoor training process that contains the main task and the backdoor task, multi-task learning (MTL) with dynamic weighting methods is employed to accelerate convergence rates. In addition to attacking I2I tasks, we extend our I2I backdoor to attack downstream tasks, including image classification and object detection. Extensive experiments demonstrate the effectiveness of the I2I backdoor on state-of-the-art I2I network architectures, as well as the robustness against different mainstream backdoor defenses.
- Published
- 2024
132. DDFAD: Dataset Distillation Framework for Audio Data
- Author
-
Jiang, Wenbo, Zhang, Rui, Li, Hongwei, Liu, Xiaoyuan, Yang, Haomiao, and Yu, Shui
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Databases ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Deep neural networks (DNNs) have achieved significant success in numerous applications. The remarkable performance of DNNs is largely attributed to the availability of massive, high-quality training datasets. However, processing such massive training data requires huge computational and storage resources. Dataset distillation is a promising solution to this problem, offering the capability to compress a large dataset into a smaller distilled dataset. The model trained on the distilled dataset can achieve comparable performance to the model trained on the whole dataset. While dataset distillation has been demonstrated in image data, none have explored dataset distillation for audio data. In this work, for the first time, we propose a Dataset Distillation Framework for Audio Data (DDFAD). Specifically, we first propose the Fused Differential MFCC (FD-MFCC) as extracted features for audio data. After that, the FD-MFCC is distilled through the matching training trajectory distillation method. Finally, we propose an audio signal reconstruction algorithm based on the Griffin-Lim Algorithm to reconstruct the audio signal from the distilled FD-MFCC. Extensive experiments demonstrate the effectiveness of DDFAD on various audio datasets. In addition, we show that DDFAD has promising application prospects in many applications, such as continual learning and neural architecture search.
- Published
- 2024
133. Understanding the Importance of Evolutionary Search in Automated Heuristic Design with Large Language Models
- Author
-
Zhang, Rui, Liu, Fei, Lin, Xi, Wang, Zhenkun, Lu, Zhichao, and Zhang, Qingfu
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Artificial Intelligence - Abstract
Automated heuristic design (AHD) has gained considerable attention for its potential to automate the development of effective heuristics. The recent advent of large language models (LLMs) has paved a new avenue for AHD, with initial efforts focusing on framing AHD as an evolutionary program search (EPS) problem. However, inconsistent benchmark settings, inadequate baselines, and a lack of detailed component analysis have left the necessity of integrating LLMs with search strategies and the true progress achieved by existing LLM-based EPS methods to be inadequately justified. This work seeks to fulfill these research queries by conducting a large-scale benchmark comprising four LLM-based EPS methods and four AHD problems across nine LLMs and five independent runs. Our extensive experiments yield meaningful insights, providing empirical grounding for the importance of evolutionary search in LLM-based AHD approaches, while also contributing to the advancement of future EPS algorithmic development. To foster accessibility and reproducibility, we have fully open-sourced our benchmark and corresponding results., Comment: Accepted by the 18th International Conference on Parallel Problem Solving From Nature (PPSN 2024)
- Published
- 2024
134. CodeV: Empowering LLMs for Verilog Generation through Multi-Level Summarization
- Author
-
Zhao, Yang, Huang, Di, Li, Chongxiao, Jin, Pengwei, Nan, Ziyuan, Ma, Tianyun, Qi, Lei, Pan, Yansong, Zhang, Zhenxing, Zhang, Rui, Zhang, Xishan, Du, Zidong, Guo, Qi, Hu, Xing, and Chen, Yunji
- Subjects
Computer Science - Programming Languages ,Computer Science - Artificial Intelligence - Abstract
The increasing complexity and high costs associated with modern processor design have led to a surge in demand for processor design automation. Instruction-tuned large language models (LLMs) have demonstrated remarkable performance in automatically generating code for general-purpose programming languages like Python. However, these methods fail on hardware description languages (HDLs) like Verilog due to the scarcity of high-quality instruction tuning data, as even advanced LLMs like GPT-3.5 exhibit limited performance on Verilog generation. Regarding this issue, we observe that (1) Verilog code collected from the real world has higher quality than those generated by LLMs. (2) LLMs like GPT-3.5 excel in summarizing Verilog code rather than generating it. Based on these observations, this paper introduces CodeV, a series of open-source instruction-tuned Verilog generation LLMs. Instead of generating descriptions first and then getting the corresponding code from advanced LLMs, we prompt the LLM with Verilog code and let the LLM generate the corresponding natural language description by multi-level summarization. Experimental results show that CodeV relatively surpasses the previous open-source SOTA by 14.4% (BetterV in VerilogEval) and 11.3% (RTLCoder in RTLLM) respectively, and also relatively outperforms previous commercial SOTA GPT-4 by 22.1% in VerilogEval., Comment: 16 pages, 8 figures, conference
- Published
- 2024
135. In-plane staging in lithium-ion intercalation of bilayer graphene
- Author
-
Astles, Thomas, McHugh, James G., Zhang, Rui, Guo, Qian, Howe, Madeleine, Wu, Zefei, Indykiewicz, Kornelia, Summerfield, Alex, Goodwin, Zachary A. H., Slizovskiy, Sergey, Domaretskiy, Daniil, Geim, Andre K., Falko, Vladimir, and Grigorieva, Irina V.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
The ongoing efforts to optimize Li-ion batteries led to the interest in intercalation of nanoscale layered compounds, including bilayer graphene. Its lithium intercalation has been demonstrated recently but the mechanisms underpinning the storage capacity remain poorly understood. Here, using magnetotransport measurements, we report in-operando intercalation dynamics of bilayer graphene. Unexpectedly, we find four distinct intercalation stages that correspond to well-defined Li-ion densities. We refer to these stages as 'in-plane', with no in-plane analogues in bulk graphite. The fully intercalated bilayers represent a stoichiometric compound C14LiC14 with a Li density of 2.7x10^{14} cm^{-2}, notably lower than fully intercalated graphite. Combining the experimental findings and DFT calculations, we show that the critical step in bilayer intercalation is a transition from AB to AA stacking which occurs at a density of 0.9x10^{14} cm^{-2}. Our findings reveal the mechanism and limits for electrochemical intercalation of bilayer graphene and suggest possible avenues for increasing the Li storage capacity., Comment: 30 pages, 17 figures
- Published
- 2024
- Full Text
- View/download PDF
136. Interpretable Differential Diagnosis with Dual-Inference Large Language Models
- Author
-
Zhou, Shuang, Ding, Sirui, Wang, Jiashuo, Lin, Mingquan, Melton, Genevieve B., and Zhang, Rui
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Methodological advancements to automate the generation of differential diagnosis (DDx) to predict a list of potential diseases as differentials given patients' symptom descriptions are critical to clinical reasoning and applications such as decision support. However, providing reasoning or interpretation for these differential diagnoses is more meaningful. Fortunately, large language models (LLMs) possess powerful language processing abilities and have been proven effective in various related tasks. Motivated by this potential, we investigate the use of LLMs for interpretable DDx. First, we develop a new DDx dataset with expert-derived interpretation on 570 public clinical notes. Second, we propose a novel framework, named Dual-Inf, that enables LLMs to conduct bidirectional inference for interpretation. Both human and automated evaluation demonstrate the effectiveness of Dual-Inf in predicting differentials and diagnosis explanations. Specifically, the performance improvement of Dual-Inf over the baseline methods exceeds 32% w.r.t. BERTScore in DDx interpretation. Furthermore, experiments verify that Dual-Inf (1) makes fewer errors in interpretation, (2) has great generalizability, (3) is promising for rare disease diagnosis and explanation., Comment: 15 pages
- Published
- 2024
137. Detection-Triggered Recursive Impact Mitigation against Secondary False Data Injection Attacks in Microgrids
- Author
-
Liu, Mengxiang, Zhang, Xin, Zhang, Rui, Zhou, Zhuoran, Zhang, Zhenyong, and Deng, Ruilong
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
The cybersecurity of microgrid has received widespread attentions due to the frequently reported attack accidents against distributed energy resource (DER) manufactures. Numerous impact mitigation schemes have been proposed to reduce or eliminate the impacts of false data injection attacks (FDIAs). Nevertheless, the existing methods either requires at least one neighboring trustworthy agent or may bring in unacceptable cost burdens. This paper aims to propose a detection-triggered recursive impact mitigation scheme that can timely and precisely counter the secondary FDIAs (SFDIAs) against the communication links among DERs. Once triggering attack alarms, the power line current readings will be utilised to observe the voltage bias injections through the physical interconnections among DERs, based on which the current bias injections can be recursively reconstructed from the residuals generated by unknown input observers (UIOs). The attack impacts are eliminated by subtracting the reconstructed bias from the incoming compromised data. The proposed mitigation method can work even in the worst case where all communication links are under SFDIAs and only require extra current sensors. The bias reconstruction performance under initial errors and system noises is theoretically analysed and the reconstruction error is proved to be bounded regardless of the electrical parameters. To avoid deploying current sensors on all power lines, a cost-effective deployment strategy is presented to secure a spanning tree set of communication links that can guarantee the secondary control performance. Extensive validation studies are conducted in MATLAB/SIMULINK and hardware-in-the-loop (HIL) testbeds to validate the proposed method's effectiveness against single/multiple and continuous/discontinuous SFDIAs., Comment: Submitted to IEEE Transactions on Smart Grid
- Published
- 2024
138. InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct
- Author
-
Wu, Yutong, Huang, Di, Shi, Wenxuan, Wang, Wei, Gao, Lingzhe, Liu, Shihao, Nan, Ziyuan, Yuan, Kaizhao, Zhang, Rui, Zhang, Xishan, Du, Zidong, Guo, Qi, Pu, Yewen, Yin, Dawei, Hu, Xing, and Chen, Yunji
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Software Engineering - Abstract
Recent advancements in open-source code large language models (LLMs) have demonstrated remarkable coding abilities by fine-tuning on the data generated from powerful closed-source LLMs such as GPT-3.5 and GPT-4 for instruction tuning. This paper explores how to further improve an instruction-tuned code LLM by generating data from itself rather than querying closed-source LLMs. Our key observation is the misalignment between the translation of formal and informal languages: translating formal language (i.e., code) to informal language (i.e., natural language) is more straightforward than the reverse. Based on this observation, we propose INVERSE-INSTRUCT, which summarizes instructions from code snippets instead of the reverse. Specifically, given an instruction tuning corpus for code and the resulting instruction-tuned code LLM, we ask the code LLM to generate additional high-quality instructions for the original corpus through code summarization and self-evaluation. Then, we fine-tune the base LLM on the combination of the original corpus and the self-generated one, which yields a stronger instruction-tuned LLM. We present a series of code LLMs named InverseCoder, which surpasses the performance of the original code LLMs on a wide range of benchmarks, including Python text-to-code generation, multilingual coding, and data-science code generation.
- Published
- 2024
139. Fixed and Movable Antenna Technology for 6G Integrated Sensing and Communication
- Author
-
Zeng, Yong, Dong, Zhenjun, Wang, Huizhi, Zhu, Lipeng, Hong, Ziyao, Jiang, Qingji, Wang, Dongming, Jin, Shi, and Zhang, Rui
- Subjects
Computer Science - Hardware Architecture - Abstract
By deploying antenna arrays at the transmitter/receiver to provide additional spatial-domain degrees of freedom (DoFs), multi-antenna technology greatly improves the reliability and efficiency of wireless communication. Meanwhile, the application of multi-antenna technology in the radar field has achieved spatial angle resolution and improved sensing DoF, thus significantly enhancing wireless sensing performance. However, wireless communication and radar sensing have undergone independent development over the past few decades. As a result, although multi-antenna technology has dramatically advanced in these two fields separately, it has not been deeply integrated by exploiting their synergy. A new opportunity to fill up this gap arises as the integration of sensing and communication has been identified as one of the typical usage scenarios of the 6G communication network. Motivated by the above, this article aims to explore the multi-antenna technology for 6G ISAC, with the focus on its future development trends such as continuous expansion of antenna array scale, more diverse array architectures, and more flexible antenna designs. First, we introduce several new and promising antenna architectures, including the centralized antenna architectures based on traditional compact arrays or emerging sparse arrays, the distributed antenna architectures exemplified by the cell-free massive MIMO, and the movable/fluid antennas with flexible positions and/or orientations in a given 3D space. Next, for each antenna architecture mentioned above, we present the corresponding far-field/near-field channel models and analyze the communication and sensing performance. Finally, we summarize the characteristics of different antenna architectures and look forward to new ideas for solving the difficulties in acquiring CSI caused by the continuous expansion of antenna array scale and flexible antenna designs., Comment: in Chinese language
- Published
- 2024
140. HHH Whitepaper
- Author
-
Brigljevic, Vuko, Ferencek, Dinko, Landsberg, Greg, Robens, Tania, Stamenkovic, Marko, Susa, Tatjana, Abouabid, Hamza, Arhrib, Abdesslam, Arnold, Hannah, Azevedo, Duarte, Diaz, Daniel, Duarte, Javier, Pree, Tristan du, Falaki, Jaouad El, Ferreira, Pedro. M., Fuks, Benjamin, Ganguly, Sanmay, Kolosova, Marina, Konigsberg, Jacobo, Liu, Bingxuan, Moser, Brian, Muehlleitner, Margarete, Papaefstathiou, Andreas, Pasechnik, Roman, Santos, Rui, Sheldon, Brian, Soyez, Gregory, Stylianou, Panagiotis, Tetlalmatzi-Xolocotzi, Gilberto, Weiglein, Georg, Zanderighi, Giulia, and Zhang, Rui
- Subjects
High Energy Physics - Phenomenology - Abstract
We here report on the progress of the HHH Workshop, that took place in Dubrovnik in July 2023. After the discovery of a particle that complies with the properties of the Higgs boson of the Standard Model, all SM parameters are in principle determined. However, in order to verify or falsify the model, the full form of the potential has to be determined. This includes the measurement of the triple and quartic scalar couplings. We here report on ongoing progress of measurements for multi scalar final states, with an emphasis on three SM-like scalar bosons at 125 GeV, but also mentioning other options. We discuss both experimental progress and challenges as well as theoretical studies and models that can enhance such rates with respect to the SM predictions., Comment: 117 pages, 56 figures; Whitepaper resulting from HHH Workshop in Dubrovnik 2023, https://indico.cern.ch/event/1232581/; v2: small typos corrected
- Published
- 2024
141. Dark Photon Dark Matter in Quantum Electromagnetodynamics and Detection at Haloscope Experiments
- Author
-
Li, Tong, Zhang, Rui-Jia, and Dai, Chang-Jie
- Subjects
High Energy Physics - Phenomenology ,High Energy Physics - Experiment - Abstract
The ultralight dark photon is one of intriguing dark matter candidates. The interaction between the visible photon and dark photon is introduced by the gauge kinetic mixing between the field strength tensors of the Abelian gauge groups in the Standard Model and dark sector. The relativistic electrodynamics was generalized to quantum electromagnetodynamics (QEMD) in the presence of both electric and magnetic charges. The photon is described by two four-potentials corresponding to two $U(1)$ gauge groups and satisfying non-trivial commutation relations. In this work, we construct the low-energy dark photon-photon interactions in the framework of QEMD and obtain new dark photon-photon kinetic mixings. The consequent field equations and the new Maxwell's equations are derived in this framework. We also investigate the detection strategies of dark photon as light dark matter as well as the generic kinetic mixings at haloscope experiments., Comment: 14 pages, 2 figures
- Published
- 2024
142. Lattice QCD Calculation of $x$-dependent Meson Distribution Amplitudes at Physical Pion Mass with Threshold Logarithm Resummation
- Author
-
Cloet, Ian, Gao, Xiang, Mukherjee, Swagato, Syritsyn, Sergey, Karthik, Nikhil, Petreczky, Peter, Zhang, Rui, and Zhao, Yong
- Subjects
High Energy Physics - Lattice ,High Energy Physics - Experiment ,High Energy Physics - Phenomenology ,Nuclear Theory - Abstract
We present a lattice QCD calculation of the $x$-dependent pion and kaon distribution amplitudes (DA) in the framework of large momentum effective theory. This calculation is performed on a fine lattice of $a=0.076$~fm at physical pion mass, with the pion boosted to $1.8$~GeV and kaon boosted to $2.3$~GeV. We renormalize the matrix elements in the hybrid scheme and match to $\overline{\rm MS}$ with a subtraction of the leading renormalon in the Wilson-line mass. The perturbative matching is improved by resumming the large logarithms related to the small quark and gluon momenta in the soft-gluon limit. After resummation, we demonstrate that we are able to calculate a range of $x\in[x_0,1-x_0]$ with $x_0=0.25$ for pion and $x_0=0.2$ for kaon with systematics under control. The kaon DA is shown to be slighted skewed, and narrower than pion DA. Although the $x$-dependence cannot be direct calculated beyond these ranges, we estimate higher moments of the pion and kaon DAs {by complementing} our calculation with short-distance factorization.
- Published
- 2024
143. Crystalline-Symmetry-Protected Majorana Modes in Coupled Quantum Dots
- Author
-
Pandey, Bradraj, Alvarez, Gonzalo, Dagotto, Elbio, and Zhang, Rui-Xing
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Superconductivity - Abstract
We propose a minimalist architecture for achieving various crystalline-symmetry-protected Majorana modes in an array of coupled quantum dots. Our framework is motivated by the recent experimental demonstrations of two-site and three-site artificial Kitaev chains in a similar setup. We find that introducing a $\pi$-phase domain wall in the Kitaev chain leads to a pair of mirror-protected Majorana zero modes located at or near the junction. Joining two $\pi$-junctions into a closed loop, we can simulate two distinct classes of two-dimensional higher-order topological superconducting phases, both carrying symmetry-protected Majorana modes around the sample corners. As an extension of the $\pi$-junction, we further consider a general vertex structure where $n$ Kitaev chains meet, i.e., a Kitaev $n$-vertex. We prove that such an $n$-vertex, if respecting a dihedral symmetry group $D_n$, necessarily carries $n$ vertex-bound Majorana modes protected by the $D_n$ symmetry. Resilience of the junction and vertex Majorana bound states against disorder and correlation effects is also discussed. Our architecture paves the way for designing, constructing, and exploring a wide variety of artificial topological crystalline phases in experiments., Comment: 8 pages, 5 figures
- Published
- 2024
144. Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation
- Author
-
Ahn, Jihyun Janice, Kamoi, Ryo, Cheng, Lu, Zhang, Rui, and Yin, Wenpeng
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Mainstream LLM research has primarily focused on enhancing their generative capabilities. However, even the most advanced LLMs experience uncertainty in their outputs, often producing varied results on different runs or when faced with minor changes in input, despite no substantial change in content. Given multiple responses from the same LLM to the same input, we advocate leveraging the LLMs' discriminative capability to reduce this generative uncertainty, aiding in identifying the correct answers. Specifically, we propose and analyze three discriminative prompts: direct, inverse, and hybrid, to explore the potential of both closed-source and open-source LLMs in self-improving their generative performance on two benchmark datasets. Our insights reveal which discriminative prompt is most promising and when to use it. To our knowledge, this is the first work to systematically analyze LLMs' discriminative capacity to address generative uncertainty., Comment: 4 pages, 3 tables
- Published
- 2024
145. MammothModa: Multi-Modal Large Language Model
- Author
-
She, Qi, Pan, Junwen, Wan, Xin, Zhang, Rui, Lu, Dawei, and Huang, Kai
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
In this report, we introduce MammothModa, yet another multi-modal large language model (MLLM) designed to achieve state-of-the-art performance starting from an elementary baseline. We focus on three key design insights: (i) Integrating Visual Capabilities while Maintaining Complex Language Understanding: In addition to the vision encoder, we incorporated the Visual Attention Experts into the LLM to enhance its visual capabilities. (ii) Extending Context Window for High-Resolution and Long-Duration Visual Feature: We explore the Visual Merger Module to effectively reduce the token number of high-resolution images and incorporated frame position ids to avoid position interpolation. (iii) High-Quality Bilingual Datasets: We meticulously curated and filtered a high-quality bilingual multimodal dataset to reduce visual hallucinations. With above recipe we build MammothModa that consistently outperforms the state-of-the-art models, e.g., LLaVA-series, across main real-world visual language benchmarks without bells and whistles., Comment: Technical report
- Published
- 2024
146. Adversarial Contrastive Decoding: Boosting Safety Alignment of Large Language Models via Opposite Prompt Optimization
- Author
-
Zhao, Zhengyue, Zhang, Xiaoyun, Xu, Kaidi, Hu, Xing, Zhang, Rui, Du, Zidong, Guo, Qi, and Chen, Yunji
- Subjects
Computer Science - Computation and Language - Abstract
With the widespread application of Large Language Models (LLMs), it has become a significant concern to ensure their safety and prevent harmful responses. While current safe-alignment methods based on instruction fine-tuning and Reinforcement Learning from Human Feedback (RLHF) can effectively reduce harmful responses from LLMs, they often require high-quality datasets and heavy computational overhead during model training. Another way to align language models is to modify the logit of tokens in model outputs without heavy training. Recent studies have shown that contrastive decoding can enhance the performance of language models by reducing the likelihood of confused tokens. However, these methods require the manual selection of contrastive models or instruction templates. To this end, we propose Adversarial Contrastive Decoding (ACD), an optimization-based framework to generate two opposite system prompts for prompt-based contrastive decoding. ACD only needs to apply a lightweight prompt tuning on a rather small anchor dataset (< 3 min for each model) without training the target model. Experiments conducted on extensive models and benchmarks demonstrate that the proposed method achieves much better safety performance than previous model training-free decoding methods without sacrificing its original generation ability.
- Published
- 2024
147. LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
- Author
-
Du, Jiangshu, Wang, Yibo, Zhao, Wenting, Deng, Zhongfen, Liu, Shuaiqi, Lou, Renze, Zou, Henry Peng, Venkit, Pranav Narayanan, Zhang, Nan, Srinath, Mukund, Zhang, Haoran Ranran, Gupta, Vipul, Li, Yinghui, Li, Tao, Wang, Fei, Liu, Qin, Liu, Tianlin, Gao, Pengzhi, Xia, Congying, Xing, Chen, Cheng, Jiayang, Wang, Zhaowei, Su, Ying, Shah, Raj Sanjay, Guo, Ruohao, Gu, Jing, Li, Haoran, Wei, Kangda, Wang, Zihao, Cheng, Lu, Ranathunga, Surangika, Fang, Meng, Fu, Jie, Liu, Fei, Huang, Ruihong, Blanco, Eduardo, Cao, Yixin, Zhang, Rui, Yu, Philip S., and Yin, Wenpeng
- Subjects
Computer Science - Computation and Language - Abstract
This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as they have to spend more time reading, writing, and reviewing papers. This raises the question: how can LLMs potentially assist researchers in alleviating their heavy workload? This study focuses on the topic of LLMs assist NLP Researchers, particularly examining the effectiveness of LLM in assisting paper (meta-)reviewing and its recognizability. To address this, we constructed the ReviewCritique dataset, which includes two types of information: (i) NLP papers (initial submissions rather than camera-ready) with both human-written and LLM-generated reviews, and (ii) each review comes with "deficiency" labels and corresponding explanations for individual segments, annotated by experts. Using ReviewCritique, this study explores two threads of research questions: (i) "LLMs as Reviewers", how do reviews generated by LLMs compare with those written by humans in terms of quality and distinguishability? (ii) "LLMs as Metareviewers", how effectively can LLMs identify potential issues, such as Deficient or unprofessional review segments, within individual paper reviews? To our knowledge, this is the first work to provide such a comprehensive analysis., Comment: Accepted by EMNLP 2024 main conference
- Published
- 2024
148. Full-Space Wireless Sensing Enabled by Multi-Sector Intelligent Surfaces
- Author
-
Zhang, Yumeng, Shao, Xiaodan, Li, Hongyu, Clerckx, Bruno, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Information Theory - Abstract
The multi-sector intelligent surface (IS), benefiting from a smarter wave manipulation capability, has been shown to enhance channel gain and offer full-space coverage in communications. However, the benefits of multi-sector IS in wireless sensing remain unexplored. This paper introduces the application of multi-sector IS for wireless sensing/localization. Specifically, we propose a new self-sensing system, where an active source controller uses the multi-sector IS geometry to reflect/scatter the emitted signals towards the entire space, thereby achieving full-space coverage for wireless sensing. Additionally, dedicated sensors are installed aligned with the IS elements at each sector, which collect echo signals from the target and cooperate to sense the target angle. In this context, we develop a maximum likelihood estimator of the target angle for the proposed multi-sector IS self-sensing system, along with the corresponding theoretical limits defined by the Cram\'er-Rao Bound. The analysis reveals that the advantages of the multi-sector IS self-sensing system stem from two aspects: enhancing the probing power on targets (thereby improving power efficiency) and increasing the rate of target angle (thereby enhancing the transceiver's sensitivity to target angles). Finally, our analysis and simulations confirm that the multi-sector IS self-sensing system, particularly the 4-sector architecture, achieves full-space sensing capability beyond the single-sector IS configuration. Furthermore, similarly to communications, employing directive antenna patterns on each sector's IS elements and sensors significantly enhances sensing capabilities. This enhancement originates from both aspects of improved power efficiency and target angle sensitivity, with the former also being observed in communications while the latter being unique in sensing., Comment: 13 pages, 9 figures
- Published
- 2024
149. Integrating Base Station with Intelligent Surface for 6G Wireless Networks: Architectures, Design Issues, and Future Directions
- Author
-
Huang, Yuwei, Zhu, Lipeng, and Zhang, Rui
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Networking and Internet Architecture - Abstract
Intelligent surface (IS) is envisioned as a promising technology for the sixth-generation (6G) wireless networks, which can effectively reconfigure the wireless propagation environment via dynamically controllable signal reflection/transmission. In particular, integrating passive intelligent surface (IS) into the base station (BS) is a novel solution to enhance the wireless network throughput and coverage both cost-effectively and energyefficiently. In this article, we provide an overview of IS-integrated BSs for wireless networks, including their motivations, practical architectures, and main design issues. Moreover, numerical results are presented to compare the performance of different IS-integrated BS architectures as well as the conventional BS without IS. Finally, promising directions are pointed out to stimulate future research on IS-BS/terminal integration in wireless networks., Comment: submitted to IEEE magazine. 5 figures, 1 table
- Published
- 2024
150. Provably Secure Non-interactive Key Exchange Protocol for Group-Oriented Applications in Scenarios with Low-Quality Networks
- Author
-
Zhang, Rui and Zhang, Lei
- Subjects
Computer Science - Cryptography and Security - Abstract
Non-interactive key exchange (NIKE) enables two or multiple parties (just knowing the public system parameters and each other's public key) to derive a (group) session key without the need for interaction. Recently, NIKE in multi-party settings has been attached importance. However, we note that most existing multi-party NIKE protocols, underlying costly cryptographic techniques (i.e., multilinear maps and indistinguishability obfuscation), lead to high computational costs once employed in practice. Therefore, it is a challenging task to achieve multi-party NIKE protocols by using more practical cryptographic primitives. In this paper, we propose a secure and efficient NIKE protocol for secure communications in dynamic groups, whose construction only bases on bilinear maps. This protocol allows multiple parties to negotiate asymmetric group keys (a public group encryption key and each party's decryption key) without any interaction among one another. Additionally, the protocol supports updating of group keys in an efficient and non-interactive way once any party outside a group or any group member joins or leaves the group. Further, any party called a sender (even outside a group) intending to connect with some or all of group members called receivers in a group, just needs to generate a ciphertext with constant size under the public group encryption key, and only the group member who is the real receiver can decrypt the ciphertext to obtain the session key. We prove our protocol captures the correctness and indistinguishability of session key under k-Bilinear Diffie-Hellman exponent (k-BDHE) assumption. Efficiency evaluation shows the efficiency of our protocol.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.