Author: "Li, Peng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Li, Peng"' showing total 54,399 results

Start Over Author "Li, Peng"

54,399 results on '"Li, Peng"'

1. Envisioning Rhythm: Exploring the Visual Dimension of Natural Ecosystems through Digital Media

Author: Chang, Pai-Ling and Li, Peng-Peng
Published: 2023

2. Effects of exogenous ghrelin on duodenal growth and development of african ostrich chicks

Author: Li, Peng, Bai-Li, Tao, Jia-Wang, Xiang, Jing-Pi, Song, Ke-Peng, Mei, and Zhang, Ying
Published: 2022
Full Text: View/download PDF

3. Nonreciprocal interaction and entanglement between two superconducting qubits

Author: Ren, Yu-Meng, Pan, Xue-Feng, Yao, Xiao-Yu, Huo, Xiao-Wen, Zheng, Jun-Cong, Hei, Xin-Lei, Qiao, Yi-Fan, and Li, Peng-Bo
Subjects: Quantum Physics
Abstract: Nonreciprocal interaction between two spatially separated subsystems plays a crucial role in signal processing and quantum networks. Here, we propose an efficient scheme to achieve nonreciprocal interaction and entanglement between two qubits by combining coherent and dissipative couplings in a superconducting platform, where two coherently coupled transmon qubits simultaneously interact with a transmission line waveguide. The coherent interaction between the transmon qubits can be achieved via capacitive coupling or via an intermediary cavity mode, while the dissipative interaction is induced by the transmission line via reservoir engineering. With high tunability of superconducting qubits, their positions along the transmission line can be adjusted to tune the dissipative coupling, enabling to tailor reciprocal and nonreciprocal interactions between the qubits. A fully nonreciprocal interaction can be achieved when the separation between the two qubits is $(4n+3)\lambda_{0} /4$, where $n$ is an integer and $\lambda_{0}$ is the photon wavelength. This nonreciprocal interaction enables the generation of nonreciprocal entanglement between the two transmon qubits. Furthermore, applying a drive field to one of the qubit can stabilize the system into a nonreciprocal steady-state entangled state. Remarkably, the nonreciprocal interaction in this work does not rely on the presence of nonlinearity or complex configurations, which has more potential applications in designing nonreciprocal quantum devices, processing quantum information, and building quantum networks., Comment: 11 pages, 7 figures
Published: 2024

4. Spiking Transformer Hardware Accelerators in 3D Integration

Author: Xu, Boxun, Hwang, Junyoung, Vanna-iampikul, Pruek, Lim, Sung Kyu, and Li, Peng
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Hardware Architecture
Abstract: Spiking neural networks (SNNs) are powerful models of spatiotemporal computation and are well suited for deployment on resource-constrained edge devices and neuromorphic hardware due to their low power consumption. Leveraging attention mechanisms similar to those found in their artificial neural network counterparts, recently emerged spiking transformers have showcased promising performance and efficiency by capitalizing on the binary nature of spiking operations. Recognizing the current lack of dedicated hardware support for spiking transformers, this paper presents the first work on 3D spiking transformer hardware architecture and design methodology. We present an architecture and physical design co-optimization approach tailored specifically for spiking transformers. Through memory-on-logic and logic-on-logic stacking enabled by 3D integration, we demonstrate significant energy and delay improvements compared to conventional 2D CMOS integration.
Published: 2024

5. PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

Author: Liu, Yun, Li, Peng, Yan, Xuefeng, Nan, Liangliang, Wang, Bing, Chen, Honghua, Gong, Lina, Zhao, Wei, and Wei, Mingqiang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The core of self-supervised point cloud learning lies in setting up appropriate pretext tasks, to construct a pre-training framework that enables the encoder to perceive 3D objects effectively. In this paper, we integrate two prevalent methods, masked point modeling (MPM) and 3D-to-2D generation, as pretext tasks within a pre-training framework. We leverage the spatial awareness and precise supervision offered by these two methods to address their respective limitations: ambiguous supervision signals and insensitivity to geometric information. Specifically, the proposed framework, abbreviated as PointCG, consists of a Hidden Point Completion (HPC) module and an Arbitrary-view Image Generation (AIG) module. We first capture visible points from arbitrary views as inputs by removing hidden points. Then, HPC extracts representations of the inputs with an encoder and completes the entire shape with a decoder, while AIG is used to generate rendered images based on the visible points' representations. Extensive experiments demonstrate the superiority of the proposed method over the baselines in various downstream tasks. Our code will be made available upon acceptance.
Published: 2024

6. $B\to K\bar K(\pi\eta)h$ decays in the presence of isovector scalar resonances $a_0(980,1450)$

Author: Wang, Si-Yang, Zhang, Zhi-Qing, Sun, Zhi-Jie, Chai, Jian, and Li, Peng
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment
Abstract: Different from the previous treatment in a two-body framework, we introduce the dimeson distribution amplitudes (DAs) to describe the strong dynamics between the S-wave resonances $a_0(980, 1450)$ and the $K\bar K (\pi\eta)$ pair, where the Gegenbauer coefficient required is determined from the experimental data on the time-like form factors involved. The branching ratios and direct CP asymmetries of the decays $B \to a^{(\prime)}_0 h \to K\bar K(\pi\eta) h$, with $a_0=a_0(980)$, $a^{\prime}_0=a_0(1450)$ and $h$ referring to a pion or a kaon, are then calculated in the perturbative QCD (PQCD) approach. We find that the branching ratios of the corresponding quasi-two-body decays $B\to a^{(\prime)}_0 K$ obtained with the narrow width approximation are closer to those predicted in the QCD factorization (QCDF) approach compared to the previous PQCD calculations, no matter a three-body or a two-body framework is assumed. Furthermore, all our predictions for these $B\to a^{(\prime)}_0 K$ decays are below the current experimental upper limits except for those of decays $B^0\to a^{(\prime)-}_0K^+$, which are (slightly) larger than the upper limits. Under the narrow width approximation, the branching ratios of the decays $B^+\to a^{(\prime)+}_0\pi^0$, $B^0\to a^{(\prime)+}_0\pi^-$ and $B^0\to a^{(\prime)0}_0\pi^0$ are comparable to or agree well with the previous PQCD and the QCDF calculations. While for the decays $B^+\to a^{(\prime)0}_0\pi^+$ and $B^0\to a^{(\prime)-}_0\pi^+$, their branching ratios are predicted to be unexpectedly large, for example, the obtained branching ratio of decay $B^+\to a^0_0\pi^+$ is even higher than the current experimental upper limit., Comment: 22 pages, 2 figures
Published: 2024

7. StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Author: Lin, Junming, Fang, Zheng, Chen, Chi, Wan, Zihao, Luo, Fuwen, Li, Peng, Liu, Yang, and Sun, Maosong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The rapid development of Multimodal Large Language Models (MLLMs) has expanded their capabilities from image comprehension to video understanding. However, most of these MLLMs focus primarily on offline video comprehension, necessitating extensive processing of all video frames before any queries can be made. This presents a significant gap compared to the human ability to watch, listen, think, and respond to streaming inputs in real time, highlighting the limitations of current MLLMs. In this paper, we introduce StreamingBench, the first comprehensive benchmark designed to evaluate the streaming video understanding capabilities of MLLMs. StreamingBench assesses three core aspects of streaming video understanding: (1) real-time visual understanding, (2) omni-source understanding, and (3) contextual understanding. The benchmark consists of 18 tasks, featuring 900 videos and 4,500 human-curated QA pairs. Each video features five questions presented at different time points to simulate a continuous streaming scenario. We conduct experiments on StreamingBench with 13 open-source and proprietary MLLMs and find that even the most advanced proprietary MLLMs like Gemini 1.5 Pro and GPT-4o perform significantly below human-level streaming video understanding capabilities. We hope our work can facilitate further advancements for MLLMs, empowering them to approach human-level video comprehension and interaction in more realistic scenarios.
Published: 2024

8. Trustworthy Federated Learning: Privacy, Security, and Beyond

Author: Chen, Chunlu, Liu, Ji, Tan, Haowen, Li, Xingjian, Wang, Kevin I-Kai, Li, Peng, Sakurai, Kouichi, and Dou, Dejing
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: While recent years have witnessed the advancement in big data and Artificial Intelligence (AI), it is of much importance to safeguard data privacy and security. As an innovative approach, Federated Learning (FL) addresses these concerns by facilitating collaborative model training across distributed data sources without transferring raw data. However, the challenges of robust security and privacy across decentralized networks catch significant attention in dealing with the distributed data in FL. In this paper, we conduct an extensive survey of the security and privacy issues prevalent in FL, underscoring the vulnerability of communication links and the potential for cyber threats. We delve into various defensive strategies to mitigate these risks, explore the applications of FL across different sectors, and propose research directions. We identify the intricate security challenges that arise within the FL frameworks, aiming to contribute to the development of secure and efficient FL systems., Comment: 32 pages, to appear in KAIS
Published: 2024

9. A versatile framework for attitude tuning of beamlines at advanced light sources

Author: Li, Peng-Cheng, Bi, Xiao-Xue, Zhang, Zhen, Deng, Xiao-Bao, Li, Chun, Wang, Li-Wen, Liu, Gong-Fa, Zhang, Yi, Zhou, Ai-Yu, and Liu, Yu
Subjects: Physics - Instrumentation and Detectors, High Energy Physics - Experiment
Abstract: Aside from regular beamline experiments at light sources, the preparation steps before these experiments are also worth systematic consideration in terms of automation; a representative category in these steps is attitude tuning, which typically appears in names like beam focusing, sample alignment etc. With the goal of saving time and manpower in both writing and using in mind, a Mamba-based attitude-tuning framework is created. It supports flexible input/output ports, easy integration of diverse evaluation functions, and free selection of optimisation algorithms; with the help from Mamba's infrastructure, machine learning (ML) and artificial intelligence (AI) technologies can also be readily integrated. The tuning of a polycapillary lens and of an X-ray emission spectrometer are given as examples for the general use of this framework, featuring powerful command-line interfaces (CLIs) and friendly graphical user interfaces (GUIs) that allow comfortable human-in-the-loop control. The tuning of a Raman spectrometer demonstrates more specialised use of the framework with customised optimisation algorithms. With similar applications in mind, our framework is estimated to be capable of fulfilling a majority of attitude-tuning needs. Also reported is a virtual-beamline mechanism based on easily customisable simulated detectors and motors, which facilitates both testing for developers and training for users., Comment: 12 pages, 8 figures
Published: 2024

10. Beyond the EPICS: comprehensive Python IOC development with QueueIOC

Author: Li, Peng-Cheng, Bi, Xiao-Xue, Huang, Ying-Ke, Zhang, Dian-Shuai, Deng, Xiao-Bao, Zhang, Qun, Lei, Ge, Li, Gang, and Liu, Yu
Subjects: Physics - Instrumentation and Detectors, High Energy Physics - Experiment
Abstract: Architectural deficiencies in EPICS lead to inefficiency in the development and application of EPICS input/output controllers (IOCs). An unintrusive solution is replacing EPICS IOCs with more maintainable and flexible Python IOCs, only reusing the Channel Access (CA) protocol of EPICS. After a digression about GUI development inspired by EPICS operator interfaces (OPIs), the structural similarity between standalone GUI backends, the Mamba backend, EPICS IOCs and other server programs is analysed. By combining the caproto library and event loops like those in these server programs, the QueueIOC framework for Python IOCs is created, which has the potential to systematically replace most EPICS IOCs currently used. Examples are first given for workalikes of StreamDevice and asyn; examples for seq-like applications include monochromators, motor anti-bumping and motor multiplexing. Also shown is software to use with the ~/iocBoot convention which addresses some issues with a similar solution based on procServ, along with a workalike of procServControl. A QueueIOC-based framework for detector integration, which overcomes areaDetector's limitations in performance and architecture, is presented in an accompanying paper., Comment: 14 pages, 8 figures
Published: 2024

11. Detector integration at HEPS: a systematic, efficient and high-performance approach

Author: Zhang, Qun, Li, Peng-Cheng, Bian, Ling-Zhu, Li, Chun, Yue, Zong-Yang, Zhang, Cheng-Long, Zhao, Zhuo-Feng, Zhang, Yi, Li, Gang, Zhou, Ai-Yu, and Liu, Yu
Subjects: Physics - Instrumentation and Detectors, High Energy Physics - Experiment
Abstract: At least 25 kinds of detector-like devices need to be integrated in Phase I of the High Energy Photon Source (HEPS), and the work needs to be carefully planned to maximise productivity with highly limited human resources. After a systematic analysis on the actual work involved in detector integration, a separation of concerns between collaborating groups of personnel is established to minimise the duplication of efforts. To facilitate software development for detector integration, the ADGenICam library, which abstracts repeated code in EPICS modules for cameras, is extended to support a much wider range of detectors. An increasingly considerable fraction of detectors, both inside and outside HEPS, offer performance that exceed capabilities of the areaDetector framework in EPICS. Given this background, areaDetector's limitations in performance and architecture are analysed, and a QueueIOC -based framework that overcomes these limitations is introduced. A simple, flexible ZeroMQ-based protocol is used for data transport in this framework, while RDMA transport and multi-node readout will be explored for higher data throughputs. By calling C/C++ libraries from within Python, the performance of the former and the expressiveness of the latter can coexist nicely; the expressiveness allows for much higher efficiency in the implementation and use of integration modules functionally comparable to their EPICS counterparts., Comment: 11 pages, 3 figures
Published: 2024

12. Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles

Author: Wang, Zhengming, Wang, Junli, Li, Pengfei, Li, Zhaohan, Li, Peng, and Chen, Yilun
Subjects: Computer Science - Robotics, Computer Science - Artificial Intelligence
Abstract: While the capabilities of autonomous driving have advanced rapidly, merging into dense traffic remains a significant challenge, many motion planning methods for this scenario have been proposed but it is hard to evaluate them. Most existing closed-loop simulators rely on rule-based controls for other vehicles, which results in a lack of diversity and randomness, thus failing to accurately assess the motion planning capabilities in highly interactive scenarios. Moreover, traditional evaluation metrics are insufficient for comprehensively evaluating the performance of merging in dense traffic. In response, we proposed a closed-loop evaluation benchmark for assessing motion planning capabilities in merging scenarios. Our approach involves other vehicles trained in large scale datasets with micro-behavioral characteristics that significantly enhance the complexity and diversity. Additionally, we have restructured the evaluation mechanism by leveraging large language models to assess each autonomous vehicle merging onto the main road. Extensive experiments have demonstrated the advanced nature of this evaluation benchmark. Through this benchmark, we have obtained an evaluation of existing methods and identified common issues. The environment and vehicle motion planning models we have designed can be accessed at https://anonymous.4open.science/r/Bench4Merge-EB5D, Comment: 6 pages, 7 figures, IEEE international conference on robotics and automation
Published: 2024

13. Efficient Deep Learning Board: Training Feedback Is Not All You Need

Author: Gong, Lina, Gao, Qi, Li, Peng, Wei, Mingqiang, and Wu, Fei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Current automatic deep learning (i.e., AutoDL) frameworks rely on training feedback from actual runs, which often hinder their ability to provide quick and clear performance predictions for selecting suitable DL systems. To address this issue, we propose EfficientDL, an innovative deep learning board designed for automatic performance prediction and component recommendation. EfficientDL can quickly and precisely recommend twenty-seven system components and predict the performance of DL models without requiring any training feedback. The magic of no training feedback comes from our proposed comprehensive, multi-dimensional, fine-grained system component dataset, which enables us to develop a static performance prediction model and comprehensive optimized component recommendation algorithm (i.e., {\alpha}\b{eta}-BO search), removing the dependency on actually running parameterized models during the traditional optimization search process. The simplicity and power of EfficientDL stem from its compatibility with most DL models. For example, EfficientDL operates seamlessly with mainstream models such as ResNet50, MobileNetV3, EfficientNet-B0, MaxViT-T, Swin-B, and DaViT-T, bringing competitive performance improvements. Besides, experimental results on the CIFAR-10 dataset reveal that EfficientDL outperforms existing AutoML tools in both accuracy and efficiency (approximately 20 times faster along with 1.31% Top-1 accuracy improvement than the cutting-edge methods). Source code, pretrained models, and datasets are available at https://github.com/OpenSELab/EfficientDL.
Published: 2024

14. Semileptonic and nonleptonic decays of $B_{u,d,s,c}^{*}$ in the covariant light-front approach

Author: Wang, Si-Yang, Yang, You-Ya, Sun, Zhi-Jie, Yang, Hao, Li, Peng, and Zhang, Zhi-Qing
Subjects: High Energy Physics - Phenomenology
Abstract: The semileptonic and nonleptonic decays of the b-flavor vector mesons $B^{*}_{u,d,s}$ and $B_{c}^{*}$ are investigated within the covariant light-front quark model (CLFQM). By calculating the form factors of the transitions $B_{u, d, s, c}^{*}\to P$ under the CLFQM, with $P$ denoting a pseudoscalar meson, i.e., $\pi, K, \eta_c(1S,2S), D_{(s)}, B_{(s)}$, we predict and discuss several physical observables, including the branching ratios, polarization fractions $f_{L}, f_{\|}$, and forward-backward asymmetries $A_{FB}$. The total widths of the single-photon radiative decay channels for these b-flavor vector mesons are estimated using their partial widths. In these considered decays, one can find that the semileptonic decays $B_{s}^{*0}\to D_{s}^{-}\ell^{\prime+}{\nu}_{\ell^\prime}$ and $B_{c}^{*+}\to B_{s}^{0}\ell^{\prime+}{\nu}_{\ell^\prime}, \eta_{c}\ell^{\prime+}{\nu}_{\ell^\prime}$, with $\ell^\prime$ being $e$ or $\tau$, and the nonleptonic channels $B_{c}^{*+}\to B^0_{s} \pi^{+}, B^0_{s} \rho^{+}$ have the largest branching ratios, which can reach up to the $10^{-7}$ order, and are most likely to be accessible at the future high-luminosity LHCb and Belle-II experiments., Comment: 30 pages, 4 figures,accepted for publication in Chin. Phys. C
Published: 2024

15. Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking

Author: Zhang, Wei, Li, Pengfei, Wang, Junli, Sun, Bingchuan, Jin, Qihao, Bao, Guangjun, Rui, Shibo, Yu, Yang, Ding, Wenchao, Li, Peng, and Chen, Yilun
Subjects: Computer Science - Robotics
Abstract: Automatic Emergency Braking (AEB) systems are a crucial component in ensuring the safety of passengers in autonomous vehicles. Conventional AEB systems primarily rely on closed-set perception modules to recognize traffic conditions and assess collision risks. To enhance the adaptability of AEB systems in open scenarios, we propose Dual-AEB, a system combines an advanced multimodal large language model (MLLM) for comprehensive scene understanding and a conventional rule-based rapid AEB to ensure quick response times. To the best of our knowledge, Dual-AEB is the first method to incorporate MLLMs within AEB systems. Through extensive experimentation, we have validated the effectiveness of our method. The source code will be available at https://github.com/ChipsICU/Dual-AEB.
Published: 2024

16. Optimized Magnetic Resonance Fingerprinting Using Ziv-Zakai Bound

Author: Gong, Chaoguang, Hu, Yue, Li, Peng, Zou, Lixian, Liu, Congcong, Zhou, Yihang, Zhu, Yanjie, Liang, Dong, and Wang, Haifeng
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Quantitative Biology - Quantitative Methods, Statistics - Applications
Abstract: Magnetic Resonance Fingerprinting (MRF) has emerged as a promising quantitative imaging technique within the field of Magnetic Resonance Imaging (MRI), offers comprehensive insights into tissue properties by simultaneously acquiring multiple tissue parameter maps in a single acquisition. Sequence optimization is crucial for improving the accuracy and efficiency of MRF. In this work, a novel framework for MRF sequence optimization is proposed based on the Ziv-Zakai bound (ZZB). Unlike the Cram\'er-Rao bound (CRB), which aims to enhance the quality of a single fingerprint signal with deterministic parameters, ZZB provides insights into evaluating the minimum mismatch probability for pairs of fingerprint signals within the specified parameter range in MRF. Specifically, the explicit ZZB is derived to establish a lower bound for the discrimination error in the fingerprint signal matching process within MRF. This bound illuminates the intrinsic limitations of MRF sequences, thereby fostering a deeper understanding of existing sequence performance. Subsequently, an optimal experiment design problem based on ZZB was formulated to ascertain the optimal scheme of acquisition parameters, maximizing discrimination power of MRF between different tissue types. Preliminary numerical experiments show that the optimized ZZB scheme outperforms both the conventional and CRB schemes in terms of the reconstruction accuracy of multiple parameter maps., Comment: Accepted at 2024 IEEE International Conference on Imaging Systems and Techniques (IST 2024)
Published: 2024

17. ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Author: Wang, Ziyue, Chen, Chi, Luo, Fuwen, Dong, Yurui, Zhang, Yuanchi, Xu, Yuzhuang, Wang, Xiaolong, Li, Peng, and Liu, Yang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Active perception, a crucial human capability, involves setting a goal based on the current understanding of the environment and performing actions to achieve that goal. Despite significant efforts in evaluating Multimodal Large Language Models (MLLMs), active perception has been largely overlooked. To address this gap, we propose a novel benchmark named ActiView to evaluate active perception in MLLMs. Since comprehensively assessing active perception is challenging, we focus on a specialized form of Visual Question Answering (VQA) that eases the evaluation yet challenging for existing MLLMs. Given an image, we restrict the perceptual field of a model, requiring it to actively zoom or shift its perceptual field based on reasoning to answer the question successfully. We conduct extensive evaluation over 27 models, including proprietary and open-source models, and observe that the ability to read and comprehend multiple images simultaneously plays a significant role in enabling active perception. Results reveal a significant gap in the active perception capability of MLLMs, indicating that this area deserves more attention. We hope that our benchmark could help develop methods for MLLMs to understand multimodal inputs in more natural and holistic ways.
Published: 2024

18. Dissipation-accelerated entanglement generation

Author: Zheng, Xiao-Wei, Zheng, Jun-Cong, Pan, Xue-Feng, Lin, Li-Hua, Han, Pei-Rong, and Li, Peng-Bo
Subjects: Quantum Physics
Abstract: Dissipation is usually considered a negative factor for observing quantum effects and for harnessing them for quantum technologies. Here we propose a scheme for speeding up the generation of quantum entanglement between two coupled qubits by introducing a strong dissipation channel to one of these qubits. The maximal entanglement is conditionally established by evenly distributing a single excitation between these two qubits. When the excitation is initially held by the dissipative qubit, the dissipation accelerates the excitation re-distribution process for the quantum state trajectory without quantum jumps. Our results show that the time needed to conditionally attain the maximal entanglement is monotonously decreased as the dissipative rate is increased. We further show that this scheme can be generalized to accelerate the production of the W state for the three-qubit system, where one NH qubit is symmetrically coupled to two Hermitian qubits., Comment: 2 figures
Published: 2024

19. Stable massless scalar polarization of f(R) gravity

Author: Du, Xin-Dong and Li, Peng-Cheng
Subjects: General Relativity and Quantum Cosmology, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: Polarization is a prominent feature of gravitational wave observation, and it can be used to distinguish different modified gravities. Compared to General Relativity, f(R) gravity has an additional polarization coming from scalar field, which is a mix of the longitudinal and the breathing modes. When the scalar mass of f(R) is zero, the mixed mode will reduce to a pure breathing mode with the disappearance of the longitudinal mode. However, the reducing seems not to be allowed because a positive scalar mass is often needed to maintain the stability of the cosmological perturbation. In fact, the massless case is possible to lead to a stable perturbation, but more detailed constraints need to be considered. For the completeness of the polarization analysis, we explore the possibility that there is a stable massless scalar polarization in viable f(R) models for dark energy. We find that the existence of the massless scalar polarization depends not only on the number of free parameters but also on the model structure., Comment: 10 pages, 5 figures
Published: 2024

20. Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

Author: Wang, Yu, Yin, Yuxuan, and Li, Peng
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Semi-supervised learning (SSL) commonly exhibits confirmation bias, where models disproportionately favor certain classes, leading to errors in predicted pseudo labels that accumulate under a self-training paradigm. Unlike supervised settings, which benefit from a rich, static data distribution, SSL inherently lacks mechanisms to correct this self-reinforced bias, necessitating debiased interventions at each training step. Although the generation of debiased pseudo labels has been extensively studied, their effective utilization remains underexplored. Our analysis indicates that data from biased classes should have a reduced influence on parameter updates, while more attention should be given to underrepresented classes. To address these challenges, we introduce TaMatch, a unified framework for debiased training in SSL. TaMatch employs a scaling ratio derived from both a prior target distribution and the model's learning status to estimate and correct bias at each training step. This ratio adjusts the raw predictions on unlabeled data to produce debiased pseudo labels. In the utilization phase, these labels are differently weighted according to their predicted class, enhancing training equity and minimizing class bias. Additionally, TaMatch dynamically adjust the target distribution in response to the model's learning progress, facilitating robust handling of practical scenarios where the prior distribution is unknown. Empirical evaluations show that TaMatch significantly outperforms existing state-of-the-art methods across a range of challenging image classification tasks, highlighting the critical importance of both the debiased generation and utilization of pseudo labels in SSL., Comment: 11 pages, 4 figures
Published: 2024

21. DataGpt-SQL-7B: An Open-Source Language Model for Text-to-SQL

Author: Wu, Lixia, Li, Peng, Lou, Junhong, and Fu, Lei
Subjects: Computer Science - Artificial Intelligence
Abstract: In addressing the pivotal role of translating natural language queries into SQL commands, we propose a suite of compact, fine-tuned models and self-refine mechanisms to democratize data access and analysis for non-expert users, mitigating risks associated with closed-source Large Language Models. Specifically, we constructed a dataset of over 20K sample for Text-to-SQL as well as the preference dateset, to improve the efficiency in the domain of SQL generation. To further ensure code validity, a code corrector was integrated into the model. Our system, DataGpt-sql, achieved 87.2\% accuracy on the spider-dev, respectively, showcasing the effectiveness of our solution in text-to-SQL conversion tasks. Our code, data, and models are available at \url{https://github.com/CainiaoTechAi/datagpt-sql-7b}
Published: 2024

22. Unveiling High Selectivity Origin of Pt-Bi Catalysts for Alkaline Methanol Electrooxidation via CO-free pathway

Author: Liang, Lecheng, Li, Hengyu, Li, Peng, Liang, Jinhui, Ye, Shao, Zeng, Binwen, Xie, Yanhong, Wang, Yucheng, Ozaki, Taisuke, Chen, Shengli, and Cui, Zhiming
Subjects: Physics - Chemical Physics
Abstract: A long-standing puzzle for methanol electrooxidation is how to achieve a CO-free pathway and accurately understand the origin of electrocatalytic selectivity. Herein, we unequivocally demonstrate that the Bi-modified Pt/C follows a CO-free dominated pathway during alkaline methanol electrooxidation, and unveil the formaldehyde (HCHO) intermediate as a critical factor influencing pathway selectivity. These findings are substantiated by kinetic isotope effects, formate Faradaic efficiency, in situ spectroscopy, ab initio molecular dynamic simulations, and density functional theory calculations. Bi modification significantly increases the HCHO dehydrogenation barrier, which facilitates its desorption and subsequent conversion to the H2COOH- anion at the alkaline interface, intrinsically avoiding CO formation. More specifically, the formation of ensemble sites featuring V-shaped Bi-Pt-Bi configuration inhibits the cleavage of C-H bond, and the weak OH binding energy at Bi adatoms effectively prevents blockage of oxygenated species, allowing such ensemble sites to fulfill their functional role. Our study opens up a novel dimension for designing advanced CO-free catalysts., Comment: 16 pages, 6 figures for main text. 54 pages, 47 figures for supplementary material
Published: 2024

23. DS2TA: Denoising Spiking Transformer with Attenuated Spatiotemporal Attention

Author: Xu, Boxun, Geng, Hejia, Yin, Yuxuan, and Li, Peng
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Vision Transformers (ViT) are current high-performance models of choice for various vision applications. Recent developments have given rise to biologically inspired spiking transformers that thrive in ultra-low power operations on neuromorphic hardware, however, without fully unlocking the potential of spiking neural networks. We introduce DS2TA, a Denoising Spiking transformer with attenuated SpatioTemporal Attention, designed specifically for vision applications. DS2TA introduces a new spiking attenuated spatiotemporal attention mechanism that considers input firing correlations occurring in both time and space, thereby fully harnessing the computational power of spiking neurons at the core of the transformer architecture. Importantly, DS2TA facilitates parameter-efficient spatiotemporal attention computation without introducing extra weights. DS2TA employs efficient hashmap-based nonlinear spiking attention denoisers to enhance the robustness and expressive power of spiking attention maps. DS2TA demonstrates state-of-the-art performances on several widely adopted static image and dynamic neuromorphic datasets. Operated over 4 time steps, DS2TA achieves 94.92% top-1 accuracy on CIFAR10 and 77.47% top-1 accuracy on CIFAR100, as well as 79.1% and 94.44% on CIFAR10-DVS and DVS-Gesture using 10 time steps., Comment: arXiv admin note: text overlap with arXiv:2311.09376
Published: 2024

24. FedLF: Adaptive Logit Adjustment and Feature Optimization in Federated Long-Tailed Learning

Author: Lu, Xiuhua, Li, Peng, and Jiang, Xuefeng
Subjects: Computer Science - Machine Learning
Abstract: Federated learning offers a paradigm to the challenge of preserving privacy in distributed machine learning. However, datasets distributed across each client in the real world are inevitably heterogeneous, and if the datasets can be globally aggregated, they tend to be long-tailed distributed, which greatly affects the performance of the model. The traditional approach to federated learning primarily addresses the heterogeneity of data among clients, yet it fails to address the phenomenon of class-wise bias in global long-tailed data. This results in the trained model focusing on the head classes while neglecting the equally important tail classes. Consequently, it is essential to develop a methodology that considers classes holistically. To address the above problems, we propose a new method FedLF, which introduces three modifications in the local training phase: adaptive logit adjustment, continuous class centred optimization, and feature decorrelation. We compare seven state-of-the-art methods with varying degrees of data heterogeneity and long-tailed distribution. Extensive experiments on benchmark datasets CIFAR-10-LT and CIFAR-100-LT demonstrate that our approach effectively mitigates the problem of model performance degradation due to data heterogeneity and long-tailed distribution. our code is available at https://github.com/18sym/FedLF., Comment: Accepted by ACML 2024
Published: 2024

25. PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Author: Li, Peng, Zheng, Wangguandong, Liu, Yuan, Yu, Tao, Li, Yangguang, Qi, Xingqun, Li, Mengfei, Chi, Xiaowei, Xia, Siyu, Xue, Wei, Luo, Wenhan, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Detailed and photorealistic 3D human modeling is essential for various applications and has seen tremendous progress. However, full-body reconstruction from a monocular RGB image remains challenging due to the ill-posed nature of the problem and sophisticated clothing topology with self-occlusions. In this paper, we propose PSHuman, a novel framework that explicitly reconstructs human meshes utilizing priors from the multiview diffusion model. It is found that directly applying multiview diffusion on single-view human images leads to severe geometric distortions, especially on generated faces. To address it, we propose a cross-scale diffusion that models the joint probability distribution of global full-body shape and local facial characteristics, enabling detailed and identity-preserved novel-view generation without any geometric distortion. Moreover, to enhance cross-view body shape consistency of varied human poses, we condition the generative model on parametric models like SMPL-X, which provide body priors and prevent unnatural views inconsistent with human anatomy. Leveraging the generated multi-view normal and color images, we present SMPLX-initialized explicit human carving to recover realistic textured human meshes efficiently. Extensive experimental results and quantitative evaluations on CAPE and THuman2.1 datasets demonstrate PSHumans superiority in geometry details, texture fidelity, and generalization capability.
Published: 2024

26. Possibility of the experimental study on semi-leptonic and non-leptonic $D^*_{(s)}$ weak decays

Author: Yang, Hao, Zhang, Zhi-Qing, Li, Peng, and Yang, You-Ya
Subjects: High Energy Physics - Phenomenology
Abstract: Just like other heavy flavor mesons, the weak decays of $D^*_{(s)}$ mesons can also provide a platform to check the Standard Model (SM), explore new physics (NP) and understand the mechanisms of weak interactions. At present, the theoretical and experimental researches on $D^*_{(s)}$ mesons are relatively limited. In addition to the dominant electromagnetic decays, the $D^*_{(s)}$ weak decays should also be feasible to explore the $D^*_{(s)}$ mesons. In this paper, we use the covariant light-front quark model (CLFQM) to study the branching ratios of the semi-leptonic decays $D^*_{(s)}\to P\ell^{+}\nu_{\ell}$ and the non-leptonic decays $D^*_{(s)}\to PP, PV$ with $P=\pi, K, \eta^{(\prime)}, V=\rho, K^*, \phi$ and $\ell=e, \mu$, which are within the range $10^{-13}\sim 10^{-6}$. Among these decays, the channels $D_{s}^{*+}\to\eta \ell^{+}\nu_{\ell}$ and $D^{*+}_{s}\to \eta\rho^{+}$ possess the largest branching ratios, which can reach up to $10^{-6}$ order. These decays are most likely to be accessible at the future high-luminosity experiments. One can find that the branching ratios $\mathcal{B}r(D_{s}^{*+}\to\eta \ell^{+}\nu_{\ell})=1.46\times10^{-6}$ and $\mathcal{B}r(D_{s}^{*+}\to\eta \rho^{+})=1.04\times10^{-6}$ correspond to tens of thousands of events in the $e^+e^-$ collider experiments, such as the STCF, CEPC and FCC-ee, and tens of millions of events at the HL-LHC. In a word, it is feasible to study the $D^*_{(s)}$ meson weak decays in the future experiments. Furthermore, we also predict and discuss another two physical observations, that is, the longitudinal polarization fraction $f_{L}$ and the forward-backward asymmetry $A_{FB}$, for our considered decays., Comment: 23 pages, 4 figures. arXiv admin note: text overlap with arXiv:2311.04431
Published: 2024

27. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Author: Liu, Xinyu, He, Yingqing, Guo, Lanqing, Li, Xiang, Jin, Bu, Li, Peng, Li, Yan, Chan, Chi-Min, Chen, Qifeng, Xue, Wei, Luo, Wenhan, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The potential for higher-resolution image generation using pretrained diffusion models is immense, yet these models often struggle with issues of object repetition and structural artifacts especially when scaling to 4K resolution and higher. We figure out that the problem is caused by that, a single prompt for the generation of multiple scales provides insufficient efficacy. In response, we propose HiPrompt, a new tuning-free solution that tackles the above problems by introducing hierarchical prompts. The hierarchical prompts offer both global and local guidance. Specifically, the global guidance comes from the user input that describes the overall content, while the local guidance utilizes patch-wise descriptions from MLLMs to elaborately guide the regional structure and texture generation. Furthermore, during the inverse denoising process, the generated noise is decomposed into low- and high-frequency spatial components. These components are conditioned on multiple prompt levels, including detailed patch-wise descriptions and broader image-level prompts, facilitating prompt-guided denoising under hierarchical semantic guidance. It further allows the generation to focus more on local spatial regions and ensures the generated images maintain coherent local and global semantics, structures, and textures with high definition. Extensive experiments demonstrate that HiPrompt outperforms state-of-the-art works in higher-resolution image generation, significantly reducing object repetition and enhancing structural quality., Comment: https://liuxinyv.github.io/HiPrompt/
Published: 2024

28. Flat Band Generation through Interlayer Geometric Frustration in Intercalated Transition Metal Dichalcogenides

Author: Peng, Yawen, He, Ren, Li, Peng, Zhdanovich, Sergey, Michiardi, Matteo, Gorovikov, Sergey, Zonno, Marta, Damascelli, Andrea, and Miao, Guo-Xing
Subjects: Condensed Matter - Materials Science
Abstract: Electronic flat bands can lead to rich many-body quantum phases by quenching the electron's kinetic energy and enhancing many-body correlation. The reduced bandwidth can be realized by either destructive quantum interference in frustrated lattices, or by generating heavy band folding with avoided band crossing in Moire superlattices. Here we propose a general approach to introduce flat bands into widely studied transition metal dichalcogenide (TMD) materials by dilute intercalation, featuring both destructive interference and band folding. A flat band with vanishing dispersion is observed by angle-resolved photoemission spectroscopy (ARPES) over the entire momentum space in intercalated Mn1/4TaS2. Polarization dependent ARPES measurements combined with symmetry analysis reveal the orbital characters of the flat bands. Supercell tight-binding simulations suggest that such flat bands arise from destructive interference between Mn and Ta wave functions on the S hopping pathways and are ubiquitous in a range of TMD families as well as in different intercalation configurations. Our findings establish a new material platform to manipulate flat band structures and explore their corresponding emergent correlated properties.
Published: 2024

29. NP-TCMtarget: a network pharmacology platform for exploring mechanisms of action of Traditional Chinese medicine

Author: Wang, Aoyi, Wang, Yingdong, Peng, Haoyang, Zhang, Haoran, Cheng, Caiping, Zhao, Jinzhong, Zhang, Wuxia, Chen, Jianxin, and Li, Peng
Subjects: Quantitative Biology - Molecular Networks
Abstract: The biological targets of traditional Chinese medicine (TCM) are the core effectors mediating the interaction between TCM and the human body. Identification of TCM targets is essential to elucidate the chemical basis and mechanisms of TCM for treating diseases. Given the chemical complexity of TCM, both in silico high-throughput drug-target interaction predicting models and biological profile-based methods have been commonly applied for identifying TCM targets based on the structural information of TCM chemical components and biological information, respectively. However, the existing methods lack the integration of TCM chemical and biological information, resulting in difficulty in the systematic discovery of TCM action pathways. To solve this problem, we propose a novel target identification model NP-TCMtarget to explore the TCM target path by combining the overall chemical and biological profiles. First, NP-TCMtarget infers TCM effect targets by calculating associations between drug/disease inducible gene expression profiles and specific gene signatures for 8,233 targets. Then, NP-TCMtarget utilizes a constructed binary classification model to predict binding targets of herbal ingredients. Finally, we can distinguish TCM direct and indirect targets by comparing the effect targets and binding targets to establish the action pathways of herbal components-direct targets-indirect targets by mapping TCM targets in the biological molecular network. We apply NP-TCMtarget to the formula XiaoKeAn to demonstrate the power of revealing the action pathways of herbal formula. We expect that this novel model could provide a systematic framework for exploring the molecular mechanisms of TCM at the target level. NP-TCMtarget is available at http://www.bcxnfz.top/NP-TCMtarget., Comment: 29 pages, 4 figures
Published: 2024

30. Visual-Friendly Concept Protection via Selective Adversarial Perturbations

Author: Mi, Xiaoyue, Tang, Fan, Cao, Juan, Li, Peng, and Liu, Yang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Personalized concept generation by tuning diffusion models with a few images raises potential legal and ethical concerns regarding privacy and intellectual property rights. Researchers attempt to prevent malicious personalization using adversarial perturbations. However, previous efforts have mainly focused on the effectiveness of protection while neglecting the visibility of perturbations. They utilize global adversarial perturbations, which introduce noticeable alterations to original images and significantly degrade visual quality. In this work, we propose the Visual-Friendly Concept Protection (VCPro) framework, which prioritizes the protection of key concepts chosen by the image owner through adversarial perturbations with lower perceptibility. To ensure these perturbations are as inconspicuous as possible, we introduce a relaxed optimization objective to identify the least perceptible yet effective adversarial perturbations, solved using the Lagrangian multiplier method. Qualitative and quantitative experiments validate that VCPro achieves a better trade-off between the visibility of perturbations and protection effectiveness, effectively prioritizing the protection of target concepts in images with less perceptible perturbations., Comment: Under Review
Published: 2024

31. Quasinormal modes of accelerating spacetime

Author: Zhou, Tao and Li, Peng-Cheng
Subjects: General Relativity and Quantum Cosmology
Abstract: We calculate the exact values of the quasinormal frequencies for massless perturbations with spin $s\leq2$ moving in pure accelerating spacetime. We use two different methods to transfer the perturbation equations into the form of hypergeometric differential equations and obtain the same quasinormal frequencies. These purely imaginary spectra are shown to be independent of the spin of the perturbation and match those of the so-called acceleration modes of accelerating black holes after taking the Minkowski limit. This implies that the acceleration modes actually originate from the pure accelerating spacetime and the appearance of black holes would deform the spectra. In addition, we calculate the quasinormal frequencies of scalar, electromagnetic and gravitational perturbations of $D$-dimensional de Sitter spacetime and compare them with previous results to verify the validity of our method., Comment: 12 pages. arXiv admin note: text overlap with arXiv:gr-qc/0605027 by other authors
Published: 2024

32. Data-Efficient Prediction of Minimum Operating Voltage via Inter- and Intra-Wafer Variation Alignment

Author: Yin, Yuxuan, Chen, Rebecca, He, Chen, and Li, Peng
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Predicting the minimum operating voltage ($V_{min}$) of chips stands as a crucial technique in enhancing the speed and reliability of manufacturing testing flow. However, existing $V_{min}$ prediction methods often overlook various sources of variations in both training and deployment phases. Notably, the neglect of wafer zone-to-zone (intra-wafer) variations and wafer-to-wafer (inter-wafer) variations, compounded by process variations, diminishes the accuracy, data efficiency, and reliability of $V_{min}$ predictors. To address this gap, we introduce a novel data-efficient $V_{min}$ prediction flow, termed restricted bias alignment (RBA), which incorporates a novel variation alignment technique. Our approach concurrently estimates inter- and intra-wafer variations. Furthermore, we propose utilizing class probe data to model inter-wafer variations for the first time. We empirically demonstrate RBA's effectiveness and data efficiency on an industrial 16nm automotive chip dataset.
Published: 2024

33. Evidence of electron interaction with an unidentified bosonic mode in superconductor CsCa$_2$Fe$_4$As$_4$F$_2$

Author: Li, Peng, Liao, Sen, Wang, Zhicheng, Li, Huaxun, Su, Shiwu, Zhang, Jiakang, Chen, Ziyuan, Jiang, Zhicheng, Liu, Zhengtai, Yang, Lexian, Huai, Linwei, He, Junfeng, Cui, Shengtao, Sun, Zhe, Yan, Yajun, Cao, Guanghan, Shen, Dawei, Jiang, Juan, and Feng, Donglai
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Strongly Correlated Electrons
Abstract: The kink structure in band dispersion usually refers to a certain electron-boson interaction, which is crucial in understanding the pairing in unconventional superconductors. Here we report the evidence of the observation of a kink structure in Fe-based superconductor CsCa$_2$Fe$_4$As$_4$F$_2$ using angle-resolved photoemission spectroscopy. The kink shows an orbital selective and momentum dependent behavior, which is located at 15 meV below Fermi level along the Gamma-M direction at the band with dxz orbital character and vanishes when approaching the Gamma-X direction, correlated with a slight decrease of the superconducting gap. Most importantly, this kink structure disappears when the superconducting gap closes, indicating that the corresponding bosonic mode (9 meV) is closely related to superconductivity. However, the origin of this mode remains unidentified, since it cannot be related to phonons or the spin resonance mode (15 meV) observed by inelastic neutron scattering. The behavior of this mode is rather unique and challenges our present understanding of the superconducting paring mechanism of the bilayer FeAs-based superconductors., Comment: 14 pages, 4 figures
Published: 2024
Full Text: View/download PDF

34. On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition

Author: Wang, Zihu, Liu, Lingqiao, Weston, Scott Ricardo Figueroa, Tian, Samuel, and Li, Peng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Self-Supervised Learning (SSL) has become a prominent approach for acquiring visual representations across various tasks, yet its application in fine-grained visual recognition (FGVR) is challenged by the intricate task of distinguishing subtle differences between categories. To overcome this, we introduce an novel strategy that boosts SSL's ability to extract critical discriminative features vital for FGVR. This approach creates synthesized data pairs to guide the model to focus on discriminative features critical for FGVR during SSL. We start by identifying non-discriminative features using two main criteria: features with low variance that fail to effectively separate data and those deemed less important by Grad-CAM induced from the SSL loss. We then introduce perturbations to these non-discriminative features while preserving discriminative ones. A decoder is employed to reconstruct images from both perturbed and original feature vectors to create data pairs. An encoder is trained on such generated data pairs to become invariant to variations in non-discriminative dimensions while focusing on discriminative features, thereby improving the model's performance in FGVR tasks. We demonstrate the promising FGVR performance of the proposed approach through extensive evaluation on a wide variety of datasets., Comment: Accepted by ECCV 2024
Published: 2024

35. Incremental high average-utility itemset mining: survey and challenges

Author: Chen, Jing, Yang, Shengyi, Ding, Weiping, Li, Peng, Liu, Aijun, Zhang, Hongjun, and Li, Tian
Subjects: Computer Science - Databases
Abstract: The High Average Utility Itemset Mining (HAUIM) technique, a variation of High Utility Itemset Mining (HUIM), uses the average utility of the itemsets. Historically, most HAUIM algorithms were designed for static databases. However, practical applications like market basket analysis and business decision-making necessitate regular updates of the database with new transactions. As a result, researchers have developed incremental HAUIM (iHAUIM) algorithms to identify HAUIs in a dynamically updated database. Contrary to conventional methods that begin from scratch, the iHAUIM algorithm facilitates incremental changes and outputs, thereby reducing the cost of discovery. This paper provides a comprehensive review of the state-of-the-art iHAUIM algorithms, analyzing their unique characteristics and advantages. First, we explain the concept of iHAUIM, providing formulas and real-world examples for a more in-depth understanding. Subsequently, we categorize and discuss the key technologies used by varying types of iHAUIM algorithms, encompassing Apriori-based, Tree-based, and Utility-list-based techniques. Moreover, we conduct a critical analysis of each mining method's advantages and disadvantages. In conclusion, we explore potential future directions, research opportunities, and various extensions of the iHAUIM algorithm., Comment: 25 pages, 23 figures
Published: 2024
Full Text: View/download PDF

36. ADO-LLM: Analog Design Bayesian Optimization with In-Context Learning of Large Language Models

Author: Yin, Yuxuan, Wang, Yu, Xu, Boxun, and Li, Peng
Subjects: Computer Science - Machine Learning
Abstract: Analog circuit design requires substantial human expertise and involvement, which is a significant roadblock to design productivity. Bayesian Optimization (BO), a popular machine learning based optimization strategy, has been leveraged to automate analog design given its applicability across various circuit topologies and technologies. Traditional BO methods employ black box Gaussian Process surrogate models and optimized labeled data queries to find optimization solutions by trading off between exploration and exploitation. However, the search for the optimal design solution in BO can be expensive from both a computational and data usage point of view, particularly for high dimensional optimization problems. This paper presents ADO-LLM, the first work integrating large language models (LLMs) with Bayesian Optimization for analog design optimization. ADO-LLM leverages the LLM's ability to infuse domain knowledge to rapidly generate viable design points to remedy BO's inefficiency in finding high value design areas specifically under the limited design space coverage of the BO's probabilistic surrogate model. In the meantime, sampling of design points evaluated in the iterative BO process provides quality demonstrations for the LLM to generate high quality design points while leveraging infused broad design knowledge. Furthermore, the diversity brought by BO's exploration enriches the contextual understanding of the LLM and allows it to more broadly search in the design space and prevent repetitive and redundant suggestions. We evaluate the proposed framework on two different types of analog circuits and demonstrate notable improvements in design efficiency and effectiveness., Comment: 8 pages, 3 figures
Published: 2024

37. Frequency ratio of the $^{229\mathrm{m}}$Th nuclear isomeric transition and the $^{87}$Sr atomic clock

Author: Zhang, Chuankun, Ooi, Tian, Higgins, Jacob S., Doyle, Jack F., von der Wense, Lars, Beeks, Kjeld, Leitner, Adrian, Kazakov, Georgy, Li, Peng, Thirolf, Peter G., Schumm, Thorsten, and Ye, Jun
Subjects: Physics - Atomic Physics, Nuclear Experiment, Physics - Optics, Quantum Physics
Abstract: Optical atomic clocks$^{1,2}$ use electronic energy levels to precisely keep track of time. A clock based on nuclear energy levels promises a next-generation platform for precision metrology and fundamental physics studies. Thorium-229 nuclei exhibit a uniquely low energy nuclear transition within reach of state-of-the-art vacuum ultraviolet (VUV) laser light sources and have therefore been proposed for construction of the first nuclear clock$^{3,4}$. However, quantum state-resolved spectroscopy of the $^{229m}$Th isomer to determine the underlying nuclear structure and establish a direct frequency connection with existing atomic clocks has yet to be performed. Here, we use a VUV frequency comb to directly excite the narrow $^{229}$Th nuclear clock transition in a solid-state CaF$_2$ host material and determine the absolute transition frequency. We stabilize the fundamental frequency comb to the JILA $^{87}$Sr clock$^2$ and coherently upconvert the fundamental to its 7th harmonic in the VUV range using a femtosecond enhancement cavity. This VUV comb establishes a frequency link between nuclear and electronic energy levels and allows us to directly measure the frequency ratio of the $^{229}$Th nuclear clock transition and the $^{87}$Sr atomic clock. We also precisely measure the nuclear quadrupole splittings and extract intrinsic properties of the isomer. These results mark the start of nuclear-based solid-state optical clock and demonstrate the first comparison of nuclear and atomic clocks for fundamental physics studies. This work represents a confluence of precision metrology, ultrafast strong field physics, nuclear physics, and fundamental physics., Comment: 22 pages, 5 figures, 1 extended data figure
Published: 2024
Full Text: View/download PDF

38. Federating to Grow Transformers with Constrained Resources without Model Sharing

Author: Shen, Shikun, Zou, Yifei, Yuan, Yuan, Zheng, Yanwei, Li, Peng, Cheng, Xiuzhen, and Yu, Dongxiao
Subjects: Computer Science - Artificial Intelligence
Abstract: The high resource consumption of large-scale models discourages resource-constrained users from developing their customized transformers. To this end, this paper considers a federated framework named Fed-Grow for multiple participants to cooperatively scale a transformer from their pre-trained small models. Under the Fed-Grow, a Dual-LiGO (Dual Linear Growth Operator) architecture is designed to help participants expand their pre-trained small models to a transformer. In Dual-LiGO, the Local-LiGO part is used to address the heterogeneity problem caused by the various pre-trained models, and the Global-LiGO part is shared to exchange the implicit knowledge from the pre-trained models, local data, and training process of participants. Instead of model sharing, only sharing the Global-LiGO strengthens the privacy of our approach. Compared with several state-of-the-art methods in simulation, our approach has higher accuracy, better precision, and lower resource consumption on computations and communications. To the best of our knowledge, most of the previous model-scaling works are centralized, and our work is the first one that cooperatively grows a transformer from multiple pre-trained heterogeneous models with the user privacy protected in terms of local data and models. We hope that our approach can extend the transformers to the broadly distributed scenarios and encourage more resource-constrained users to enjoy the bonus taken by the large-scale transformers.
Published: 2024

39. A Resource-Adaptive Approach for Federated Learning under Resource-Constrained Environments

Author: Zhang, Ruirui, Wu, Xingze, Zou, Yifei, Xie, Zhenzhen, Li, Peng, Cheng, Xiuzhen, and Yu, Dongxiao
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: The paper studies a fundamental federated learning (FL) problem involving multiple clients with heterogeneous constrained resources. Compared with the numerous training parameters, the computing and communication resources of clients are insufficient for fast local training and real-time knowledge sharing. Besides, training on clients with heterogeneous resources may result in the straggler problem. To address these issues, we propose Fed-RAA: a Resource-Adaptive Asynchronous Federated learning algorithm. Different from vanilla FL methods, where all parameters are trained by each participating client regardless of resource diversity, Fed-RAA adaptively allocates fragments of the global model to clients based on their computing and communication capabilities. Each client then individually trains its assigned model fragment and asynchronously uploads the updated result. Theoretical analysis confirms the convergence of our approach. Additionally, we design an online greedy-based algorithm for fragment allocation in Fed-RAA, achieving fairness comparable to an offline strategy. We present numerical results on MNIST, CIFAR-10, and CIFAR-100, along with necessary comparisons and ablation studies, demonstrating the advantages of our work. To the best of our knowledge, this paper represents the first resource-adaptive asynchronous method for fragment-based FL with guaranteed theoretical convergence.
Published: 2024

40. FuseGen: PLM Fusion for Data-generation based Zero-shot Learning

Author: Zou, Tianyuan, Liu, Yang, Li, Peng, Zhang, Jianqing, Liu, Jingjing, and Zhang, Ya-Qin
Subjects: Computer Science - Computation and Language
Abstract: Data generation-based zero-shot learning, although effective in training Small Task-specific Models (STMs) via synthetic datasets generated by Pre-trained Language Models (PLMs), is often limited by the low quality of such synthetic datasets. Previous solutions have primarily focused on single PLM settings, where synthetic datasets are typically restricted to specific sub-spaces and often deviate from real-world distributions, leading to severe distribution bias. To mitigate such bias, we propose FuseGen, a novel data generation-based zero-shot learning framework that introduces a new criteria for subset selection from synthetic datasets via utilizing multiple PLMs and trained STMs. The chosen subset provides in-context feedback to each PLM, enhancing dataset quality through iterative data generation. Trained STMs are then used for sample re-weighting as well, further improving data quality. Extensive experiments across diverse tasks demonstrate that FuseGen substantially outperforms existing methods, highly effective in boosting STM performance in a PLM-agnostic way. Code is provided in https://github.com/LindaLydia/FuseGen., Comment: 17 pages, 8 figures, 12 tabels
Published: 2024

41. M-LRM: Multi-view Large Reconstruction Model

Author: Li, Mengfei, Long, Xiaoxiao, Liang, Yixun, Li, Weiyu, Liu, Yuan, Li, Peng, Chi, Xiaowei, Qi, Xingqun, Xue, Wei, Luo, Wenhan, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite recent advancements in the Large Reconstruction Model (LRM) demonstrating impressive results, when extending its input from single image to multiple images, it exhibits inefficiencies, subpar geometric and texture quality, as well as slower convergence speed than expected. It is attributed to that, LRM formulates 3D reconstruction as a naive images-to-3D translation problem, ignoring the strong 3D coherence among the input images. In this paper, we propose a Multi-view Large Reconstruction Model (M-LRM) designed to efficiently reconstruct high-quality 3D shapes from multi-views in a 3D-aware manner. Specifically, we introduce a multi-view consistent cross-attention scheme to enable M-LRM to accurately query information from the input images. Moreover, we employ the 3D priors of the input multi-view images to initialize the tri-plane tokens. Compared to LRM, the proposed M-LRM can produce a tri-plane NeRF with $128 \times 128$ resolution and generate 3D shapes of high fidelity. Experimental studies demonstrate that our model achieves a significant performance gain and faster training convergence than LRM. Project page: https://murphylmf.github.io/M-LRM/
Published: 2024

42. Differentiation of Multi-objective Data-driven Decision Pipeline

Author: Li, Peng, Wu, Lixia, Feng, Chaoqun, Hu, Haoyuan, Fu, Lei, and Ye, Jieping
Subjects: Computer Science - Machine Learning
Abstract: Real-world scenarios frequently involve multi-objective data-driven optimization problems, characterized by unknown problem coefficients and multiple conflicting objectives. Traditional two-stage methods independently apply a machine learning model to estimate problem coefficients, followed by invoking a solver to tackle the predicted optimization problem. The independent use of optimization solvers and prediction models may lead to suboptimal performance due to mismatches between their objectives. Recent efforts have focused on end-to-end training of predictive models that use decision loss derived from the downstream optimization problem. However, these methods have primarily focused on single-objective optimization problems, thus limiting their applicability. We aim to propose a multi-objective decision-focused approach to address this gap. In order to better align with the inherent properties of multi-objective optimization problems, we propose a set of novel loss functions. These loss functions are designed to capture the discrepancies between predicted and true decision problems, considering solution space, objective space, and decision quality, named landscape loss, Pareto set loss, and decision loss, respectively. Our experimental results demonstrate that our proposed method significantly outperforms traditional two-stage methods and most current decision-focused methods.
Published: 2024

43. CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild

Author: Qi, Xingqun, Zhang, Hengyuan, Wang, Yatian, Pan, Jiahao, Liu, Chen, Li, Peng, Chi, Xiaowei, Li, Mengfei, Zhang, Qixun, Xue, Wei, Zhang, Shanghang, Luo, Wenhan, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deriving co-speech 3D gestures has seen tremendous progress in virtual avatar animation. Yet, the existing methods often produce stiff and unreasonable gestures with unseen human speech inputs due to the limited 3D speech-gesture data. In this paper, we propose CoCoGesture, a novel framework enabling vivid and diverse gesture synthesis from unseen human speech prompts. Our key insight is built upon the custom-designed pretrain-fintune training paradigm. At the pretraining stage, we aim to formulate a large generalizable gesture diffusion model by learning the abundant postures manifold. Therefore, to alleviate the scarcity of 3D data, we first construct a large-scale co-speech 3D gesture dataset containing more than 40M meshed posture instances across 4.3K speakers, dubbed GES-X. Then, we scale up the large unconditional diffusion model to 1B parameters and pre-train it to be our gesture experts. At the finetune stage, we present the audio ControlNet that incorporates the human voice as condition prompts to guide the gesture generation. Here, we construct the audio ControlNet through a trainable copy of our pre-trained diffusion model. Moreover, we design a novel Mixture-of-Gesture-Experts (MoGE) block to adaptively fuse the audio embedding from the human speech and the gesture features from the pre-trained gesture experts with a routing mechanism. Such an effective manner ensures audio embedding is temporal coordinated with motion features while preserving the vivid and diverse gesture generation. Extensive experiments demonstrate that our proposed CoCoGesture outperforms the state-of-the-art methods on the zero-shot speech-to-gesture generation. The dataset will be publicly available at: https://mattie-e.github.io/GES-X/, Comment: The dataset will be released as soon as possible
Published: 2024

44. Restricting Voltage Deviation of DC Microgrids with Critical and Ordinary Nodes

Author: Bai, Handong, Li, Peng, and Zhang, Hongwei
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Restricting bus voltage deviation is crucial for normal operation of multi-bus DC microgrids, yet it has received insufficient attention due to the conflict between two main control objectives in DC microgrids, i.e., voltage regulation and current sharing. By revealing a necessary and sufficient condition for achieving these two objectives, this paper proposes a compromised distributed control algorithm, which regulates the voltage deviation of all buses by relaxing the accuracy of current sharing. Moreover, for a class of DC Microgrids consisting of both critical nodes and ordinary nodes, this paper proposes a distributed control algorithm that restricts the voltage deviation of critical nodes and simultaneously keeps the current sharing of ordinary nodes. This algorithm also works under plug-and-play settings. Simulations illustrate our theory.
Published: 2024

45. Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention

Author: Li, Peng, Liu, Yuan, Long, Xiaoxiao, Zhang, Feihu, Lin, Cheng, Li, Mengfei, Qi, Xingqun, Zhang, Shanghang, Luo, Wenhan, Tan, Ping, Wang, Wenping, Liu, Qifeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we introduce Era3D, a novel multiview diffusion method that generates high-resolution multiview images from a single-view image. Despite significant advancements in multiview generation, existing methods still suffer from camera prior mismatch, inefficacy, and low resolution, resulting in poor-quality multiview images. Specifically, these methods assume that the input images should comply with a predefined camera type, e.g. a perspective camera with a fixed focal length, leading to distorted shapes when the assumption fails. Moreover, the full-image or dense multiview attention they employ leads to an exponential explosion of computational complexity as image resolution increases, resulting in prohibitively expensive training costs. To bridge the gap between assumption and reality, Era3D first proposes a diffusion-based camera prediction module to estimate the focal length and elevation of the input image, which allows our method to generate images without shape distortions. Furthermore, a simple but efficient attention layer, named row-wise attention, is used to enforce epipolar priors in the multiview diffusion, facilitating efficient cross-view information fusion. Consequently, compared with state-of-the-art methods, Era3D generates high-quality multiview images with up to a 512*512 resolution while reducing computation complexity by 12x times. Comprehensive experiments demonstrate that Era3D can reconstruct high-quality and detailed 3D meshes from diverse single-view input images, significantly outperforming baseline multiview diffusion methods. Project page: https://penghtyx.github.io/Era3D/.
Published: 2024

46. Reliable Interval Prediction of Minimum Operating Voltage Based on On-chip Monitors via Conformalized Quantile Regression

Author: Yin, Yuxuan, Wang, Xiaoxiao, Chen, Rebecca, He, Chen, and Li, Peng
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Artificial Intelligence, Computer Science - Hardware Architecture
Abstract: Predicting the minimum operating voltage ($V_{min}$) of chips is one of the important techniques for improving the manufacturing testing flow, as well as ensuring the long-term reliability and safety of in-field systems. Current $V_{min}$ prediction methods often provide only point estimates, necessitating additional techniques for constructing prediction confidence intervals to cover uncertainties caused by different sources of variations. While some existing techniques offer region predictions, but they rely on certain distributional assumptions and/or provide no coverage guarantees. In response to these limitations, we propose a novel distribution-free $V_{min}$ interval estimation methodology possessing a theoretical guarantee of coverage. Our approach leverages conformalized quantile regression and on-chip monitors to generate reliable prediction intervals. We demonstrate the effectiveness of the proposed method on an industrial 5nm automotive chip dataset. Moreover, we show that the use of on-chip monitors can reduce the interval length significantly for $V_{min}$ prediction., Comment: Accepted by DATE 2024. Camera-ready version
Published: 2024

47. Skyrmion-mechanical hybrid quantum systems: Manipulation of skyrmion qubits via phonons

Author: Pan, Xue-Feng, Hei, Xin-Lei, Yao, Xiao-Yu, Chen, Jia-Qiang, Ren, Yu-Meng, Dong, Xing-Liang, Qiao, Yi-Fan, and Li, Peng-Bo
Subjects: Quantum Physics, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Skyrmion qubits are a new highly promising logic element for quantum information processing. However, their scalability to multiple interacting qubits remains challenging. We propose a hybrid quantum setup with skyrmion qubits strongly coupled to nanomechanical cantilevers via magnetic coupling, which harnesses phonons as quantum interfaces for the manipulation of distant skyrmion qubits. A linear drive is utilized to achieve the modulation of the stiffness coefficient of the cantilever, resulting in an exponential enhancement of the coupling strength between the skyrmion qubit and the mechanical mode. We also consider the case of a topological resonator array, which allows us to study interactions between skyrmion qubits and topological phonon band structure, as well as chiral skyrmion-skyrmion interactions. The scheme suggested here offers a fascinating platform for investigating quantum information processing and quantum simulation with magnetic microstructures., Comment: To appear in PR Research, 16 pages, 9 figures
Published: 2024

48. Magnon-Skyrmion Hybrid Quantum Systems: Tailoring Interactions via Magnons

Author: Pan, Xue-Feng, Li, Peng-Bo, Hei, Xin-Lei, Zhang, Xichao, Mochizuki, Masahito, Li, Fu-Li, and Nori, Franco
Subjects: Quantum Physics, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Coherent and dissipative interactions between different quantum systems are essential for the construction of hybrid quantum systems and the investigation of novel quantum phenomena. Here, we propose and analyze a magnon-skyrmion hybrid quantum system, consisting of a micromagnet and nearby magnetic skyrmions. We predict a strong coupling mechanism between the magnonic mode of the micromagnet and the quantized helicity degree of freedom of the skyrmion. We show that with this hybrid setup it is possible to induce magnon-mediated nonreciprocal interactions and responses between distant skyrmion qubits or between skyrmion qubits and other quantum systems like superconducting qubits. This work provides a quantum platform for the investigation of diverse quantum effects and quantum information processing with magnetic microstructures., Comment: To appear in PRL, 9 pages, 4 figures
Published: 2024
Full Text: View/download PDF

49. Tangram: High-resolution Video Analytics on Serverless Platform with SLO-aware Batching

Author: Peng, Haosong, Zhan, Yufeng, Li, Peng, and Xia, Yuanqing
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Cloud-edge collaborative computing paradigm is a promising solution to high-resolution video analytics systems. The key lies in reducing redundant data and managing fluctuating inference workloads effectively. Previous work has focused on extracting regions of interest (RoIs) from videos and transmitting them to the cloud for processing. However, a naive Infrastructure as a Service (IaaS) resource configuration falls short in handling highly fluctuating workloads, leading to violations of Service Level Objectives (SLOs) and inefficient resource utilization. Besides, these methods neglect the potential benefits of RoIs batching to leverage parallel processing. In this work, we introduce Tangram, an efficient serverless cloud-edge video analytics system fully optimized for both communication and computation. Tangram adaptively aligns the RoIs into patches and transmits them to the scheduler in the cloud. The system employs a unique ``stitching'' method to batch the patches with various sizes from the edge cameras. Additionally, we develop an online SLO-aware batching algorithm that judiciously determines the optimal invoking time of the serverless function. Experiments on our prototype reveal that Tangram can reduce bandwidth consumption and computation cost up to 74.30\% and 66.35\%, respectively, while maintaining SLO violations within 5\% and the accuracy loss negligible., Comment: Accepted by IEEE International Conference on Distributed Computing Systems (ICDCS) 2024
Published: 2024
Full Text: View/download PDF

50. Drug-target interaction prediction by integrating heterogeneous information with mutual attention network

Author: Zhang, Yuanyuan, Wang, Yingdong, Wu, Chaoyong, Zhana, Lingmin, Wang, Aoyi, Cheng, Caiping, Zhao, Jinzhong, Zhang, Wuxia, Chen, Jianxin, and Li, Peng
Subjects: Quantitative Biology - Quantitative Methods
Abstract: Identification of drug-target interactions is an indispensable part of drug discovery. While conventional shallow machine learning and recent deep learning methods based on chemogenomic properties of drugs and target proteins have pushed this prediction performance improvement to a new level, these methods are still difficult to adapt to novel structures. Alternatively, large-scale biological and pharmacological data provide new ways to accelerate drug-target interaction prediction. Here, we propose DrugMAN, a deep learning model for predicting drug-target interaction by integrating multiplex heterogeneous functional networks with a mutual attention network (MAN). DrugMAN uses a graph attention network-based integration algorithm to learn network-specific low-dimensional features for drugs and target proteins by integrating four drug networks and seven gene/protein networks, respectively. DrugMAN then captures interaction information between drug and target representations by a mutual attention network to improve drug-target prediction. DrugMAN achieves the best prediction performance under four different scenarios, especially in real-world scenarios. DrugMAN spotlights heterogeneous information to mine drug-target interactions and can be a powerful tool for drug discovery and drug repurposing.
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

54,399 results on '"Li, Peng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources