Author: "Li, Hai" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

1. Towards Automated Model Design on Recommender Systems

Author: Zhang, Tunhou, Cheng, Dehua, He, Yuchen, Chen, Zhengxing, Dai, Xiaoliang, Xiong, Liang, Liu, Yudong, Cheng, Feng, Cao, Yufan, Yan, Feng, Li, Hai, Chen, Yiran, and Wen, Wei
Subjects: Computer Science - Information Retrieval
Abstract: The increasing popularity of deep learning models has created new opportunities for developing AI-based recommender systems. Designing recommender systems using deep neural networks requires careful architecture design, and further optimization demands extensive co-design efforts on jointly optimizing model architecture and hardware. Design automation, such as Automated Machine Learning (AutoML), is necessary to fully exploit the potential of recommender model design, including model choices and model-hardware co-design strategies. We introduce a novel paradigm that utilizes weight sharing to explore abundant solution spaces. Our paradigm creates a large supernet to search for optimal architectures and co-design strategies to address the challenges of data multi-modality and heterogeneity in the recommendation domain. From a model perspective, the supernet includes a variety of operators, dense connectivity, and dimension search options. From a co-design perspective, it encompasses versatile Processing-In-Memory (PIM) configurations to produce hardware-efficient models. Our solution space's scale, heterogeneity, and complexity pose several challenges, which we address by proposing various techniques for training and evaluating the supernet. Our crafted models show promising results on three Click-Through Rates (CTR) prediction benchmarks, outperforming both manually designed and AutoML-crafted models with state-of-the-art performance when focusing solely on architecture search. From a co-design perspective, we achieve 2x FLOPs efficiency, 1.8x energy efficiency, and 1.5x performance improvements in recommender models., Comment: Accepted in ACM Transactions on Recommender Systems. arXiv admin note: substantial text overlap with arXiv:2207.07187
Published: 2024

2. Subtraction of the $t\bar{t}$ contribution in $tW\bar{b}$ production at the one-loop level

Author: Dong, Liang, Li, Hai Tao, Li, Zheng-Yu, and Wang, Jian
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment
Abstract: The $tW\bar{b}$ production contributes to the real corrections to the $tW$ cross section. It would interfere with the top quark pair production, causing difficulties in a clear definition of the $tW{\bar b}$ events. The subtraction of the $t\bar{t}$ contributions has been performed in the diagram removal or diagram subtraction schemes for the tree-level processes. However, these schemes rely on the ability to identify the double resonant diagrams and thus can not be extended to loop diagrams. We propose a new scheme to subtract the $t\bar{t}$ contributions by power expansion of the squared amplitude in the resonant region. In order to cancel the infra-red divergences of the loop amplitudes, a widely used method is to introduce the dipole counter-terms, an ingredient in calculations of the full next-to-leading order QCD corrections. In our scheme, these counter-terms are also power-expanded. As a proof of principle, we calculate the one-loop correction to the $d\bar{d}\to \bar{b}Wt$ process, and present the invariant mass distribution of the $W\bar{b}$ system., Comment: 21 pages, 8 figures
Published: 2024

3. Poor Man's Training on MCUs: A Memory-Efficient Quantized Back-Propagation-Free Approach

Author: Zhao, Yequan, Li, Hai, Young, Ian, and Zhang, Zheng
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Neural and Evolutionary Computing, I.2, C.3
Abstract: Back propagation (BP) is the default solution for gradient computation in neural network training. However, implementing BP-based training on various edge devices such as FPGA, microcontrollers (MCUs), and analog computing platforms face multiple major challenges, such as the lack of hardware resources, long time-to-market, and dramatic errors in a low-precision setting. This paper presents a simple BP-free training scheme on an MCU, which makes edge training hardware design as easy as inference hardware design. We adopt a quantized zeroth-order method to estimate the gradients of quantized model parameters, which can overcome the error of a straight-through estimator in a low-precision BP scheme. We further employ a few dimension reduction methods (e.g., node perturbation, sparse training) to improve the convergence of zeroth-order training. Experiment results show that our BP-free training achieves comparable performance as BP-based training on adapting a pre-trained image classifier to various corrupted data on resource-constrained edge devices (e.g., an MCU with 1024-KB SRAM for dense full-model training, or an MCU with 256-KB SRAM for sparse training). This method is most suitable for application scenarios where memory cost and time-to-market are the major concerns, but longer latency can be tolerated.
Published: 2024

4. qGDP: Quantum Legalization and Detailed Placement for Superconducting Quantum Computers

Author: Zhang, Junyao, Zhou, Guanglei, Cheng, Feng, Ku, Jonathan, Ding, Qi, Gu, Jiaqi, Wang, Hanrui, Li, Hai "Helen", and Chen, Yiran
Subjects: Quantum Physics, Electrical Engineering and Systems Science - Systems and Control
Abstract: Noisy Intermediate-Scale Quantum (NISQ) computers are currently limited by their qubit numbers, which hampers progress towards fault-tolerant quantum computing. A major challenge in scaling these systems is crosstalk, which arises from unwanted interactions among neighboring components such as qubits and resonators. An innovative placement strategy tailored for superconducting quantum computers can systematically address crosstalk within the constraints of limited substrate areas. Legalization is a crucial stage in placement process, refining post-global-placement configurations to satisfy design constraints and enhance layout quality. However, existing legalizers are not supported to legalize quantum placements. We aim to address this gap with qGDP, developed to meticulously legalize quantum components by adhering to quantum spatial constraints and reducing resonator crossing to alleviate various crosstalk effects. Our results indicate that qGDP effectively legalizes and fine-tunes the layout, addressing the quantum-specific spatial constraints inherent in various device topologies. By evaluating diverse NISQ benchmarks. qGDP consistently outperforms state-of-the-art legalization engines, delivering substantial improvements in fidelity and reducing spatial violation, with average gains of 34.4x and 16.9x, respectively.
Published: 2024

5. Conceptual Design of the Muonium-to-Antimuonium Conversion Experiment (MACE)

Author: Bai, Ai-Yu, Cai, Hanjie, Chen, Chang-Lin, Chen, Siyuan, Chen, Xurong, Chen, Yu, Cheng, Weibin, Dai, Ling-Yun, Fan, Rui-Rui, Gong, Li, Guo, Zihao, He, Yuan, Hou, Zhilong, Huang, Yinyuan, Jia, Huan, Jiang, Hao, Jing, Han-Tao, Kang, Xiaoshen, Li, Hai-Bo, Li, Jincheng, Li, Yang, Liu, Shulin, Lu, Guihao, Miao, Han, Ning, Yunsong, Niu, Jianwei, Peng, Huaxing, Petrov, Alexey A., Qin, Yuanshuai, Sun, Mingchen, Tang, Jian, Tang, Jing-Yu, Tian, Ye, Wang, Rong, Wang, Xiaodong, Wang, Zhichao, Wu, Chen, Xing, Tian-Yu, Xiong, Weizhi, Xu, Yu, Yan, Baojun, Yao, De-Liang, Yu, Tao, Yuan, Ye, Yuan, Yi, Zhang, Yao, Zhang, Yongchao, Zhang, Zhilv, Zhao, Guang, and Zhao, Shihan
Subjects: High Energy Physics - Experiment, High Energy Physics - Phenomenology, Physics - Accelerator Physics, Physics - Instrumentation and Detectors
Abstract: The spontaneous conversion of muonium to antimuonium is one of the interesting charged lepton flavor violation phenomena, offering a sensitive probe of potential new physics and serving as a tool to constrain the parameter space beyond the Standard Model. Utilizing a high-intensity muon beam, a Michel electron magnetic spectrometer and a positron transport solenoid together with a positron detection system, MACE aims to discover or constrain this rare process at the conversion probability beyond the level of $10^{-13}$. This report provides an overview of the theoretical framework and detailed experimental design in the search for the muonium-to-antimuonium conversion., Comment: 115 pages, 59 figures
Published: 2024

6. Axion effects on gamma-ray spectral irregularities. II: EBL absorption models

Author: Li, Hai-Jun, Chao, Wei, Tan, Xiu-Hui, and Zhou, Yu-Feng
Subjects: High Energy Physics - Phenomenology, Astrophysics - High Energy Astrophysical Phenomena
Abstract: In this study, we explore how the extragalactic background light (EBL) absorption effect influences the photon to axionlike particle (ALP) conversions from the very-high-energy gamma-ray spectral irregularities. For our analysis, we select two well-known BL Lac blazars: Markarian 421 and Markarian 501 with their low and well-defined redshifts $z_0=0.031$ and 0.034, respectively. Their gamma-ray data are recently measured by Fermi-LAT and HAWC with the 1038 days of exposure from 2015 June to 2018 July. We first discuss the EBL absorption effect on the gamma-ray spectral energy distributions by using three common EBL spectral models: Franceschini-08, Finke-10, and Gilmore-12. Then we consider the photon-ALP conversions in the astrophysical magnetic fields. Under the ALP assumption with the parameter space of $\{m_a, g_{a\gamma}\}$, we calculate the best-fit chi-square distribution of the EBL models and define a new delta chi-square $\chi_d^2$ to quantify the chi-square difference. Our results show that the impact from these different EBL spectral models are non-dominated at the low-redshift gamma-ray axionscope., Comment: 17 pages, 5 figures
Published: 2024

7. High Mobility SiGe/Ge 2DHG Heterostructure Quantum Wells for Semiconductor Hole Spin Qubits

Author: Kong, Zhenzhen, Li, Zonghu, Zhou, Yuchen, Cao, Gang, Li, Hai-Ou, Su, Jiale, Zhang, Yiwen, Liu, Jinbiao, Guo, Guo-Ping, Li, Junfeng, Luo, Jun, Zhao, Chao, Ye, Tianchun, and Wang, Guilei
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Materials Science
Abstract: Strong spin-orbit coupling and relatively weak hyperfine interactions make germanium hole spin qubits a promising candidate for semiconductor quantum processors. The two-dimensional hole gas structure of strained Ge quantum wells serves as the primary material platform for spin hole qubits.A low disorder material environment is essential for this process. In this work, we fabricated a Ge/SiGe heterojunction with a 60 nm buried quantum well layer on a Si substrate using reduced pressure chemical vapor deposition technology. At a temperature of 16 mK, when the carrier density is 1.87*10^11/cm2, we obtained a mobility as high as 308.64*10^4cm2/Vs. Concurrently, double quantum dot and planar germanium coupling with microwave cavities were also successfully achieved.This fully demonstrates that this structure can be used for the preparation of higher-performance hole spin qubits.
Published: 2024

8. On the temperature effects in QCD axion mass mixing

Author: Li, Hai-Jun
Subjects: High Energy Physics - Phenomenology, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: In this work, we extend the QCD axion mass mixing in the early Universe and investigate the temperature effects in the mixing. We explore the scenario where two $Z_{\mathcal N}$ QCD axions undergo mass mixing during the QCD phase transition, yielding three distinct mixing scenarios: mixing I, II, and III. These scenarios are realized through fine-tuning of the axion decay constants, the temperature parameters, as well as the value of $\mathcal N$. We conduct a thorough analysis of the level crossing phenomena in these three mixing scenarios, detailing the conditions under which they occur. Notably, in the mixing I and II, the level crossing precedes the critical temperature of the QCD phase transition ($T_{\rm QCD}$), with minimal non-essential discrepancies in the cosmological evolution of the mass eigenvalues at $T_{\rm QCD}$. In contrast, the mixing III exhibits a unique double level crossings, occurring both before and at $T_{\rm QCD}$. Despite superficial similarities in axion evolution between the mixing II and III, we uncover fundamental differences between them. Additionally, we briefly address the transition in energy density between the two axions within our mixing scenarios. This work contributes to a deeper understanding of the role of the QCD axion in the early Universe and its potential implications for dark matter., Comment: 16 pages, 7 figures
Published: 2024

9. Determination of the $K^{+}\bar{K}^{0}$ scattering length and effective range and its relation to the $a_{0}^{+}(980)$ from the $\chi_{c1}\to\pi^{+}\pi^{-}\eta$ reaction

Author: Li, Hai-Peng, Lin, Jia-Xin, Liang, Wei-Hong, and Oset, Eulogio
Subjects: High Energy Physics - Phenomenology
Abstract: We analyze the clean cusp, seen in the $\eta \pi$ mass distribution with high precision of the $\chi_{c1} \to \eta \pi^+ \pi^-$ reaction in the BESIII experiment, with the aim of making a precise determination of the scattering length $a$ and effective range $r_0$ of $K^+ \bar{K}^{0}$. For that, we follow a previous theoretical work that gave a good reproduction of these data using the chiral unitary approach for the meson-meson interaction, and allow some flexibility in the input to carry a better fit to the data. The important task of determining the uncertainties in the scattering parameters is done using the resampling method and an accuracy in $a$ and $r_0$ is obtained better than $20\%$. The effective range is determined for the first time with this analysis., Comment: 7 pages, 4 figures, 3 tables; V2: New fit with background included
Published: 2024

10. SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality

Author: Zhai, Hongjia, Zhang, Xiyu, Zhao, Boming, Li, Hai, He, Yijia, Cui, Zhaopeng, Bao, Hujun, and Zhang, Guofeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Visual localization plays an important role in the applications of Augmented Reality (AR), which enable AR devices to obtain their 6-DoF pose in the pre-build map in order to render virtual content in real scenes. However, most existing approaches can not perform novel view rendering and require large storage capacities for maps. To overcome these limitations, we propose an efficient visual localization method capable of high-quality rendering with fewer parameters. Specifically, our approach leverages 3D Gaussian primitives as the scene representation. To ensure precise 2D-3D correspondences for pose estimation, we develop an unbiased 3D scene-specific descriptor decoder for Gaussian primitives, distilled from a constructed feature volume. Additionally, we introduce a salient 3D landmark selection algorithm that selects a suitable primitive subset based on the saliency score for localization. We further regularize key Gaussian primitives to prevent anisotropic effects, which also improves localization performance. Extensive experiments on two widely used datasets demonstrate that our method achieves superior or comparable rendering and localization performance to state-of-the-art implicit-based visual localization approaches. Project page: \href{https://zju3dv.github.io/splatloc}{https://zju3dv.github.io/splatloc}.
Published: 2024

11. Highly tunable 2D silicon quantum dot array with coupling beyond nearest neighbors

Author: Wang, Ning, Kang, Jia-Min, Lu, Wen-Long, Wang, Shao-Min, Wang, You-Jia, Li, Hai-Ou, Cao, Gang, Wang, Bao-Chuan, and Guo, Guo-Ping
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Scaling up quantum dots to two-dimensional (2D) arrays is a crucial step for advancing semiconductor quantum computation. However, maintaining excellent tunability of quantum dot parameters, including both nearest-neighbor and next-nearest-neighbor couplings, during 2D scaling is challenging, particularly for silicon quantum dots due to their relatively small size. Here, we present a highly controllable and interconnected 2D quantum dot array in planar silicon, demonstrating independent control over electron fillings and the tunnel couplings of nearest-neighbor dots. More importantly, we also demonstrate the wide tuning of tunnel couplings between next-nearest-neighbor dots,which plays a crucial role in 2D quantum dot arrays. This excellent tunability enables us to alter the coupling configuration of the array as needed. These results open up the possibility of utilizing silicon quantum dot arrays as versatile platforms for quantum computing and quantum simulation.
Published: 2024
Full Text: View/download PDF

12. Pursuing high-fidelity control of spin qubits in natural Si/SiGe quantum dot

Author: Wang, Ning, Wang, Shao-Min, Zhang, Run-Ze, Kang, Jia-Min, Lu, Wen-Long, Li, Hai-Ou, Cao, Gang, Wang, Bao-Chuan, and Guo, Guo-Ping
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Electron spin qubits in silicon are a promising platform for fault-tolerant quantum computing. Low-frequency noise, including nuclear spin fluctuations and charge noise, is a primary factor limiting gate fidelities. Suppressing this noise is crucial for high-fidelity qubit operations. Here, we report on a two-qubit quantum device in natural silicon with universal qubit control, designed to investigate the upper limits of gate fidelities in a non-purified Si/SiGe quantum dot device. By employing advanced device structures, qubit manipulation techniques, and optimization methods, we have achieved single-qubit gate fidelities exceeding 99% and a two-qubit Controlled-Z (CZ) gate fidelity of 91%. Decoupled CZ gates are used to prepare Bell states with a fidelity of 91%, typically exceeding previously reported values in natural silicon devices. These results underscore that even natural silicon has the potential to achieve high-fidelity gate operations, particularly with further optimization methods to suppress low-frequency noise.
Published: 2024
Full Text: View/download PDF

13. N$^{\mathbf{3}}$LL + $\mathcal{O}(\alpha_s^2)$ predictions of lepton-jet azimuthal angular distribution in deep-inelastic scattering

Author: Fang, Shen, Gao, Mei-Sen, Li, Hai Tao, and Shao, Ding Yu
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment, Nuclear Experiment, Nuclear Theory
Abstract: We present an analysis of lepton-jet azimuthal decorrelation in deep-inelastic scattering (DIS) at next-to-next-to-next-to-leading logarithmic (N$^{3}$LL) accuracy, combined with fixed-order corrections at $\mathcal{O}(\alpha_s^2)$. In this study, jets are defined in the lab frame using the anti-$k_T$ clustering algorithm and the winner-take-all recombination scheme. The N$^{3}$LL resummation results are derived from the transverse-momentum dependent factorization formula within the soft-collinear effective theory, while the $\mathcal{O}(\alpha_s^2)$ fixed-order matching distribution is calculated using the {\tt NLOJET++} event generator. The azimuthal decorrelation between the jet and electron serves as a critical probe of the three-dimensional structure of the nucleon. Our numerical predictions provide a robust framework for precision studies of QCD and the nucleon's internal structure through jet observables in DIS. These results are particularly significant for analyses involving jets in HERA data and the forthcoming electron-ion collider experiments., Comment: 27 pages, 5 figures
Published: 2024

14. FedProphet: Memory-Efficient Federated Adversarial Training via Theoretic-Robustness and Low-Inconsistency Cascade Learning

Author: Tang, Minxue, Wang, Yitu, Zhang, Jingyang, DiValentin, Louis, Ding, Aolin, Hass, Amin, Chen, Yiran, and Li, Hai "Helen"
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Federated Learning (FL) provides a strong privacy guarantee by enabling local training across edge devices without training data sharing, and Federated Adversarial Training (FAT) further enhances the robustness against adversarial examples, promoting a step toward trustworthy artificial intelligence. However, FAT requires a large model to preserve high accuracy while achieving strong robustness, and it is impractically slow when directly training with memory-constrained edge devices due to the memory-swapping latency. Moreover, existing memory-efficient FL methods suffer from poor accuracy and weak robustness in FAT because of inconsistent local and global models, i.e., objective inconsistency. In this paper, we propose FedProphet, a novel FAT framework that can achieve memory efficiency, adversarial robustness, and objective consistency simultaneously. FedProphet partitions the large model into small cascaded modules such that the memory-constrained devices can conduct adversarial training module-by-module. A strong convexity regularization is derived to theoretically guarantee the robustness of the whole model, and we show that the strong robustness implies low objective inconsistency in FedProphet. We also develop a training coordinator on the server of FL, with Adaptive Perturbation Adjustment for utility-robustness balance and Differentiated Module Assignment for objective inconsistency mitigation. FedProphet empirically shows a significant improvement in both accuracy and robustness compared to previous memory-efficient methods, achieving almost the same performance of end-to-end FAT with 80% memory reduction and up to 10.8x speedup in training time., Comment: Preprint
Published: 2024

15. MLLM-FL: Multimodal Large Language Model Assisted Federated Learning on Heterogeneous and Long-tailed Data

Author: Zhang, Jianyi, Yang, Hao Frank, Li, Ang, Guo, Xin, Wang, Pu, Wang, Haiming, Chen, Yiran, and Li, Hai
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Previous studies on federated learning (FL) often encounter performance degradation due to data heterogeneity among different clients. In light of the recent advances in multimodal large language models (MLLMs), such as GPT-4v and LLaVA, which demonstrate their exceptional proficiency in multimodal tasks, such as image captioning and multimodal question answering. We introduce a novel federated learning framework, named Multimodal Large Language Model Assisted Federated Learning (MLLM-FL), which which employs powerful MLLMs at the server end to address the heterogeneous and long-tailed challenges. Owing to the advanced cross-modality representation capabilities and the extensive open-vocabulary prior knowledge of MLLMs, our framework is adept at harnessing the extensive, yet previously underexploited, open-source data accessible from websites and powerful server-side computational resources. Hence, the MLLM-FL not only enhances the performance but also avoids increasing the risk of privacy leakage and the computational burden on local devices, distinguishing it from prior methodologies. Our framework has three key stages. Initially, prior to local training on local datasets of clients, we conduct global visual-text pretraining of the model. This pretraining is facilitated by utilizing the extensive open-source data available online, with the assistance of multimodal large language models. Subsequently, the pretrained model is distributed among various clients for local training. Finally, once the locally trained models are transmitted back to the server, a global alignment is carried out under the supervision of MLLMs to further enhance the performance. Experimental evaluations on established benchmarks, show that our framework delivers promising performance in the typical scenarios with data heterogeneity and long-tail distribution across different clients in FL.
Published: 2024

16. How to unravel the nature of the $\Sigma^*(1430) (1/2^-)$ state from correlation functions

Author: Li, Hai-Peng, Xiao, Chu-Wen, Liang, Wei-Hong, Wu, Jia-Jun, Wang, En, and Oset, Eulogio
Subjects: High Energy Physics - Phenomenology
Abstract: We calculate the correlation functions for the $\bar K^0 p, \pi^+ \Sigma^0, \pi^0 \Sigma^+, \pi^+ \Lambda$, and $\eta \Sigma^+$ states, which in the chiral unitary approach predict an excited $\Sigma^*(1/2^-)$ state at the $\bar K N$ threshold, recently observed by the Belle collaboration. Once this is done, we tackle the inverse problem of seeing how much information one can obtain from these correlation functions. With the resampling method, one can determine the scattering parameters of all the channels with relative precision by means of the analysis in a general framework, and find a clear cusp-like structure corresponding to the $\Sigma^*(1/2^-)$ in the different amplitudes at the $\bar{K}N$ threshold., Comment: 10 pages, 4 figures, 7 tables
Published: 2024

17. Maximizing orientation of a three-state molecule in a cavity with analytically designed pulses

Author: Fan, Li-Bao, Li, Hai-Ji, Chen, Qi, Zhou, Hang, Liu, Heng, and Shu, Chuan-Cun
Subjects: Quantum Physics
Abstract: We theoretically explore the precise control of a molecular polariton by strongly coupling the lowest three rotational states of a single molecule with a single-mode cavity. We examine two distinct cavity resonance configurations: a fundamental frequency cavity ($\omega_c = 2B$ with the rotational constant $B$) resonating with the lowest two rotational states, and a second harmonic cavity ($\omega_c = 4B$) coupling with the first and second excited rotational states. We propose two control schemes based on the two polariton configurations and derive the corresponding pulse-area theorems to achieve a theoretical maximum orientation of 0.7746, identical to the molecule in the absence of the cavity. The control schemes are analyzed in Carbonyl Sulfide (OCS) molecules in their ground rotational state. Our numerical simulation results demonstrate the theoretical control schemes and analyze the sensitivity of the molecular polariton orientation degree to the control field bandwidth and phases. This work provides a valuable reference for achieving maximum field-free orientation of ultracold three-state molecules in a cavity using analytically designed pulses., Comment: 22 pages, 7 figures
Published: 2024

18. Investigating Material Interface Diffusion Phenomena through Graph Neural Networks in Applied Materials

Author: Zhao, Zirui and Li, Hai-Feng
Subjects: Condensed Matter - Materials Science, Physics - Computational Physics
Abstract: Understanding and predicting interface diffusion phenomena in materials is crucial for various industrial applications, including semiconductor manufacturing, battery technology, and catalysis. In this study, we propose a novel approach utilizing Graph Neural Networks (GNNs) to investigate and model material interface diffusion. We begin by collecting experimental and simulated data on diffusion coefficients, concentration gradients, and other relevant parameters from diverse material systems. The data are preprocessed, and key features influencing interface diffusion are extracted. Subsequently, we construct a GNN model tailored to the diffusion problem, with a graph representation capturing the atomic structure of materials. The model architecture includes multiple graph convolutional layers for feature aggregation and update, as well as optional graph attention layers to capture complex relationships between atoms. We train and validate the GNN model using the preprocessed data, achieving accurate predictions of diffusion coefficients, diffusion rates, concentration profiles, and potential diffusion pathways. Our approach offers insights into the underlying mechanisms of interface diffusion and provides a valuable tool for optimizing material design and engineering. Additionally, our method offers possible strategies to solve the longstanding problems related to materials interface diffusion.
Published: 2024

19. Deep learning-driven evaluation and prediction of ion-doped NASICON materials for enhanced solid-state battery performance

Author: Zhao, Zirui, Wang, Xiaoke, Wu, Si, Zhou, Pengfei, Zhao, Qian, Xu, Guanping, Sun, Kaitong, and Li, Hai-Feng
Subjects: Condensed Matter - Materials Science, J.2, I.2.8
Abstract: We developed a convolutional neural network (CNN) model capable of predicting the performance of various ion-doped NASICON compounds by leveraging extensive datasets from prior experimental investigation.The model demonstrated high accuracy and efficiency in predicting ionic conductivity and electrochemical properties. Key findings include the successful synthesis and validation of three NASICON materials predicted by the model, with experimental results closely matching the model predictions. This research not only enhances the understanding of ion-doping effects in NASICON materials but also establishes a robust framework for material design and practical applications. It bridges the gap between theoretical predictions and experimental validations.
Published: 2024

20. Mass mixing between QCD axions

Author: Li, Hai-Jun and Zhou, Yu-Feng
Subjects: High Energy Physics - Phenomenology, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: We introduce a novel level crossing in the mass mixing between two QCD axions, one canonical QCD axion and one $Z_{\mathcal N}$ QCD axion. The level crossing can take place at the QCD phase transition critical temperature or slightly before it, depending on the ratio of the axion decay constants $\sim1.69$. The cosmological evolution of the mass eigenvalues in these two cases is similar, however, the transition of axion energy density is completely different. Finally, we estimate the relic density of the QCD axion dark matter. This level crossing may also have some cosmological implications., Comment: 6 pages, 4 figures
Published: 2024

21. Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey

Author: Miyai, Atsuyuki, Yang, Jingkang, Zhang, Jingyang, Ming, Yifei, Lin, Yueqian, Yu, Qing, Irie, Go, Joty, Shafiq, Li, Yixuan, Li, Hai, Liu, Ziwei, Yamasaki, Toshihiko, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine learning systems and has shaped the field of OOD detection. Meanwhile, several other problems are closely related to OOD detection, including anomaly detection (AD), novelty detection (ND), open set recognition (OSR), and outlier detection (OD). To unify these problems, a generalized OOD detection framework was proposed, taxonomically categorizing these five problems. However, Vision Language Models (VLMs) such as CLIP have significantly changed the paradigm and blurred the boundaries between these fields, again confusing researchers. In this survey, we first present a generalized OOD detection v2, encapsulating the evolution of AD, ND, OSR, OOD detection, and OD in the VLM era. Our framework reveals that, with some field inactivity and integration, the demanding challenges have become OOD detection and AD. In addition, we also highlight the significant shift in the definition, problem settings, and benchmarks; we thus feature a comprehensive review of the methodology for OOD detection, including the discussion over other related tasks to clarify their relationship to OOD detection. Finally, we explore the advancements in the emerging Large Vision Language Model (LVLM) era, such as GPT-4V. We conclude this survey with open challenges and future directions., Comment: survey paper. We welcome questions, issues, and paper requests via https://github.com/AtsuMiyai/Awesome-OOD-VLM
Published: 2024

22. Constraints on Interacting Dark Energy Models from the DESI Baryon Acoustic Oscillation and DES Supernovae Data

Author: Li, Tian-Nuo, Wu, Peng-Ju, Du, Guo-Hong, Jin, Shang-Jie, Li, Hai-Li, Zhang, Jing-Fei, and Zhang, Xin
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, General Relativity and Quantum Cosmology, High Energy Physics - Phenomenology
Abstract: The recent results from the first year baryon acoustic oscillations (BAO) data released by the Dark Energy Spectroscopic Instrument (DESI), combined with cosmic microwave background (CMB) and type Ia supernova (SN) data, have shown a detection of significant deviation from a cosmological constant for dark energy. In this work, we utilize the latest DESI BAO data in combination with the SN data from the full five-year observations of the Dark Energy Survey and the CMB data from the Planck satellite to explore potential interactions between dark energy and dark matter. We consider four typical forms of the interaction term $Q$. Our findings suggest that interacting dark energy (IDE) models with $Q \propto \rho_{\rm de}$ support the presence of an interaction where dark energy decays into dark matter. Specifically, the deviation from $\Lambda$CDM for the IDE model with $Q=\beta H_0\rho_{\rm de}$ reaches the $3\sigma$ level. These models yield a lower value of Akaike information criterion than the $\Lambda$CDM model, indicating a preference for these IDE models based on the current observational data. For IDE models with $Q\propto\rho_{\rm c}$, the existence of interaction depends on the form of the proportionality coefficient $\Gamma$. The IDE model with $Q=\beta H\rho_{\rm c}$ yields $\beta=0.0003\pm 0.0011$, which essentially does not support the presence of the interaction. In general, whether the observational data support the existence of interaction is closely related to the model. Our analysis helps to elucidate which type of IDE model can better explain the current observational data., Comment: 11 pages, 4 figures; published in ApJ
Published: 2024
Full Text: View/download PDF

23. Improved constraint on Higgs boson self-couplings with quartic and cubic power dependence in the cross section

Author: Li, Hai Tao, Si, Zong-Guo, Wang, Jian, Zhang, Xiao, and Zhao, Dan
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment
Abstract: Precise information on the Higgs boson self-couplings provides the foundation for unveiling the electroweak symmetry breaking mechanism. Due to the scarcity of Higgs boson pair events at the LHC, only loose limits have been obtained. This is based on the assumption that the cross section is a quadratic function of the trilinear Higgs self-coupling in the $\kappa$ framework. However, if higher-order corrections of virtual Higgs bosons are included, the function form would dramatically change. In particular, new quartic and cubic power dependence on the trilinear Higgs self-coupling would appear. To get this new function form, we have performed a specialized renormalization procedure suitable for tracking all the Higgs self-couplings in each calculation step. Moreover, we introduce renormalization of the scaling parameter in the $\kappa$ framework to ensure the cancellation of all ultraviolet divergences. With the new function forms of the cross sections in both the gluon-gluon fusion and vector boson fusion channels, the upper limit of $\kappa_{\lambda_3}=\lambda_{\rm 3H}/\lambda_{\rm 3H}^{\rm SM}$ by the ATLAS (CMS) collaboration is reduced from 6.6 (6.49) to 5.4 (5.37). However, it is still hard to extract a meaningful constraint on the quartic Higgs self-coupling $\lambda_{\rm 4H}$ from Higgs boson pair production data. We also present the invariant mass distributions of the Higgs boson pair at different values of $\kappa_{\lambda}$, which could help to set optimal cuts in the experimental analysis., Comment: 12 pages, 4 figures
Published: 2024

24. Predicting doping strategies for ternary nickel-cobalt-manganese cathode materials to enhance battery performance using graph neural networks

Author: Zhao, Zirui, Luo, Dong, Wu, Shuxing, Sun, Kaitong, Lin, Zhan, and Li, Hai-Feng
Subjects: Condensed Matter - Materials Science, Physics - Computational Physics, I.2.8, J.2
Abstract: The exceptional electrochemical performance of lithium-ion batteries has spurred considerable interest in advanced battery technologies, particularly those utilizing ternary nickel-cobalt-manganese (NCM) cathode materials, which are renowned for their robust electrochemical performance and structural stability. Building upon this research, investigators have explored doping additional elements into NCM cathode materials to further enhance their electrochemical performance and structural integrity. However, the multitude of doping strategies available for NCM battery systems presents a challenge in determining the most effective approach. In this study, we elucidate the potential of ternary NCM systems as cathode materials for lithium-ion batteries. We compile a comprehensive database of lithium-ion batteries employing NCM systems from various sources of prior research and develop a corresponding data-driven model utilizing graph neural networks to predict optimal doping strategies. Our aim is to provide insights into the NCM-based battery systems for both fundamental understanding and practical applications.
Published: 2024
Full Text: View/download PDF

25. MonoSparse-CAM: Harnessing Monotonicity and Sparsity for Enhanced Tree Model Processing on CAMs

Author: Molom-Ochir, Tergel, Taylor, Brady, Li, Hai, and Chen, Yiran
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Hardware Architecture
Abstract: Despite significant advancements in AI driven by neural networks, tree-based machine learning (TBML) models excel on tabular data. These models exhibit promising energy efficiency, and high performance, particularly when accelerated on analog content-addressable memory (aCAM) arrays. However, optimizing their hardware deployment, especially in leveraging TBML model structure and aCAM circuitry, remains challenging. In this paper, we introduce MonoSparse-CAM, a novel content-addressable memory (CAM) based computing optimization technique. MonoSparse-CAM efficiently leverages TBML model sparsity and CAM array circuits, enhancing processing performance. Our experiments show that MonoSparse-CAM reduces energy consumption by up to 28.56x compared to raw processing and 18.51x compared to existing deployment optimization techniques. Additionally, it consistently achieves at least 1.68x computational efficiency over current methods. By enabling energy-efficient CAM-based computing while preserving performance regardless of the array sparsity, MonoSparse-CAM addresses the high energy consumption problem of CAM which hinders processing of large arrays. Our contributions are twofold: we propose MonoSparse-CAM as an effective deployment optimization solution for CAM-based computing, and we investigate the impact of TBML model structure on array sparsity. This work provides crucial insights for energy-efficient TBML on hardware, highlighting a significant advancement in sustainable AI technologies.
Published: 2024

26. Suppression of quantum dissipation: A cooperative effect of quantum squeezing and quantum measurement

Author: Xia, Yi-Ming, Wang, Yi-Fei, Zhang, Xiao-Yun, Li, Hai-Chao, and Xiong, Wei
Subjects: Quantum Physics
Abstract: The ability to isolate a quantum system from its environment is of fundamental interest and importance in optical quantum science and technology. Here we propose an experimentally feasible scheme for beating environment-induced dissipation in an open two-level system coupled to a parametrically driven cavity. The mechanism relies on a novel cooperation between light-matter coupling enhancement and frequent measurements. We demonstrate that, in the presence of the cooperation, the system dynamics can be completely dominated by the effective system-cavity interaction and the dissipative effects from the system-environment coupling can be surprisingly ignored. This work provides a generic method of dissipation suppression in a variety of quantum mechanical platforms, including natural atoms and superconducting circuits.
Published: 2024

27. Multi-agent Cooperative Games Using Belief Map Assisted Training

Author: Huang, Qinwei, Luo, Chen, Wu, Alex B., Khan, Simon, Li, Hai, and Qiu, Qinru
Subjects: Computer Science - Multiagent Systems, Computer Science - Machine Learning
Abstract: In a multi-agent system, agents share their local observations to gain global situational awareness for decision making and collaboration using a message passing system. When to send a message, how to encode a message, and how to leverage the received messages directly affect the effectiveness of the collaboration among agents. When training a multi-agent cooperative game using reinforcement learning (RL), the message passing system needs to be optimized together with the agent policies. This consequently increases the model's complexity and poses significant challenges to the convergence and performance of learning. To address this issue, we propose the Belief-map Assisted Multi-agent System (BAMS), which leverages a neuro-symbolic belief map to enhance training. The belief map decodes the agent's hidden state to provide a symbolic representation of the agent's understanding of the environment and other agent's status. The simplicity of symbolic representation allows the gathering and comparison of the ground truth information with the belief, which provides an additional channel of feedback for the learning. Compared to the sporadic and delayed feedback coming from the reward in RL, the feedback from the belief map is more consistent and reliable. Agents using BAMS can learn a more effective message passing network to better understand each other, resulting in better performance in a cooperative predator and prey game with varying levels of map complexity and compare it to previous multi-agent message passing models. The simulation results showed that BAMS reduced training epochs by 66\%, and agents who apply the BAMS model completed the game with 34.62\% fewer steps on average.
Published: 2024
Full Text: View/download PDF

28. FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Author: Lv, Yuanjun, Li, Hai, Yan, Ying, Liu, Junhui, Xie, Danming, and Xie, Lei
Subjects: Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Vocoders reconstruct speech waveforms from acoustic features and play a pivotal role in modern TTS systems. Frequent-domain GAN vocoders like Vocos and APNet2 have recently seen rapid advancements, outperforming time-domain models in inference speed while achieving comparable audio quality. However, these frequency-domain vocoders suffer from large parameter sizes, thus introducing extra memory burden. Inspired by PriorGrad and SpecGrad, we employ pseudo-inverse to estimate the amplitude spectrum as the initialization roughly. This simple initialization significantly mitigates the parameter demand for vocoder. Based on APNet2 and our streamlined Amplitude prediction branch, we propose our FreeV, compared with its counterpart APNet2, our FreeV achieves 1.8 times inference speed improvement with nearly half parameters. Meanwhile, our FreeV outperforms APNet2 in resynthesis quality, marking a step forward in pursuing real-time, high-fidelity speech synthesis. Code and checkpoints is available at: https://github.com/BakerBunker/FreeV, Comment: Accepted by InterSpeech 2024; 5 pages, 5 figures
Published: 2024

29. PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction

Author: Chen, Danpeng, Li, Hai, Ye, Weicai, Wang, Yifan, Xie, Weijian, Zhai, Shangjin, Wang, Nan, Liu, Haomin, Bao, Hujun, and Zhang, Guofeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, 3D Gaussian Splatting (3DGS) has attracted widespread attention due to its high-quality rendering, and ultra-fast training and rendering speed. However, due to the unstructured and irregular nature of Gaussian point clouds, it is difficult to guarantee geometric reconstruction accuracy and multi-view consistency simply by relying on image reconstruction loss. Although many studies on surface reconstruction based on 3DGS have emerged recently, the quality of their meshes is generally unsatisfactory. To address this problem, we propose a fast planar-based Gaussian splatting reconstruction representation (PGSR) to achieve high-fidelity surface reconstruction while ensuring high-quality rendering. Specifically, we first introduce an unbiased depth rendering method, which directly renders the distance from the camera origin to the Gaussian plane and the corresponding normal map based on the Gaussian distribution of the point cloud, and divides the two to obtain the unbiased depth. We then introduce single-view geometric, multi-view photometric, and geometric regularization to preserve global geometric accuracy. We also propose a camera exposure compensation model to cope with scenes with large illumination variations. Experiments on indoor and outdoor scenes show that our method achieves fast training and rendering while maintaining high-fidelity rendering and geometric reconstruction, outperforming 3DGS-based and NeRF-based methods., Comment: project page: https://zju3dv.github.io/pgsr/
Published: 2024

30. Quasinormal modes and ringdown waveform of the Frolov black hole

Author: Song, Zhijun, Gong, Huajie, Li, Hai-Li, Fu, Guoyang, Zhu, Li-Gang, and Wu, Jian-Pin
Subjects: General Relativity and Quantum Cosmology
Abstract: In this paper we investigate scalar perturbation over a Frolov black hole (BH), which is a regular BH induced by the quantum gravity effect. The quasinormal frequencies of a scalar field always consistently reside in the lower half-plane, and the time-domain evolution of the field demonstrates a decaying behavior, with the late-time tail exhibiting a power-law pattern. These observations collectively suggest the stability of a Frolov BH against scalar perturbation. Additionally, our study reveals that the quantum gravity effect leads to slower decay modes. For the case of the angular quantum number $l=0$, the oscillation exhibits non-monotonic behavior with the quantum gravity parameter $\alpha_0$. However, once $l\geq 1$, the angular quantum number surpasses the influence of the quantum gravity effect., Comment: 15 pages, 7 figures; accepted for publication in Communications in Theoretical Physics
Published: 2024
Full Text: View/download PDF

31. Can Dense Connectivity Benefit Outlier Detection? An Odyssey with NAS

Author: Fu, Hao, Zhang, Tunhou, Li, Hai, and Chen, Yiran
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advances in Out-of-Distribution (OOD) Detection is the driving force behind safe and reliable deployment of Convolutional Neural Networks (CNNs) in real world applications. However, existing studies focus on OOD detection through confidence score and deep generative model-based methods, without considering the impact of DNN structures, especially dense connectivity in architecture fabrications. In addition, existing outlier detection approaches exhibit high variance in generalization performance, lacking stability and confidence in evaluating and ranking different outlier detectors. In this work, we propose a novel paradigm, Dense Connectivity Search of Outlier Detector (DCSOD), that automatically explore the dense connectivity of CNN architectures on near-OOD detection task using Neural Architecture Search (NAS). We introduce a hierarchical search space containing versatile convolution operators and dense connectivity, allowing a flexible exploration of CNN architectures with diverse connectivity patterns. To improve the quality of evaluation on OOD detection during search, we propose evolving distillation based on our multi-view feature learning explanation. Evolving distillation stabilizes training for OOD detection evaluation, thus improves the quality of search. We thoroughly examine DCSOD on CIFAR benchmarks under OOD detection protocol. Experimental results show that DCSOD achieve remarkable performance over widely used architectures and previous NAS baselines. Notably, DCSOD achieves state-of-the-art (SOTA) performance on CIFAR benchmark, with AUROC improvement of $\sim$1.0%.
Published: 2024

32. Mechanical dynamics around higher-order exceptional point in magno-optomechanics

Author: He, Wen-Di, Fan, Xiao-Hong, Liu, Ming-Yue, Zhang, Guo-Qiang, Li, Hai-Chao, and Xiong, Wei
Subjects: Quantum Physics
Abstract: We theoretically study diverse exceptional points (EPs) in an experimentally feasible magno-optomechanics consisting of an optomechanical subsystem coupled to a magnomechanical subsystem via physically direct contact. By adiabatically eliminating both the cavity and the Kittel mode, dissipative and parity-time symmetric exceptional points can be observed. When only the cavity mode is eliminated, a second (third) -order pseudo-Hermitian EP emerges for nondegenerate (degenerate) mechanical modes. The distinct dynamical behavior of two mechanical modes around these EPs are further studied. Our proposal provides a promising way to engineer diverse EPs and quantify non-Hermitian phase transition with exceptional dynamical behavior in magno-optomechanics., Comment: 7 pages, 6 figures
Published: 2024

33. Upper limit on the axion-photon coupling from Markarian 421

Author: Li, Hai-Jun, Chao, Wei, and Zhou, Yu-Feng
Subjects: High Energy Physics - Phenomenology, Astrophysics - High Energy Astrophysical Phenomena
Abstract: Markarian 421 is a well-known nearby BL Lac blazar at the redshift $z=0.031$. Many previous works were investigated to constrain the axion-photon coupling from its TeV gamma-ray observations, showing the upper limit on the coupling constant $g_{a\gamma} \lesssim 2.0\times 10^{-11} \rm \, GeV^{-1}$ for the axion mass $[5.0\times10^{-10} \, {\rm eV} \lesssim m_a \lesssim 5.0\times10^{-7} \, {\rm eV}]$. While in this work, we obtain a more stringent upper limit on the axion-photon coupling from the 1038 days gamma-ray observations of the blazar Markarian 421. The long-term gamma-ray spectra are measured by the collaborations Large Area Telescope on board NASA's Fermi Gamma-ray Space Telescope (Fermi-LAT) and High Altitude Water Cherenkov (HAWC) Gamma-Ray Observatory from 2015 June to 2018 July. We show the best-fit spectral energy distributions (SEDs) of Markarian 421 under the null and axion hypotheses. Then we set the axion-photon limit in the $\{m_a, \, g_{a\gamma}\}$ plane. The 99% $\rm C.L.$ upper limit set by Markarian 421 is $g_{a\gamma} \lesssim 4.0\times 10^{-12} \rm \, GeV^{-1}$ for the axion mass $[1.0\times10^{-9} \, {\rm eV} \lesssim m_a \lesssim 1.0\times10^{-8} \, {\rm eV}]$. It is the most stringent upper limit in this axion mass region., Comment: 11 pages, 4 figures. Published in PLB
Published: 2024

34. DrHouse: An LLM-empowered Diagnostic Reasoning System through Harnessing Outcomes from Sensor Data and Expert Knowledge

Author: Yang, Bufang, Jiang, Siyang, Xu, Lilin, Liu, Kaiwei, Li, Hai, Xing, Guoliang, Chen, Hongkai, Jiang, Xiaofan, and Yan, Zhenyu
Subjects: Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have the potential to transform digital healthcare, as evidenced by recent advances in LLM-based virtual doctors. However, current approaches rely on patient's subjective descriptions of symptoms, causing increased misdiagnosis. Recognizing the value of daily data from smart devices, we introduce a novel LLM-based multi-turn consultation virtual doctor system, DrHouse, which incorporates three significant contributions: 1) It utilizes sensor data from smart devices in the diagnosis process, enhancing accuracy and reliability. 2) DrHouse leverages continuously updating medical databases such as Up-to-Date and PubMed to ensure our model remains at diagnostic standard's forefront. 3) DrHouse introduces a novel diagnostic algorithm that concurrently evaluates potential diseases and their likelihood, facilitating more nuanced and informed medical assessments. Through multi-turn interactions, DrHouse determines the next steps, such as accessing daily data from smart devices or requesting in-lab tests, and progressively refines its diagnoses. Evaluations on three public datasets and our self-collected datasets show that DrHouse can achieve up to an 18.8% increase in diagnosis accuracy over the state-of-the-art baselines. The results of a 32-participant user study show that 75% medical experts and 91.7% patients are willing to use DrHouse.
Published: 2024

35. A new ferromagnetic semiconductor system of Eu$_{1-x}$Sr$_x$AgP $(x = 0.0-0.6)$ compounds: Crystallographic, magnetic, and magneto-resistive properties

Author: Zhao, Qian, Sun, Kaitong, Xia, Junchao, and Li, Hai-Feng
Subjects: Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons, Physics - Applied Physics
Abstract: Adjusting chemical pressure through doping is a highly effective method for customizing the chemical and physical properties of materials, along with their respective phase diagrams, thereby uncovering novel quantum phenomena. Here, we successfully synthesized Sr-doped Eu$_{1-x}$Sr$_x$AgP $(x = 0.0-0.6)$ and conducted a comprehensive investigation involving crystallography, magnetization, heat capacity, and magnetoresistance. Utilizing X-ray diffraction and PPMS DynaCool measurements, we studied Eu$_{1-x}$Sr$_x$AgP in detail. The hexagonal structure of parent EuAgP at room temperature, with the $P6_3/mmc$ space group, remains unaltered, while the lattice constants expand. The magnetic phase transition from paramagnetism to ferromagnetism, as temperature decreases, is suppressed through the gradual introduction of strontium doping. Heat capacity measurements reveal a shift from magnon-dominated to predominantly phonon and electron contributions near the ferromagnetic phase with increasing doping levels. The resistivity-temperature relationship displays distinct characteristics, emphasizing the impact of Sr doping on modifying charge transport. Magnetoresistance measurements uncover novel phenomena, illustrating the adjustability of magnetoresistance through Sr doping. Notably, Sr doping results in both positive magnetoresistance (up to 20\%) and negative magnetoresistance (approximately -60\%). The resistivity and magnetic phase diagram were established for the first time, revealing the pronounced feasibility of Sr doping in modulating EuAgP's resistivity. This study has provided valuable insights into the intricate interplay between structural modifications and diverse physical properties. The potential for technological advancements and the exploration of novel quantum states make Sr-doped Eu$_{1-x}$Sr$_x$AgP a compelling subject for continued research in the field of applied physics., Comment: 15 pages, 5 figures
Published: 2024
Full Text: View/download PDF

36. ATDM:An Anthropomorphic Aerial Tendon-driven Manipulator with Low-Inertia and High-Stiffness

Author: Xu, Quman, Li, Zhan, Li, Hai, Yu, Xinghu, and Yang, Yipeng
Subjects: Computer Science - Robotics, Electrical Engineering and Systems Science - Systems and Control
Abstract: Aerial Manipulator Systems (AMS) have garnered significant interest for their utility in aerial operations. Nonetheless, challenges related to the manipulator's limited stiffness and the coupling disturbance with manipulator movement persist. This paper introduces the Aerial Tendon-Driven Manipulator (ATDM), an innovative AMS that integrates a hexrotor Unmanned Aerial Vehicle (UAV) with a 4-degree-of-freedom (4-DOF) anthropomorphic tendon-driven manipulator. The design of the manipulator is anatomically inspired, emulating the human arm anatomy from the shoulder joint downward. To enhance the structural integrity and performance, finite element topology optimization and lattice optimization are employed on the links to replicate the radially graded structure characteristic of bone, this approach effectively reduces weight and inertia while simultaneously maximizing stiffness. A novel tensioning mechanism with adjustable tension is introduced to address cable relaxation, and a Tension-amplification tendon mechanism is implemented to increase the manipulator's overall stiffness and output. The paper presents a kinematic model based on virtual coupled joints, a comprehensive workspace analysis, and detailed calculations of output torques and stiffness for individual arm joints. The prototype arm has a total weight of 2.7 kg, with the end effector contributing only 0.818 kg. By positioning all actuators at the base, coupling disturbance are minimized. The paper includes a detailed mechanical design and validates the system's performance through semi-physical multi-body dynamics simulations, confirming the efficacy of the proposed design.
Published: 2024

37. A diverse set of two-qubit gates for spin qubits in semiconductor quantum dots

Author: Ni, Ming, Ma, Rong-Long, Kong, Zhen-Zhen, Chu, Ning, Zhu, Sheng-Kai, Wang, Chu, Li, Ao-Ran, Liao, Wei-Zhu, Cao, Gang, Wang, Gui-Lei, Guo, Guang-Can, Hu, Xuedong, Li, Hai-Ou, and Guo, Guo-Ping
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Quantum Physics
Abstract: To realize large-scale quantum information processes, an ideal scheme for two-qubit operations should enable diverse operations with given hardware and physical interaction. However, for spin qubits in semiconductor quantum dots, the common two-qubit operations, including CPhase gates, SWAP gates, and CROT gates, are realized with distinct parameter regions and control waveforms, posing challenges for their simultaneous implementation. Here, taking advantage of the inherent Heisenberg interaction between spin qubits, we propose and verify a fast composite two-qubit gate scheme to extend the available two-qubit gate types as well as reduce the requirements for device properties. Apart from the formerly proposed CPhase (controlled-phase) gates and SWAP gates, theoretical results indicate that the iSWAP-family gate and Fermionic simulation (fSim) gate set are additionally available for spin qubits. Meanwhile, our gate scheme limits the parameter requirements of all essential two-qubit gates to a common J~{\Delta}E_Z region, facilitate the simultaneous realization of them. Furthermore, we present the preliminary experimental demonstration of the composite gate scheme, observing excellent match between the measured and simulated results. With this versatile composite gate scheme, broad-spectrum two-qubit operations allow us to efficiently utilize the hardware and the underlying physics resources, helping accelerate and broaden the scope of the upcoming noise intermediate-scale quantum (NISQ) computing., Comment: 23 pages, 6 figures
Published: 2024

38. CSCO: Connectivity Search of Convolutional Operators

Author: Zhang, Tunhou, Li, Shiyu, Cheng, Hsin-Pai, Yan, Feng, Li, Hai, and Chen, Yiran
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Exploring dense connectivity of convolutional operators establishes critical "synapses" to communicate feature vectors from different levels and enriches the set of transformations on Computer Vision applications. Yet, even with heavy-machinery approaches such as Neural Architecture Search (NAS), discovering effective connectivity patterns requires tremendous efforts due to either constrained connectivity design space or a sub-optimal exploration process induced by an unconstrained search space. In this paper, we propose CSCO, a novel paradigm that fabricates effective connectivity of convolutional operators with minimal utilization of existing design motifs and further utilizes the discovered wiring to construct high-performing ConvNets. CSCO guides the exploration via a neural predictor as a surrogate of the ground-truth performance. We introduce Graph Isomorphism as data augmentation to improve sample efficiency and propose a Metropolis-Hastings Evolutionary Search (MH-ES) to evade locally optimal architectures and advance search quality. Results on ImageNet show ~0.6% performance improvement over hand-crafted and NAS-crafted dense connectivity. Our code is publicly available., Comment: To appear on Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (2024)
Published: 2024

39. Do Counterfactual Examples Complicate Adversarial Training?

Author: Yeats, Eric, Darwin, Cameron, Ortega, Eduardo, Liu, Frank, and Li, Hai
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: We leverage diffusion models to study the robustness-performance tradeoff of robust classifiers. Our approach introduces a simple, pretrained diffusion method to generate low-norm counterfactual examples (CEs): semantically altered data which results in different true class membership. We report that the confidence and accuracy of robust models on their clean training data are associated with the proximity of the data to their CEs. Moreover, robust models perform very poorly when evaluated on the CEs directly, as they become increasingly invariant to the low-norm, semantic changes brought by CEs. The results indicate a significant overlap between non-robust and semantic features, countering the common assumption that non-robust features are not interpretable., Comment: Accepted as a short paper to the GCV Workshop at CVPR'24
Published: 2024

40. High-performance magnesium/sodium hybrid ion battery based on sodium vanadate oxide for reversible storage of Na+ and Mg2+

Author: Wang, Xiaoke, Li, Titi, Zhang, Xixi, Wang, Yaxin, Li, Hongfei, Li, Hai-Feng, Zhao, Gang, and Han, Cuiping
Subjects: Condensed Matter - Materials Science, Physics - Chemical Physics
Abstract: Magnesium ion batteries (MIBs) are a potential field for the energy storage of the future but are restricted by insufficient rate capability and rapid capacity degradation. Magnesium-sodium hybrid ion batteries (MSHBs) are an effective way to address these problems. Here, we report a new type of MSHBs that use layered sodium vanadate ((Na, Mn)V8O20 5H2O, Mn-NVO) cathodes coupled with an organic 3,4,9,10-perylenetetracarboxylic diimide (PTCDI) anode in Mg2+/Na+ hybrid electrolytes. During electrochemical cycling, Mg2+ and Na+ co-participate in the cathode reactions, and the introduction of Na+ promotes the structural stability of the Mn-NVO cathode, as cleared by several ex-situ characterizations. Consequently, the Mn-NVO cathode presents great specific capacity (249.9 mAh g-1 at 300 mA g-1) and cycling (1500 cycles at 1500 mA g-1) in the Mg2+/Na+ hybrid electrolytes. Besides, full battery displays long lifespan with 10,000 cycles at 1000 mA g-1. The rate performance and cycling stability of MSHBs have been improved by an economical and scalable method, and the mechanism for these improvements was discussed.
Published: 2024
Full Text: View/download PDF

41. Zincophilic armor: Phytate ammonium as a multifunctional additive for enhanced performance in aqueous zinc-ion batteries

Author: Xiao, Fangyuan, Wang, Xiaoke, Sun, Kaitong, Zhao, Qian, Han, Cuiping, and Li, Hai-Feng
Subjects: Physics - Applied Physics, Condensed Matter - Materials Science
Abstract: Corrosion and the formation of by-products resulting from parasitic side reactions, as well as random dendrite growth, pose significant challenges for aqueous zinc-ion batteries (AZIBs). In this study, phytate ammonium is introduced into the traditional dilute Zinc sulfate electrolyte as a multi-functional additive. Leveraging the inherent zincophilic nature of the phytic anion, a protective layer is formed on the surface of the zinc anode. This layer can effectively manipulate the deposition process, mitigate parasitic reactions, and reduce the accumulation of detrimental by-products. Additionally, the competitive deposition between dissociated ammonium ions and Zn2+ promotes uniform deposition, thereby alleviating dendrite growth. Consequently, the modified electrolyte with a lower volume addition exhibits superior performance. The zinc symmetric battery demonstrates much more reversible plating/stripping, sustaining over 2000 hours at 5 mA cm-2 and 1 mA h cm-2. A high average deposition/stripping efficiency of 99.83% is achieved, indicating the significant boosting effect and practical potential of our strategy for high-performance aqueous zinc-ion batteries.
Published: 2024
Full Text: View/download PDF

42. Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models

Author: Zhang, Jingyang, Sun, Jingwei, Yeats, Eric, Ouyang, Yang, Kuo, Martin, Zhang, Jianyi, Yang, Hao Frank, and Li, Hai
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: The problem of pre-training data detection for large language models (LLMs) has received growing attention due to its implications in critical issues like copyright violation and test data contamination. Despite improved performance, existing methods (including the state-of-the-art, Min-K%) are mostly developed upon simple heuristics and lack solid, reasonable foundations. In this work, we propose a novel and theoretically motivated methodology for pre-training data detection, named Min-K%++. Specifically, we present a key insight that training samples tend to be local maxima of the modeled distribution along each input dimension through maximum likelihood training, which in turn allow us to insightfully translate the problem into identification of local maxima. Then, we design our method accordingly that works under the discrete distribution modeled by LLMs, whose core idea is to determine whether the input forms a mode or has relatively high probability under the conditional categorical distribution. Empirically, the proposed method achieves new SOTA performance across multiple settings. On the WikiMIA benchmark, Min-K%++ outperforms the runner-up by 6.2% to 10.5% in detection AUROC averaged over five models. On the more challenging MIMIR benchmark, it consistently improves upon reference-free methods while performing on par with reference-based method that requires an extra reference model., Comment: Project page and code is available at https://zjysteven.github.io/mink-plus-plus/
Published: 2024

43. Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Author: Miyai, Atsuyuki, Yang, Jingkang, Zhang, Jingyang, Ming, Yifei, Yu, Qing, Irie, Go, Li, Yixuan, Li, Hai, Liu, Ziwei, and Aizawa, Kiyoharu
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: This paper introduces a novel and significant challenge for Vision Language Models (VLMs), termed Unsolvable Problem Detection (UPD). UPD examines the VLM's ability to withhold answers when faced with unsolvable problems in the context of Visual Question Answering (VQA) tasks. UPD encompasses three distinct settings: Absent Answer Detection (AAD), Incompatible Answer Set Detection (IASD), and Incompatible Visual Question Detection (IVQD). To deeply investigate the UPD problem, extensive experiments indicate that most VLMs, including GPT-4V and LLaVA-Next-34B, struggle with our benchmarks to varying extents, highlighting significant room for the improvements. To address UPD, we explore both training-free and training-based solutions, offering new insights into their effectiveness and limitations. We hope our insights, together with future efforts within the proposed UPD settings, will enhance the broader understanding and development of more practical and reliable VLMs., Comment: Code: https://github.com/AtsuMiyai/UPD
Published: 2024

44. Vox-Fusion++: Voxel-based Neural Implicit Dense Tracking and Mapping with Multi-maps

Author: Zhai, Hongjia, Li, Hai, Yang, Xingrui, Huang, Gan, Ming, Yuhang, Bao, Hujun, and Zhang, Guofeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we introduce Vox-Fusion++, a multi-maps-based robust dense tracking and mapping system that seamlessly fuses neural implicit representations with traditional volumetric fusion techniques. Building upon the concept of implicit mapping and positioning systems, our approach extends its applicability to real-world scenarios. Our system employs a voxel-based neural implicit surface representation, enabling efficient encoding and optimization of the scene within each voxel. To handle diverse environments without prior knowledge, we incorporate an octree-based structure for scene division and dynamic expansion. To achieve real-time performance, we propose a high-performance multi-process framework. This ensures the system's suitability for applications with stringent time constraints. Additionally, we adopt the idea of multi-maps to handle large-scale scenes, and leverage loop detection and hierarchical pose optimization strategies to reduce long-term pose drift and remove duplicate geometry. Through comprehensive evaluations, we demonstrate that our method outperforms previous methods in terms of reconstruction quality and accuracy across various scenarios. We also show that our Vox-Fusion++ can be used in augmented reality and collaborative mapping applications. Our source code will be publicly available at \url{https://github.com/zju3dv/Vox-Fusion_Plus_Plus}, Comment: 14 pages. arXiv admin note: text overlap with arXiv:2210.15858
Published: 2024

45. Exploring global symmetry-breaking superradiant phase via phase competition

Author: Li, Hai-Chao, Huang, Wen, and Xiong, Wei
Subjects: Quantum Physics
Abstract: Superradiant phase transitions play a fundamental role in understanding the mechanism of collective light-matter interaction at the quantum level. Here we investigate multiple superradiant phases and phase transitions with different symmetry-breaking patterns in a two-mode V-type Dicke model. Interestingly, we show that there exists a quadruple point where one normal phase, one global symmetry-breaking superradiant phase and two local symmetry-breaking superradiant phases meet. Such a global phase results from the phase competition between two local superradiant phases and can not occur in the standard $\Lambda$- and $\Xi$-type three-level configurations in quantum optics. Moreover, we exhibit a sequential first-order quantum phase transition from one local to the global again to the other local superradiant phase. Our study opens up a perspective of exploring multi-level quantum critical phenomena with global symmetry breaking.
Published: 2024
Full Text: View/download PDF

46. Coherent competition and control between three-wave mixing and four-wave mixing in superconducting circuits

Author: Liang, Miao-Xiang, Qiu, Yu-Xiang, Li, Hai-Chao, and Xiong, Wei
Subjects: Quantum Physics
Abstract: Exploring intermixing and interplay between different frequency-mixing processes has always been one of the interesting subjects at the interface of nonlinear optics with quantum optics. Here we investigate coherent competition and control between three-wave mixing (TWM) and four-wave mixing (FWM) in a cyclic three-level superconducting quantum system. In the weak control-field regime, strong competition leads to an alternating oscillation between TWM and FWM signals and this oscillation is a signature of strong energy exchange between these two nonlinear processes. In particular, such oscillation is absent from conventional multi-wave mixing in atomic systems. Surprisingly, synchronous TWM and FWM processes are demonstrated in the strong control-field regime and, at the same time, their efficiencies can be as high as 40% and 45%, respectively. Our study shows that these competitive behaviors between TWM and FWM can be manipulated by tuning the control-field intensity.
Published: 2024

47. Crystal, ferromagnetism, and magnetoresistance with sign reversal in a EuAgP semiconductor

Author: Zhao, Qian, Sun, Kaitong, Wu, Si, and Li, Hai-Feng
Subjects: Condensed Matter - Strongly Correlated Electrons, Condensed Matter - Materials Science
Abstract: We synthesized the ferromagnetic EuAgP semiconductor and conducted a comprehensive study of its crystalline, magnetic, heat capacity, band gap, and magnetoresistance properties. Our investigation utilized a combination of X-ray diffraction, optical, and PPMS DynaCool measurements. EuAgP adopts a hexagonal structure with the $P6_3/mmc$ space group. As the temperature decreases, it undergoes a magnetic phase transition from high-temperature paramagnetism to low-temperature ferromagnetism. We determined the ferromagnetic transition temperature to be $T_{\textrm{C}} =$ 16.45(1) K by fitting the measured magnetic susceptibility using a Curie-Weiss law. Heat capacity analysis of EuAgP considered contributions from electrons, phonons, and magnons, revealing $\eta$ = 0.03 J/mol/$\textrm{K}^\textrm{2}$, indicative of semiconducting behavior. Additionally, we calculated a band gap of $\sim$ 1.324(4) eV based on absorption spectrum measurements. The resistivity versus temperature of EuAgP measured in the absence of an applied magnetic field shows a pronounced peak around $T_{\textrm{C}}$, which diminishes rapidly with increasing applied magnetic fields, ranging from 1 to 14 T. An intriguing phenomenon emerges in the form of a distinct magnetoresistance transition, shifting from positive (e.g., 1.95\% at 300 K and 14 T) to negative (e.g., -30.73\% at 14.25 K and 14 T) as the temperature decreases. This behavior could be attributed to spin-disordered scattering.
Published: 2024
Full Text: View/download PDF

48. Structure and magnetic properties of a La$_{0.75}$Sr$_{0.25}$Cr$_{0.90}$O$_{3-\delta}$ single crystal

Author: Sun, Kaitong, Zhu, Yinghao, Yano, Shinichiro, Zhao, Qian, Su, Muqing, Xu, Guanping, Zheng, Ruifeng, Fu, Ying Ellie, and Li, Hai-Feng
Subjects: Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: We have successfully grown large and good-quality single crystals of the La$_{0.75}$Sr$_{0.25}$Cr$_{0.90}$O$_{3-\delta}$ compound using the floating-zone method with laser diodes. We investigated the crystal quality, crystallography, chemical composition, magnetic properties and the oxidation state of Cr in the grown single crystals by employing a combination of techniques, including X-ray Laue and powder diffraction, scanning electron microscopy, magnetization measurements, X-ray photoelectron spectroscopy and light absorption. The La$_{0.75}$Sr$_{0.25}$Cr$_{0.90}$O$_{3-\delta}$ single crystal exhibits a single-phase composition, crystallizing in a trigonal structure with the space group $R\bar{3}c$ at room temperature. The chemical composition was determined as La$_{0.75}$Sr$_{0.25}$Cr$_{0.90}$O$_{3-\delta}$, indicating a significant chromium deficiency. Upon warming, we observed five distinctive characteristic temperatures, namely $T_1 =$ 21.50(1) K, $T_2 =$ 34.98(1) K, $T_3 =$ 117.94(1) K, $T_4 =$ 155.01(1) K, and $T_{\textrm{N}} =$ 271.80(1) K, revealing five distinct magnetic anomalies. Our magnetization study allows us to explore the nature of these anomalies. Remarkably, the oxidation state of chromium in the single-crystal La$_{0.75}$Sr$_{0.25}$Cr$_{0.90}$O$_{3-\delta}$, characterized by a band gap of 1.630(8) eV, is exclusively attributed to Cr$^{3+}$ ions, making a departure from the findings of previous studies on polycrystalline materials., Comment: 24 pages, 10 figures
Published: 2024
Full Text: View/download PDF

49. Peeking Behind the Curtains of Residual Learning

Author: Zhang, Tunhou, Yan, Feng, Li, Hai, and Chen, Yiran
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The utilization of residual learning has become widespread in deep and scalable neural nets. However, the fundamental principles that contribute to the success of residual learning remain elusive, thus hindering effective training of plain nets with depth scalability. In this paper, we peek behind the curtains of residual learning by uncovering the "dissipating inputs" phenomenon that leads to convergence failure in plain neural nets: the input is gradually compromised through plain layers due to non-linearities, resulting in challenges of learning feature representations. We theoretically demonstrate how plain neural nets degenerate the input to random noise and emphasize the significance of a residual connection that maintains a better lower bound of surviving neurons as a solution. With our theoretical discoveries, we propose "The Plain Neural Net Hypothesis" (PNNH) that identifies the internal path across non-linear layers as the most critical part in residual learning, and establishes a paradigm to support the training of deep plain neural nets devoid of residual connections. We thoroughly evaluate PNNH-enabled CNN architectures and Transformers on popular vision benchmarks, showing on-par accuracy, up to 0.3% higher training throughput, and 2x better parameter efficiency compared to ResNets and vision Transformers., Comment: Arxiv Preprint
Published: 2024

50. Non-existence of classical solutions to a two-phase flow model with vacuum

Author: Li, Hai-Liang, Wang, Yuexun, and Zhang, Yue
Subjects: Mathematics - Analysis of PDEs, 35A01, 76N06, 35Q31
Abstract: In this paper, we study the well-posedness of classical solutions to a two-phase flow model consisting of the pressureless Euler equations coupled with the isentropic compressible Navier-Stokes equations via a drag forcing term. We consider the case that the fluid densities may contain a vacuum, and the viscosities are density-dependent functions. Under suitable assumptions on the initial data, we show that the finite-energy (i.e., in the inhomogeneous Sobolev space) classical solutions to the Cauchy problem of this coupled system do not exist for any small time.
Published: 2024

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

Publisher

476 results on '"Li, Hai"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources