Author: "WANG, Chen" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"WANG, Chen"' showing total 48,720 results

Start Over Author "WANG, Chen"

48,720 results on '"WANG, Chen"'

1. Convergence of random splitting method for the Allen-Cahn equation in a background flow

Author: Li, Lei and Wang, Chen
Subjects: Mathematics - Numerical Analysis, Mathematics - Analysis of PDEs
Abstract: We study in this paper the convergence of the random splitting method for Allen-Cahn equation in a background flow that plays as a simplified model for phase separation in multiphase flows. The model does not own the gradient flow structure as the usual Allen-Cahn equation does, and the random splitting method is advantageous due to its simplicity and better convergence rate. Though the random splitting is a classical method, the analysis of the convergence is not straightforward for this model due to the nonlinearity and unboundedness of the operators. We obtain uniform estimates of various Sobolev norms of the numerical solutions and the stability of the model. Based on the Sobolev estimates, the local trunction errors are then rigorously obtained. We then prove that the random operator splitting has an expected single run error with order $1.5$ and a bias with order $2$. Numerical experiments are then performed to confirm our theoretic findings.
Published: 2025

2. Measurement of Neutral Atmosphere Density During the Years of Increasing Solar Activity Using \textit{Insight}-HXMT Data with the Earth Occultation Technique

Author: Zhang, Hao-Hui, Xue, Wang-Chen, Li, Xiao-Bo, Zhang, Shuang-Nan, Xiong, Shao-Lin, Chen, Yong, Li, Hai-Tao, Song, Li-Ming, Ge, Ming-Yu, Zhao, Hai-Sheng, and Yu, Yun-Wei
Subjects: Physics - Atmospheric and Oceanic Physics, Astrophysics - Earth and Planetary Astrophysics, Astrophysics - High Energy Astrophysical Phenomena
Abstract: The density of the Earth's middle and upper atmosphere is an important question in Earth science and is a critical factor in the design, operation, and orbital determination of low Earth orbit spacecraft. In this study, we employ the Earth Occultation Technique (EOT) combined with Maximum Likelihood Estimation to estimate the neutral atmospheric density by modeling the attenuation of X-ray photons during the occultation process of \textit{Insight}-HXMT observations of Crab Nebula. Based on 83 occultation datasets of the Crab Nebula observed by all three sets of telescopes of \textit{Insight}-HXMT between 2022 and 2024, we derived the atmospheric densities at altitudes ranging from 55\,--130\,km. We find a general agreement between our results and the prediction by the NRLMSIS model within the altitude ranges of 65\,-- 90\,km, 95\,--100\,km and 120\,--130\,km, particularly during periods of enhanced solar activity. However, we also find that the NRLMSIS model overestimates atmospheric density at altitudes 90\,--95\,km and 100\,--120\,km by approximately 20\%. Furthermore, since the atmospheric density measurements at altitudes of 55\,--\,65\,km may be subject to selection bias, we do not report the prediction accuracy of the NRLMSIS model at this altitude., Comment: 11 pages, 4 figures, 2 tables
Published: 2025

3. Training Large Recommendation Models via Graph-Language Token Alignment

Author: Yang, Mingdai, Liu, Zhiwei, Yang, Liangwei, Liu, Xiaolong, Wang, Chen, Peng, Hao, and Yu, Philip S.
Subjects: Computer Science - Information Retrieval
Abstract: Recommender systems (RS) have become essential tools for helping users efficiently navigate the overwhelming amount of information on e-commerce and social platforms. However, traditional RS relying on Collaborative Filtering (CF) struggles to integrate the rich semantic information from textual data. Meanwhile, large language models (LLMs) have shown promising results in natural language processing, but directly using LLMs for recommendation introduces challenges, such as ambiguity in generating item predictions and inefficiencies in scalability. In this paper, we propose a novel framework to train Large Recommendation models via Graph-Language Token Alignment. By aligning item and user nodes from the interaction graph with pretrained LLM tokens, GLTA effectively leverages the reasoning abilities of LLMs. Furthermore, we introduce Graph-Language Logits Matching (GLLM) to optimize token alignment for end-to-end item prediction, eliminating ambiguity in the free-form text as recommendation results. Extensive experiments on three benchmark datasets demonstrate the effectiveness of GLTA, with ablation studies validating each component., Comment: 5 pages. Accepted by www'25 as short paper
Published: 2025
Full Text: View/download PDF

4. Enhancing External Validity of Experiments with Ongoing Sampling

Author: Wang, Chen, Han, Shichao, and Huang, Shan
Subjects: Economics - General Economics, Statistics - Applications
Abstract: Participants in online experiments often enroll over time, which can compromise sample representativeness due to temporal shifts in covariates. This issue is particularly critical in A/B tests, online controlled experiments extensively used to evaluate product updates, since these tests are cost-sensitive and typically short in duration. We propose a novel framework that dynamically assesses sample representativeness by dividing the ongoing sampling process into three stages. We then develop stage-specific estimators for Population Average Treatment Effects (PATE), ensuring that experimental results remain generalizable across varying experiment durations. Leveraging survival analysis, we develop a heuristic function that identifies these stages without requiring prior knowledge of population or sample characteristics, thereby keeping implementation costs low. Our approach bridges the gap between experimental findings and real-world applicability, enabling product decisions to be based on evidence that accurately represents the broader target population. We validate the effectiveness of our framework on three levels: (1) through a real-world online experiment conducted on WeChat; (2) via a synthetic experiment; and (3) by applying it to 600 A/B tests on WeChat in a platform-wide application. Additionally, we provide practical guidelines for practitioners to implement our method in real-world settings.
Published: 2025

5. Geometry-Aware 3D Salient Object Detection Network

Author: Wang, Chen, Zhang, Liyuan, Hui, Le, Liu, Qi, and Dai, Yuchao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Point cloud salient object detection has attracted the attention of researchers in recent years. Since existing works do not fully utilize the geometry context of 3D objects, blurry boundaries are generated when segmenting objects with complex backgrounds. In this paper, we propose a geometry-aware 3D salient object detection network that explicitly clusters points into superpoints to enhance the geometric boundaries of objects, thereby segmenting complete objects with clear boundaries. Specifically, we first propose a simple yet effective superpoint partition module to cluster points into superpoints. In order to improve the quality of superpoints, we present a point cloud class-agnostic loss to learn discriminative point features for clustering superpoints from the object. After obtaining superpoints, we then propose a geometry enhancement module that utilizes superpoint-point attention to aggregate geometric information into point features for predicting the salient map of the object with clear boundaries. Extensive experiments show that our method achieves new state-of-the-art performance on the PCSOD dataset.
Published: 2025

6. Physical Depth-aware Early Accident Anticipation: A Multi-dimensional Visual Feature Fusion Framework

Author: Huang, Hongpu, Zhou, Wei, and Wang, Chen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Early accident anticipation from dashcam videos is a highly desirable yet challenging task for improving the safety of intelligent vehicles. Existing advanced accident anticipation approaches commonly model the interaction among traffic agents (e.g., vehicles, pedestrians, etc.) in the coarse 2D image space, which may not adequately capture their true positions and interactions. To address this limitation, we propose a physical depth-aware learning framework that incorporates the monocular depth features generated by a large model named Depth-Anything to introduce more fine-grained spatial 3D information. Furthermore, the proposed framework also integrates visual interaction features and visual dynamic features from traffic scenes to provide a more comprehensive perception towards the scenes. Based on these multi-dimensional visual features, the framework captures early indicators of accidents through the analysis of interaction relationships between objects in sequential frames. Additionally, the proposed framework introduces a reconstruction adjacency matrix for key traffic participants that are occluded, mitigating the impact of occluded objects on graph learning and maintaining the spatio-temporal continuity. Experimental results on public datasets show that the proposed framework attains state-of-the-art performance, highlighting the effectiveness of incorporating visual depth features and the superiority of the proposed framework.
Published: 2025

7. Benchmarking LLMs for Political Science: A United Nations Perspective

Author: Liang, Yueqing, Yang, Liangwei, Wang, Chen, Xia, Congying, Meng, Rui, Xu, Xiongxiao, Wang, Haoran, Payani, Ali, and Shu, Kai
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society, Computer Science - Emerging Technologies
Abstract: Large Language Models (LLMs) have achieved significant advances in natural language processing, yet their potential for high-stake political decision-making remains largely unexplored. This paper addresses the gap by focusing on the application of LLMs to the United Nations (UN) decision-making process, where the stakes are particularly high and political decisions can have far-reaching consequences. We introduce a novel dataset comprising publicly available UN Security Council (UNSC) records from 1994 to 2024, including draft resolutions, voting records, and diplomatic speeches. Using this dataset, we propose the United Nations Benchmark (UNBench), the first comprehensive benchmark designed to evaluate LLMs across four interconnected political science tasks: co-penholder judgment, representative voting simulation, draft adoption prediction, and representative statement generation. These tasks span the three stages of the UN decision-making process--drafting, voting, and discussing--and aim to assess LLMs' ability to understand and simulate political dynamics. Our experimental analysis demonstrates the potential and challenges of applying LLMs in this domain, providing insights into their strengths and limitations in political science. This work contributes to the growing intersection of AI and political science, opening new avenues for research and practical applications in global governance. The UNBench Repository can be accessed at: https://github.com/yueqingliang1/UNBench.
Published: 2025

8. Using detailed single star and binary evolution models to probe the large observed luminosity spread of red supergiants in young open star clusters

Author: Wang, Chen, Patrick, Lee, Schootemeijer, Abel, de Mink, Selma E., Langer, Norbert, Britavskiy, Nikolay, Xu, Xiao-Tian, Bodensteiner, Julia, Laplace, Eva, Valli, Ruggero, Vigna-Gómez, Alejandro, Klencki, Jakub, Justham, Stephen, Johnston, Cole, and Ma, Jing-ze
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: Red supergiants (RSGs) represent a late evolutionary stage of massive stars. Recent observations reveal that the observed luminosity range of RSGs in young open clusters is wider than expected from single star evolution models. Binary evolution effects have been suggested as a possible explanation. Here, we analyse 3670 detailed binary-evolution models, as well as corresponding single-star models, to probe the contribution of binary mass transfer and binary mergers on the luminosity distribution of RSGs in star clusters with ages up to 100 Myr. We confirm that the expected luminosity range of RSGs in a coeval population can span a factor of ten, as a consequence of mergers between two main-sequence stars, which reproduces the observed red supergiant luminosity ranges in rich clusters well. While the luminosity increase as consequence of mass transfer is more limited, it may help to increase the number of overluminous RSGs. However, our results also demonstrate that binary effects alone are insufficient to account for the number of RSGs found with luminosities of up to three times those predicted by current single-star models. We discuss observational accuracy, rotational mixing, age spread, and intrinsic RSG variability as possible explanations. Further observations of RSGs in young open clusters, in particular studies of their intrinsic brightness variability, appear crucial for disentangling these effects., Comment: 24 pages, 19 figures. Accepted for Publication in Astrophysical Journal Letters (ApJL)
Published: 2025

9. ChineseSimpleVQA -- 'See the World, Discover Knowledge': A Chinese Factuality Evaluation for Large Vision Language Models

Author: Gu, Jihao, Wang, Yingyao, Bu, Pi, Wang, Chen, Wang, Ziming, Song, Tengtao, Wei, Donglai, Yuan, Jiale, Zhao, Yingxiu, He, Yancheng, Li, Shilong, Liu, Jiaheng, Cao, Meng, Song, Jun, Tan, Yingshui, Li, Xiang, Su, Wenbo, Zheng, Zhicheng, Zhu, Xiaoyong, and Zheng, Bo
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: The evaluation of factual accuracy in large vision language models (LVLMs) has lagged behind their rapid development, making it challenging to fully reflect these models' knowledge capacity and reliability. In this paper, we introduce the first factuality-based visual question-answering benchmark in Chinese, named ChineseSimpleVQA, aimed at assessing the visual factuality of LVLMs across 8 major topics and 56 subtopics. The key features of this benchmark include a focus on the Chinese language, diverse knowledge types, a multi-hop question construction, high-quality data, static consistency, and easy-to-evaluate through short answers. Moreover, we contribute a rigorous data construction pipeline and decouple the visual factuality into two parts: seeing the world (i.e., object recognition) and discovering knowledge. This decoupling allows us to analyze the capability boundaries and execution mechanisms of LVLMs. Subsequently, we evaluate 34 advanced open-source and closed-source models, revealing critical performance gaps within this field., Comment: 24 pages, 21 figures
Published: 2025

10. Latent Swap Joint Diffusion for Long-Form Audio Generation

Author: Dai, Yusheng, Wang, Chenxi, Li, Chang, Wang, Chen, Du, Jun, Li, Kewei, Wang, Ruoyu, Ma, Jiefeng, Sun, Lei, and Gao, Jianqing
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Previous work on long-form audio generation using global-view diffusion or iterative generation demands significant training or inference costs. While recent advancements in multi-view joint diffusion for panoramic generation provide an efficient option, they struggle with spectrum generation with severe overlap distortions and high cross-view consistency costs. We initially explore this phenomenon through the connectivity inheritance of latent maps and uncover that averaging operations excessively smooth the high-frequency components of the latent map. To address these issues, we propose Swap Forward (SaFa), a frame-level latent swap framework that synchronizes multiple diffusions to produce a globally coherent long audio with more spectrum details in a forward-only manner. At its core, the bidirectional Self-Loop Latent Swap is applied between adjacent views, leveraging stepwise diffusion trajectory to adaptively enhance high-frequency components without disrupting low-frequency components. Furthermore, to ensure cross-view consistency, the unidirectional Reference-Guided Latent Swap is applied between the reference and the non-overlap regions of each subview during the early stages, providing centralized trajectory guidance. Quantitative and qualitative experiments demonstrate that SaFa significantly outperforms existing joint diffusion methods and even training-based long audio generation models. Moreover, we find that it also adapts well to panoramic generation, achieving comparable state-of-the-art performance with greater efficiency and model generalizability. Project page is available at https://swapforward.github.io/.
Published: 2025

11. Nearly Tight Bounds for Exploration in Streaming Multi-armed Bandits with Known Optimality Gap

Author: Karpov, Nikolai and Wang, Chen
Subjects: Computer Science - Machine Learning, Computer Science - Data Structures and Algorithms
Abstract: We investigate the sample-memory-pass trade-offs for pure exploration in multi-pass streaming multi-armed bandits (MABs) with the *a priori* knowledge of the optimality gap $\Delta_{[2]}$. Here, and throughout, the optimality gap $\Delta_{[i]}$ is defined as the mean reward gap between the best and the $i$-th best arms. A recent line of results by Jin, Huang, Tang, and Xiao [ICML'21] and Assadi and Wang [COLT'24] have shown that if there is no known $\Delta_{[2]}$, a pass complexity of $\Theta(\log(1/\Delta_{[2]}))$ (up to $\log\log(1/\Delta_{[2]})$ terms) is necessary and sufficient to obtain the *worst-case optimal* sample complexity of $O(n/\Delta^{2}_{[2]})$ with a single-arm memory. However, our understanding of multi-pass algorithms with known $\Delta_{[2]}$ is still limited. Here, the key open problem is how many passes are required to achieve the complexity, i.e., $O( \sum_{i=2}^{n}1/\Delta^2_{[i]})$ arm pulls, with a sublinear memory size. In this work, we show that the ``right answer'' for the question is $\Theta(\log{n})$ passes (up to $\log\log{n}$ terms). We first present a lower bound, showing that any algorithm that finds the best arm with slightly sublinear memory -- a memory of $o({n}/{\text{polylog}({n})})$ arms -- and $O(\sum_{i=2}^{n}{1}/{\Delta^{2}_{[i]}}\cdot \log{(n)})$ arm pulls has to make $\Omega(\frac{\log{n}}{\log\log{n}})$ passes over the stream. We then show a nearly-matching algorithm that assuming the knowledge of $\Delta_{[2]}$, finds the best arm with $O( \sum_{i=2}^{n}1/\Delta^2_{[i]} \cdot \log{n})$ arm pulls and a *single arm* memory., Comment: AAAI 2025
Published: 2025

12. VL-Nav: Real-time Vision-Language Navigation with Spatial Reasoning

Author: Du, Yi, Fu, Taimeng, Chen, Zhuoqun, Li, Bowen, Su, Shaoshu, Zhao, Zhipeng, and Wang, Chen
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Vision-language navigation in unknown environments is crucial for mobile robots. In scenarios such as household assistance and rescue, mobile robots need to understand a human command, such as "find a person wearing black". We present a novel vision-language navigation (VL-Nav) system that integrates efficient spatial reasoning on low-power robots. Unlike prior methods that rely on a single image-level feature similarity to guide a robot, our method integrates pixel-wise vision-language features with curiosity-driven exploration. This approach enables robust navigation to human-instructed instances across diverse environments. We deploy VL-Nav on a four-wheel mobile robot and evaluate its performance through comprehensive navigation tasks in both indoor and outdoor environments, spanning different scales and semantic complexities. Remarkably, VL-Nav operates at a real-time frequency of 30 Hz with a Jetson Orin NX, highlighting its ability to conduct efficient vision-language navigation. Results show that VL-Nav achieves an overall success rate of 86.3%, outperforming previous methods by 44.15%.
Published: 2025

13. Data Fusion for Full-Range Response Reconstruction via Diffusion Models

Author: Feng, Wingho, Li, Quanwang, Wang, Chen, and Fan, Jian-sheng
Subjects: Computer Science - Computational Engineering, Finance, and Science
Abstract: Accurately capturing the full-range response of structures is crucial in structural health monitoring (SHM) for ensuring safety and operational integrity. However, limited sensor deployment due to cost, accessibility, or scale often hinders comprehensive monitoring. This paper presents a novel data fusion framework utilizing diffusion models to reconstruct the full-range structural response from sparse and heterogeneous sensor measurements. We incorporate Diffusion Posterior Sampling (DPS) into the reconstruction framework, using sensor measurements as probabilistic constraints to guide the sampling process. A lightweight neural network serves as the surrogate forward model within the DPS algorithm, which maps full-range structural responses to local sensor data. This approach enables flexibility in sensor configurations while reducing computational costs. The proposed framework is validated on a steel plate shear wall exhibiting nonlinear responses. Comparative experiments are conducted with three forward models. Among these, the neural network surrogate model achieves a desirable reconstruction accuracy, with a weighted mean absolute percentage error (WMAPE) as low as 1.57%, while also demonstrating superior adaptability and computational efficiency. Additional experiments explore the impact of sensor placement strategies and noise levels. Results show that even under sparse measurements or high noise conditions, the WMAPE remains capped at 15%, demonstrating the robustness in challenging scenarios. The proposed framework shows new possibilities for probabilistic modeling and decision-making in SHM, offering a novel data fusion approach for full-range monitoring of structures.
Published: 2025

14. Solving the Content Gap in Roblox Game Recommendations: LLM-Based Profile Generation and Reranking

Author: Wang, Chen, Wei, Xiaokai, Jiang, Yexi, Ong, Frank, Gao, Kevin, Yu, Xiao, Hui, Zheng, Yoon, Se-eun, Yu, Philip, and Gong, Michelle
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: With the vast and dynamic user-generated content on Roblox, creating effective game recommendations requires a deep understanding of game content. Traditional recommendation models struggle with the inconsistent and sparse nature of game text features such as titles and descriptions. Recent advancements in large language models (LLMs) offer opportunities to enhance recommendation systems by analyzing in-game text data. This paper addresses two challenges: generating high-quality, structured text features for games without extensive human annotation, and validating these features to ensure they improve recommendation relevance. We propose an approach that extracts in-game text and uses LLMs to infer attributes such as genre and gameplay objectives from raw player interactions. Additionally, we introduce an LLM-based re-ranking mechanism to assess the effectiveness of the generated text features, enhancing personalization and user satisfaction. Beyond recommendations, our approach supports applications such as user engagement-based integrity detection, already deployed in production. This scalable framework demonstrates the potential of in-game text understanding to improve recommendation quality on Roblox and adapt recommendations to its unique, user-generated ecosystem.
Published: 2025

15. From Data to Action: Charting A Data-Driven Path to Combat Antimicrobial Resistance

Author: Fu, Qian, Zhang, Yuzhe, Shu, Yanfeng, Ding, Ming, Yao, Lina, and Wang, Chen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Quantitative Biology - Populations and Evolution
Abstract: Antimicrobial-resistant (AMR) microbes are a growing challenge in healthcare, rendering modern medicines ineffective. AMR arises from antibiotic production and bacterial evolution, but quantifying its transmission remains difficult. With increasing AMR-related data, data-driven methods offer promising insights into its causes and treatments. This paper reviews AMR research from a data analytics and machine learning perspective, summarizing the state-of-the-art and exploring key areas such as surveillance, prediction, drug discovery, stewardship, and driver analysis. It discusses data sources, methods, and challenges, emphasizing standardization and interoperability. Additionally, it surveys statistical and machine learning techniques for AMR analysis, addressing issues like data noise and bias. Strategies for denoising and debiasing are highlighted to enhance fairness and robustness in AMR research. The paper underscores the importance of interdisciplinary collaboration and awareness of data challenges in advancing AMR research, pointing to future directions for innovation and improved methodologies., Comment: 29 pages, 3 figures, 4 tables, survey paper
Published: 2025

16. Extremal eigenvectors of sparse random matrices

Author: He, Yukun, Huang, Jiaoyang, and Wang, Chen
Subjects: Mathematics - Probability, Mathematical Physics, 05C80, 05C50, 60B20, 15B52
Abstract: We consider a class of sparse random matrices, which includes the adjacency matrix of Erd\H{o}s-R\'enyi graph ${\bf G}(N,p)$. For $N^{-1+o(1)}\leq p\leq 1/2$, we show that the non-trivial edge eigenvectors are asymptotically jointly normal. The main ingredient of the proof is an algorithm that directly computes the joint eigenvector distributions, without comparisons with GOE. The method is applicable in general. As an illustration, we also use it to prove the normal fluctuation in quantum ergodicity at the edge for Wigner matrices. Another ingredient of the proof is the isotropic local law for sparse matrices, which at the same time improves several existing results.
Published: 2025

17. Study on the Distribution Amplitude of the Scalar Meson $K_0^*(1430)$

Author: Wang, Chen, Ma, Yuanyuan, Wang, Zhijun, and Sun, Yanjun
Subjects: High Energy Physics - Phenomenology, Nuclear Theory
Abstract: Based on sum rules, we explore the twist-2 distribution amplitude of the $K_0^*(1430)$ meson, treating it as the ground state of a quark-antiquark system. We posit that the spacetime distance $x$ should be infinitesimally close to the quark separation $ z$. By incorporating quark distance corrections, with $x^2 \approx z^2 \approx x z$, the calculated moments yield additional insights. Moreover, we employ light-cone sum rules to compute the form factors for the semi-leptonic decay process $B_s \rightarrow K$. The reliability of the computed distribution amplitude is confirmed through its comparison with the form factor., Comment: 14 pages,3 figures
Published: 2025

18. AnyNav: Visual Neuro-Symbolic Friction Learning for Off-road Navigation

Author: Fu, Taimeng, Zhan, Zitong, Zhao, Zhipeng, Su, Shaoshu, Lin, Xiao, Esfahani, Ehsan Tarkesh, Dantu, Karthik, Chowdhury, Souma, and Wang, Chen
Subjects: Computer Science - Robotics
Abstract: Off-road navigation is essential for a wide range of applications in field robotics such as planetary exploration and disaster response. However, it remains an unresolved challenge due to the unstructured environments and inherent complexity of terrain-vehicle interactions. Traditional physics-based methods struggle to accurately model the nonlinear dynamics of these interactions, while data-driven approaches often suffer from overfitting to specific motion patterns, vehicle sizes, and types, limiting their generalizability. To overcome these challenges, we introduce a vision-based friction estimation framework grounded in neuro-symbolic principles, integrating neural networks for visual perception with symbolic reasoning for physical modeling. This enables significantly improved generalization abilities through explicit physical reasoning incorporating the predicted friction. Additionally, we develop a physics-informed planner that leverages the learned friction coefficient to generate physically feasible and efficient paths, along with corresponding speed profiles. We refer to our approach as AnyNav and evaluate it in both simulation and real-world experiments, demonstrating its utility and robustness across various off-road scenarios and multiple types of four-wheeled vehicles. These results mark an important step toward developing neuro-symbolic spatial intelligence to reason about complex, unstructured environments and enable autonomous off-road navigation in challenging scenarios. Video demonstrations are available at https://sairlab.org/anynav/, where the source code will also be released.
Published: 2025

19. Solving Constrained Optimization Problems Using Hybrid Qubit-Qumode Quantum Devices

Author: Dutta, Rishab, Allen, Brandon, Vu, Nam P., Xu, Chuzhi, Liu, Kun, Miao, Fei, Wang, Bing, Surana, Amit, Wang, Chen, Ding, Yongshan, and Batista, Victor S.
Subjects: Quantum Physics
Abstract: Optimization challenges span a wide array of fields, from logistics and scheduling to finance, materials science, and drug discovery. Among these, Quadratic Unconstrained Binary Optimization (QUBO) problems are especially significant due to their computational complexity and their potential as a key application for quantum computing. In this work, we introduce an approach for solving QUBO problems using hybrid qubit-qumode bosonic quantum computers$\unicode{x2014}$devices that manipulate and measure the quantum states of light within microwave cavity resonators. We map problems with soft and hard constraints onto the Hamiltonian of a hybrid quantum system, consisting of a single qubit coupled to multiple qumodes. The optimal solution is encoded in the ground state of the system, which is revealed by photon-number measurements. Trial states are prepared through universal qubit-qumode circuits, employing echoed conditional displacement (ECD) gates in combination with qubit rotations. Our approach demonstrates the immense potential of hybrid quantum systems, showcasing their ability to efficiently tackle complex optimization problems in both academia and industry., Comment: 12 pages, 11 figures
Published: 2025

20. Kilometer-Scale E3SM Land Model Simulation over North America

Author: Wang, Dali, Wang, Chen, Cao, Qinglei, Schwartz, Peter, Yuan, Fengming, Krishna, Jayesh, Wu, Danqing, Ricciuto, Danial, Thornton, Peter, Kao, Shih-Chieh, Thornton, Michele, and Mohror, Kathryn
Subjects: Computer Science - Computational Engineering, Finance, and Science
Abstract: The development of a kilometer-scale E3SM Land Model (km-scale ELM) is an integral part of the E3SM project, which seeks to advance energy-related Earth system science research with state-of-the-art modeling and simulation capabilities on exascale computing systems. Through the utilization of high-fidelity data products, such as atmospheric forcing and soil properties, the km-scale ELM plays a critical role in accurately modeling geographical characteristics and extreme weather occurrences. The model is vital for enhancing our comprehension and prediction of climate patterns, as well as their effects on ecosystems and human activities. This study showcases the first set of full-capability, km-scale ELM simulations over various computational domains, including simulations encompassing 21.6 million land gridcells, reflecting approximately 21.5 million square kilometers of North America at a 1 km x 1 km resolution. We present the largest km-scale ELM simulation using up to 100,800 CPU cores across 2,400 nodes. This continental-scale simulation is 300 times larger than any previous studies, and the computational resources used are about 400 times larger than those used in prior efforts. Both strong and weak scaling tests have been conducted, revealing exceptional performance efficiency and resource utilization. The km-scale ELM uses the common E3SM modeling infrastructure and a general data toolkit known as KiloCraft. Consequently, it can be readily adapted for both fully-coupled E3SM simulations and data-driven simulations over specific areas, ranging from a single gridcell to the entire North America.
Published: 2025

21. Constrained Coding for Composite DNA: Channel Capacity and Efficient Constructions

Author: Nguyen, Tuan Thanh, Wang, Chen, Cai, Kui, Zhang, Yiwei, and Yakhini, Zohar
Subjects: Computer Science - Information Theory
Abstract: Composite DNA is a recent novel method to increase the information capacity of DNA-based data storage above the theoretical limit of 2 bits/symbol. In this method, every composite symbol does not store a single DNA nucleotide but a mixture of the four nucleotides in a predetermined ratio. By using different mixtures and ratios, the alphabet can be extended to have much more than four symbols in the naive approach. While this method enables higher data content per synthesis cycle, potentially reducing the DNA synthesis cost, it also imposes significant challenges for accurate DNA sequencing since the base-level errors can easily change the mixture of bases and their ratio, resulting in changes to the composite symbols. With this motivation, we propose efficient constrained coding techniques to enforce the biological constraints, including the runlength-limited constraint and the GC-content constraint, into every DNA synthesized oligo, regardless of the mixture of bases in each composite letter and their corresponding ratio. Our goals include computing the capacity of the constrained channel, constructing efficient encoders/decoders, and providing the best options for the composite letters to obtain capacity-approaching codes. For certain codes' parameters, our methods incur only one redundant symbol.
Published: 2025

22. Refinements of Van Hamme's (E.2) and (F.2) supercongruences and two supercongruences by Swisher

Author: Guo, Victor J. W. and Wang, Chen
Subjects: Mathematics - Number Theory, Mathematics - Combinatorics
Abstract: In 1997, Van Hamme proposed 13 supercongruences on truncated hypergeometric series. Van Hamme's (B.2) supercongruence was first confirmed by Mortenson and received a WZ proof by Zudilin later. In 2012, using the WZ method again, Sun extended Van Hamme's (B.2) supercongruence to the modulus $p^4$ case, where $p$ is an odd prime. In this paper, by using a more general WZ pair, we generalize Hamme's (E.2) and (F.2) supercongruences, as well as two supercongruences by Swisher, to the modulus $p^4$ case. Our generalizations of these supercongruences are related to Euler polynomials. We also put forward a relevant conjecture on $q$-congruences for further study., Comment: 15 pages
Published: 2025

23. Balance Divergence for Knowledge Distillation

Author: Qi, Yafei, Wang, Chen, Zhang, Zhaoning, Liu, Yaping, and Zhang, Yongmin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Knowledge distillation has been widely adopted in computer vision task processing, since it can effectively enhance the performance of lightweight student networks by leveraging the knowledge transferred from cumbersome teacher networks. Most existing knowledge distillation methods utilize Kullback-Leibler divergence to mimic the logit output probabilities between the teacher network and the student network. Nonetheless, these methods may neglect the negative parts of the teacher's ''dark knowledge'' because the divergence calculations may ignore the effect of the minute probabilities from the teacher's logit output. This deficiency may lead to suboptimal performance in logit mimicry during the distillation process and result in an imbalance of information acquired by the student network. In this paper, we investigate the impact of this imbalance and propose a novel method, named Balance Divergence Distillation. By introducing a compensatory operation using reverse Kullback-Leibler divergence, our method can improve the modeling of the extremely small values in the negative from the teacher and preserve the learning capacity for the positive. Furthermore, we test the impact of different temperature coefficients adjustments, which may conducted to further balance for knowledge transferring. We evaluate the proposed method on several computer vision tasks, including image classification and semantic segmentation. The evaluation results show that our method achieves an accuracy improvement of 1%~3% for lightweight students on both CIFAR-100 and ImageNet dataset, and a 4.55% improvement in mIoU for PSP-ResNet18 on the Cityscapes dataset. The experiments show that our method is a simple yet highly effective solution that can be smoothly applied to different knowledge distillation methods.
Published: 2025

24. LPRnet: A self-supervised registration network for LiDAR and photogrammetric point clouds

Author: Wang, Chen, Gu, Yanfeng, and Li, Xian
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: LiDAR and photogrammetry are active and passive remote sensing techniques for point cloud acquisition, respectively, offering complementary advantages and heterogeneous. Due to the fundamental differences in sensing mechanisms, spatial distributions and coordinate systems, their point clouds exhibit significant discrepancies in density, precision, noise, and overlap. Coupled with the lack of ground truth for large-scale scenes, integrating the heterogeneous point clouds is a highly challenging task. This paper proposes a self-supervised registration network based on a masked autoencoder, focusing on heterogeneous LiDAR and photogrammetric point clouds. At its core, the method introduces a multi-scale masked training strategy to extract robust features from heterogeneous point clouds under self-supervision. To further enhance registration performance, a rotation-translation embedding module is designed to effectively capture the key features essential for accurate rigid transformations. Building upon the robust representations, a transformer-based architecture seamlessly integrates local and global features, fostering precise alignment across diverse point cloud datasets. The proposed method demonstrates strong feature extraction capabilities for both LiDAR and photogrammetric point clouds, addressing the challenges of acquiring ground truth at the scene level. Experiments conducted on two real-world datasets validate the effectiveness of the proposed method in solving heterogeneous point cloud registration problems., Comment: 12 pages, 9 figures, 5 tables
Published: 2025

25. Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation

Author: Meng, Xuyi, Wang, Chen, Lei, Jiahui, Daniilidis, Kostas, Gu, Jiatao, and Liu, Lingjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent advances in 2D image generation have achieved remarkable quality,largely driven by the capacity of diffusion models and the availability of large-scale datasets. However, direct 3D generation is still constrained by the scarcity and lower fidelity of 3D datasets. In this paper, we introduce Zero-1-to-G, a novel approach that addresses this problem by enabling direct single-view generation on Gaussian splats using pretrained 2D diffusion models. Our key insight is that Gaussian splats, a 3D representation, can be decomposed into multi-view images encoding different attributes. This reframes the challenging task of direct 3D generation within a 2D diffusion framework, allowing us to leverage the rich priors of pretrained 2D diffusion models. To incorporate 3D awareness, we introduce cross-view and cross-attribute attention layers, which capture complex correlations and enforce 3D consistency across generated splats. This makes Zero-1-to-G the first direct image-to-3D generative model to effectively utilize pretrained 2D diffusion priors, enabling efficient training and improved generalization to unseen objects. Extensive experiments on both synthetic and in-the-wild datasets demonstrate superior performance in 3D object generation, offering a new approach to high-quality 3D generation.
Published: 2025

26. Recorder: Comprehensive Parallel I/O Tracing and Analysis

Author: Wang, Chen, Yildirim, Izzet, Devarajan, Hariharan, Mohror, Kathryn, and Snir, Marc
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Performance
Abstract: This paper presents Recorder, a parallel I/O tracing tool designed to capture comprehensive I/O information on HPC applications. Recorder traces I/O calls across various I/O layers, storing all function parameters for each captured call. The volume of stored information scales linearly the application's execution scale. To address this, we present a sophisticated pattern-recognition-based compression algorithm. This algorithm identifies and compresses recurring I/O patterns both within individual processes and across multiple processes, significantly reducing space and time overheads. We evaluate the proposed compression algorithm using I/O benchmarks and real-world applications, demonstrating that Recorder can store more information while requiring approximately 12x less storage space compared to its predecessor. Notably, for applications with typical parallel I/O patterns, Recorder achieves a constant trace size regardless of execution scale. Additionally, a comparison with the profiling tool Darshan shows that Recorder captures detailed I/O information without incurring substantial overhead. The richer data collected by Recorder enables new insights and facilitates more in-depth I/O studies, offering valuable contributions to the I/O research community., Comment: 29 pages. Under Review. Submitted to the Journal of Supercomputing
Published: 2025

27. ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking

Author: Zhang, Tingyang, Wang, Chen, Dou, Zhiyang, Gao, Qingzhe, Lei, Jiahui, Chen, Baoquan, and Liu, Lingjie
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose ProTracker, a novel framework for robust and accurate long-term dense tracking of arbitrary points in videos. The key idea of our method is incorporating probabilistic integration to refine multiple predictions from both optical flow and semantic features for robust short-term and long-term tracking. Specifically, we integrate optical flow estimations in a probabilistic manner, producing smooth and accurate trajectories by maximizing the likelihood of each prediction. To effectively re-localize challenging points that disappear and reappear due to occlusion, we further incorporate long-term feature correspondence into our flow predictions for continuous trajectory generation. Extensive experiments show that ProTracker achieves the state-of-the-art performance among unsupervised and self-supervised approaches, and even outperforms supervised methods on several benchmarks. Our code and model will be publicly available upon publication., Comment: Project page: https://michaelszj.github.io/protracker
Published: 2025

28. Gravity potential determination based on China Space Station Dual-frequency microwave links frequency transfer

Author: Zhang, Peng Fei, Wang, Chen Xiang, Li, Li Hong, Wang, Lei, Shen, Zi Yu, Xu, Rui, Ning, An, Ruby, Abdelrahim, and Shen, Wen-Bin
Subjects: Physics - Geophysics, General Relativity and Quantum Cosmology
Abstract: The China Space Station (CSS) is currently in orbit and carries the high-precision optical atomic clock with stability of approximately $2.0 \times 10^{-15} / \sqrt{\tau}$ in its experiment module. We have developed a model to determine the gravity potential (GP) based on the gravity frequency shift equation and have created both one-way and dual-frequency transfer models up to $c^{-4}$. These models consider effects from the troposphere, ionosphere, and solid Earth tides. The proposed model is suitable for measurements at the magnitude of $10^{-19}$. Based on the CSS mission, we conducted the simulation experiments. The results indicate that when processing the simulation frequency signal using the proposed model, we can obtain the GP with the accuracies of $ (1.13\pm0.71)\,\mathrm{m^2/s^2}$, $ (0.09\pm0.89)\,\mathrm{m^2/s^2}$, and $(0.66\pm1.18)\,\mathrm{m^2/s^2}$ for cutoff elevation angles of $5^{\circ}$, $10^{\circ}$ and $15^{\circ}$, respectively. With the high-precision optical atomic clock onboard the CSS, the proposed model enables us to measure the GP differences in the magnitude of centimeter-level accuracy.
Published: 2024

29. FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs

Author: Zhou, Yuanchang, Hu, Siyu, Wang, Chen, Wang, Lin-Wang, Tan, Guangming, and Jia, Weile
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning
Abstract: Graph neural network universal interatomic potentials (GNN-UIPs) have demonstrated remarkable generalization and transfer capabilities in material discovery and property prediction. These models can accelerate molecular dynamics (MD) simulation by several orders of magnitude while maintaining \textit{ab initio} accuracy, making them a promising new paradigm in material simulations. One notable example is Crystal Hamiltonian Graph Neural Network (CHGNet), pretrained on the energies, forces, stresses, and magnetic moments from the MPtrj dataset, representing a state-of-the-art GNN-UIP model for charge-informed MD simulations. However, training the CHGNet model is time-consuming(8.3 days on one A100 GPU) for three reasons: (i) requiring multi-layer propagation to reach more distant atom information, (ii) requiring second-order derivatives calculation to finish weights updating and (iii) the implementation of reference CHGNet does not fully leverage the computational capabilities. This paper introduces FastCHGNet, an optimized CHGNet, with three contributions: Firstly, we design innovative Force/Stress Readout modules to decompose Force/Stress prediction. Secondly, we adopt massive optimizations such as kernel fusion, redundancy bypass, etc, to exploit GPU computation power sufficiently. Finally, we extend CHGNet to support multiple GPUs and propose a load-balancing technique to enhance GPU utilization. Numerical results show that FastCHGNet reduces memory footprint by a factor of 3.59. The final training time of FastCHGNet can be decreased to \textbf{1.53 hours} on 32 GPUs without sacrificing model accuracy.
Published: 2024

30. Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey

Author: Ather, Hammad, Bez, Jean Luca, Wang, Chen, Childs, Hank, Malony, Allen D., and Byna, Suren
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Performance
Abstract: Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex. This complexity can lead to large I/O overheads and overall performance degradation. These inefficiencies are often mitigated using tools and techniques for characterizing, analyzing, and optimizing the I/O behavior of HPC applications. That said, the myriad number of tools and techniques available makes it challenging to navigate to the best approach. In response, this paper surveys 131 papers from the ACM Digital Library, IEEE Xplore, and other reputable journals to provide a comprehensive analysis, synthesized in the form of a taxonomy, of the current landscape of parallel I/O characterization, analysis, and optimization of large-scale HPC systems. We anticipate that this taxonomy will serve as a valuable resource for enhancing I/O performance of HPC applications., Comment: 31 pages, 1 figure, 7 tables
Published: 2024

31. Triple real-emission contribution to the zero-jettiness soft function at N3LO in QCD

Author: Baranowski, Daniel, Delto, Maximilian, Melnikov, Kirill, Pikelner, Andrey, and Wang, Chen-Yu
Subjects: High Energy Physics - Phenomenology
Abstract: Recently, we have presented the result for the zero-jettiness soft function at next-to-next-to-next-to-leading order (N3LO) in perturbative QCD [arXiv:2409.11042], without providing technical details of the calculation. The goal of this paper is to describe the most important element of that computation, the triple real-emission contribution. We present a detailed discussion of the many technical aspects of the calculation, for which a number of methodological innovations was required. Although some elements of the calculation were discussed earlier [arXiv:2004.03285,arXiv:2206.12323,arXiv:2111.13594,arXiv:2204.09459,arXiv:2401.05245], this paper is intended to provide a complete summary of the methods used in the computation of the triple real-emission contribution to the soft function., Comment: 75 pages, 5 figures
Published: 2024

32. Transport resistance dominates the fill factor losses in record organic solar cells

Author: Wang, Chen, MacKenzie, Roderick C. I., Würfel, Uli, Neher, Dieter, Kirchartz, Thomas, Deibel, Carsten, and Saladina, Maria
Subjects: Condensed Matter - Materials Science
Abstract: Organic photovoltaics are a promising solar cell technology well-suited to mass production using roll-to-roll processes. The efficiency of lab-scale solar cells has exceeded 20% and considerable attention is currently being given to understanding and minimising the remaining loss mechanisms preventing higher efficiencies. While recent efficiency improvements are partly owed to reducing non-radiative recombination losses at open-circuit, the low fill factor due to a significant transport resistance is becoming the Achilles heel of organic photovoltaics. The term transport resistance refers to a voltage and light intensity dependent charge collection loss in low-mobility materials. In this Perspective, we demonstrate that even the highest efficiency organic solar cells reported to-date have significant performance losses that can be attributed to transport resistance and that lead to high fill factor losses. We provide a closer look at the transport resistance and the material properties influencing it. We describe how to experimentally characterise and quantify the transport resistance by providing easy to follow instructions. Furthermore, the causes and theory behind transport resistance are detailed. In particular, we integrate the relevant figures of merit and different viewpoints on the transport resistance. Finally, we outline strategies that can be followed to minimise these charge collection losses in future solar cells., Comment: Perspective (31 page, 13 figures)
Published: 2024
Full Text: View/download PDF

33. UITrans: Seamless UI Translation from Android to HarmonyOS

Author: Gong, Lina, Wang, Chen, Huang, Yujun, Cui, Di, and Wei, Mingqiang
Subjects: Computer Science - Software Engineering
Abstract: Seamless user interface (i.e., UI) translation has emerged as a pivotal technique for modern mobile developers, addressing the challenge of developing separate UI applications for Android and HarmonyOS platforms due to fundamental differences in layout structures and development paradigms. In this paper, we present UITrans, the first automated UI translation tool designed for Android to HarmonyOS. UITrans leverages an LLM-driven multi-agent reflective collaboration framework to convert Android XML layouts into HarmonyOS ArkUI layouts. It not only maps component-level and page-level elements to ArkUI equivalents but also handles project-level challenges, including complex layouts and interaction logic. Our evaluation of six Android applications demonstrates that our UITrans achieves translation success rates of over 90.1%, 89.3%, and 89.2% at the component, page, and project levels, respectively. UITrans is available at https://github.com/OpenSELab/UITrans and the demo video can be viewed at https://www.youtube.com/watch?v=iqKOSmCnJG0., Comment: 5 pages
Published: 2024

34. Establishing a New Benchmark in Quantum Computational Advantage with 105-qubit Zuchongzhi 3.0 Processor

Author: Gao, Dongxin, Fan, Daojin, Zha, Chen, Bei, Jiahao, Cai, Guoqing, Cai, Jianbin, Cao, Sirui, Zeng, Xiangdong, Chen, Fusheng, Chen, Jiang, Chen, Kefu, Chen, Xiawei, Chen, Xiqing, Chen, Zhe, Chen, Zhiyuan, Chen, Zihua, Chu, Wenhao, Deng, Hui, Deng, Zhibin, Ding, Pei, Ding, Xun, Ding, Zhuzhengqi, Dong, Shuai, Dong, Yupeng, Fan, Bo, Fu, Yuanhao, Gao, Song, Ge, Lei, Gong, Ming, Gui, Jiacheng, Guo, Cheng, Guo, Shaojun, Guo, Xiaoyang, He, Tan, Hong, Linyin, Hu, Yisen, Huang, He-Liang, Huo, Yong-Heng, Jiang, Tao, Jiang, Zuokai, Jin, Honghong, Leng, Yunxiang, Li, Dayu, Li, Dongdong, Li, Fangyu, Li, Jiaqi, Li, Jinjin, Li, Junyan, Li, Junyun, Li, Na, Li, Shaowei, Li, Wei, Li, Yuhuai, Li, Yuan, Liang, Futian, Liang, Xuelian, Liao, Nanxing, Lin, Jin, Lin, Weiping, Liu, Dailin, Liu, Hongxiu, Liu, Maliang, Liu, Xinyu, Liu, Xuemeng, Liu, Yancheng, Lou, Haoxin, Ma, Yuwei, Meng, Lingxin, Mou, Hao, Nan, Kailiang, Nie, Binghan, Nie, Meijuan, Ning, Jie, Niu, Le, Peng, Wenyi, Qian, Haoran, Rong, Hao, Rong, Tao, Shen, Huiyan, Shen, Qiong, Su, Hong, Su, Feifan, Sun, Chenyin, Sun, Liangchao, Sun, Tianzuo, Sun, Yingxiu, Tan, Yimeng, Tan, Jun, Tang, Longyue, Tu, Wenbing, Wan, Cai, Wang, Jiafei, Wang, Biao, Wang, Chang, Wang, Chen, Wang, Chu, Wang, Jian, Wang, Liangyuan, Wang, Rui, Wang, Shengtao, Wang, Xinzhe, Wei, Zuolin, Wei, Jiazhou, Wu, Dachao, Wu, Gang, Wu, Jin, Wu, Shengjie, Wu, Yulin, Xie, Shiyong, Xin, Lianjie, Xu, Yu, Xue, Chun, Yan, Kai, Yang, Weifeng, Yang, Xinpeng, Yang, Yang, Ye, Yangsen, Ye, Zhenping, Ying, Chong, Yu, Jiale, Yu, Qinjing, Yu, Wenhu, Zhan, Shaoyu, Zhang, Feifei, Zhang, Haibin, Zhang, Kaili, Zhang, Pan, Zhang, Wen, Zhang, Yiming, Zhang, Yongzhuo, Zhang, Lixiang, Zhao, Guming, Zhao, Peng, Zhao, Xianhe, Zhao, Xintao, Zhao, Youwei, Zhao, Zhong, Zheng, Luyuan, Zhou, Fei, Zhou, Liang, Zhou, Na, Zhou, Naibin, Zhou, Shifeng, Zhou, Shuang, Zhou, Zhengxiao, Zhu, Chengjun, Zhu, Qingling, Zou, Guihong, Zou, Haonan, Zhang, Qiang, Lu, Chao-Yang, Peng, Cheng-Zhi, Zhu, XiaoBo, and Pan, Jian-Wei
Subjects: Quantum Physics
Abstract: In the relentless pursuit of quantum computational advantage, we present a significant advancement with the development of Zuchongzhi 3.0. This superconducting quantum computer prototype, comprising 105 qubits, achieves high operational fidelities, with single-qubit gates, two-qubit gates, and readout fidelity at 99.90%, 99.62% and 99.18%, respectively. Our experiments with an 83-qubit, 32-cycle random circuit sampling on Zuchongzhi 3.0 highlight its superior performance, achieving one million samples in just a few hundred seconds. This task is estimated to be infeasible on the most powerful classical supercomputers, Frontier, which would require approximately $6.4\times 10^9$ years to replicate the task. This leap in processing power places the classical simulation cost six orders of magnitude beyond Google's SYC-67 and SYC-70 experiments [Nature 634, 328(2024)], firmly establishing a new benchmark in quantum computational advantage. Our work not only advances the frontiers of quantum computing but also lays the groundwork for a new era where quantum processors play an essential role in tackling sophisticated real-world challenges.
Published: 2024

35. iKap: Kinematics-aware Planning with Imperative Learning

Author: Li, Qihang, Chen, Zhuoqun, Zheng, Haoze, He, Haonan, Su, Shaoshu, Geng, Junyi, and Wang, Chen
Subjects: Computer Science - Robotics
Abstract: Trajectory planning in robotics aims to generate collision-free pose sequences that can be reliably executed. Recently, vision-to-planning systems have garnered increasing attention for their efficiency and ability to interpret and adapt to surrounding environments. However, traditional modular systems suffer from increased latency and error propagation, while purely data-driven approaches often overlook the robot's kinematic constraints. This oversight leads to discrepancies between planned trajectories and those that are executable. To address these challenges, we propose iKap, a novel vision-to-planning system that integrates the robot's kinematic model directly into the learning pipeline. iKap employs a self-supervised learning approach and incorporates the state transition model within a differentiable bi-level optimization framework. This integration ensures the network learns collision-free waypoints while satisfying kinematic constraints, enabling gradient back-propagation for end-to-end training. Our experimental results demonstrate that iKap achieves higher success rates and reduced latency compared to the state-of-the-art methods. Besides the complete system, iKap offers a visual-to-planning network that seamlessly integrates kinematics into various controllers, providing a robust solution for robots navigating complex and dynamic environments., Comment: 6 pages, 6 figures
Published: 2024

36. CRAFTS for HI cosmology: I. data analysis and preliminary results

Author: Yang, Wenxiu, Wolz, Laura, Li, Yichao, Hu, Wenkai, Cunnington, Steven, Grainge, Keith, Deng, Furen, Zuo, Shifan, Shu, Shuanghao, Zhao, Xinyang, Li, Di, Zheng, Zheng, Krčo, Marko, Zheng, Yinghui, Feng, Linjing, Zuo, Pei, Chen, Hao, Jiang, Xue-Jian, Wang, Chen, Wang, Pei, Miao, Chen-Chen, Wang, Yougang, and Chen, Xuelei
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: We present the results from calibrating the data of the Commensal Radio Astronomy FAST Survey (CRAFTS) for \HI intensity mapping by the Five-hundred-meter Aperture Spherical Radio Telescope (FAST). Using 70 hours of drift-scan observation with the L-band (1.05-1.45GHz) 19-beam receiver, we obtain the data covering $270\,\rm deg^2$ sky area. We employ both the pulsar backend and the spectrum backend to calibrate the spectral time-ordered-data (TOD) before projecting them onto HEALPix maps. We produce calibrated TOD with frequency resolution of 30 kHz and time resolution of 1 s and the map data-cube with frequency resolution of 30kHz and spatial resolution of $2.95\,\rm arcmin^2$. We carefully examine the pointing errors, noise overflow, RFI contamination and their effect on the data quality. The resulting noise level is $\sim$ 5.7 mJy for the calibrated TOD and 1.6 mJy for the map, which is consistent with the theoretical predictions within 5\% at RFI-free channels. We also validate the data by Principal Components Analysis (PCA) and find most foreground components are concentrated in the first 30 modes. We identify 447 isolated bright continuum sources in our data matching the NRAO-VLA Sky Survey (NVSS) catalog, with relative flux error of 8.3\% for TOD and 11.9\% for the map-level. We also measure the \HI emission of 90 galaxies with redshift $z<0.07$ and compare with \HI-MaNGA spectra from the Green Bank Telescope (GBT), yielding an overall relative error of the \HI integral flux of 16.7\%. Our results confirm the feasibility of conducting cosmological \HI signal detection with CRAFTS., Comment: 30 pages, 30 figures, and 3 tables
Published: 2024

37. An Efficient Scene Coordinate Encoding and Relocalization Method

Author: Xu, Kuan, Jiang, Zeyu, Cao, Haozhi, Yuan, Shenghai, Wang, Chen, and Xie, Lihua
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Scene Coordinate Regression (SCR) is a visual localization technique that utilizes deep neural networks (DNN) to directly regress 2D-3D correspondences for camera pose estimation. However, current SCR methods often face challenges in handling repetitive textures and meaningless areas due to their reliance on implicit triangulation. In this paper, we propose an efficient scene coordinate encoding and relocalization method. Compared with the existing SCR methods, we design a unified architecture for both scene encoding and salient keypoint detection, enabling our system to focus on encoding informative regions, thereby significantly enhancing efficiency. Additionally, we introduce a mechanism that leverages sequential information during both map encoding and relocalization, which strengthens implicit triangulation, particularly in repetitive texture environments. Comprehensive experiments conducted across indoor and outdoor datasets demonstrate that the proposed system outperforms other state-of-the-art (SOTA) SCR methods. Our single-frame relocalization mode improves the recall rate of our baseline by 6.4% and increases the running speed from 56Hz to 90Hz. Furthermore, our sequence-based mode increases the recall rate by 11% while maintaining the original efficiency., Comment: 8 pages, 6 figures
Published: 2024

38. Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems

Author: Li, Rui, Wang, Song, and Wang, Chen
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Numerical Analysis
Abstract: Preconditioning techniques are crucial for enhancing the efficiency of solving large-scale linear equation systems that arise from partial differential equation (PDE) discretization. These techniques, such as Incomplete Cholesky factorization (IC) and data-driven neural network methods, accelerate the convergence of iterative solvers like Conjugate Gradient (CG) by approximating the original matrices. This paper introduces a novel approach that integrates Graph Neural Network (GNN) with traditional IC, addressing the shortcomings of direct generation methods based on GNN and achieving significant improvements in computational efficiency and scalability. Experimental results demonstrate an average reduction in iteration counts by 24.8% compared to IC and a two-order-of-magnitude increase in training scale compared to previous methods. A three-dimensional static structural analysis utilizing finite element methods was validated on training sparse matrices of up to 5 million dimensions and inference scales of up to 10 million. Furthermore, the approach demon-strates robust generalization capabilities across scales, facilitating the effective acceleration of CG solvers for large-scale linear equations using small-scale data on modest hardware. The method's robustness and scalability make it a practical solution for computational science.
Published: 2024

39. Graph-Sequential Alignment and Uniformity: Toward Enhanced Recommendation Systems

Author: Cao, Yuwei, Yang, Liangwei, Liu, Zhiwei, Liu, Yuqing, Wang, Chen, Liang, Yueqing, Peng, Hao, and Yu, Philip S.
Subjects: Computer Science - Information Retrieval
Abstract: Graph-based and sequential methods are two popular recommendation paradigms, each excelling in its domain but lacking the ability to leverage signals from the other. To address this, we propose a novel method that integrates both approaches for enhanced performance. Our framework uses Graph Neural Network (GNN)-based and sequential recommenders as separate submodules while sharing a unified embedding space optimized jointly. To enable positive knowledge transfer, we design a loss function that enforces alignment and uniformity both within and across submodules. Experiments on three real-world datasets demonstrate that the proposed method significantly outperforms using either approach alone and achieves state-of-the-art results. Our implementations are publicly available at https://github.com/YuweiCao-UIC/GSAU.git., Comment: Accepted to The Web Conference 2025
Published: 2024
Full Text: View/download PDF

40. Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey

Author: Zhou, Wei, Zhao, Lei, Zhang, Runyu, Cui, Yifan, Huang, Hongpu, Qie, Kun, and Wang, Chen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Traffic Surveillance Systems (TSS) have become increasingly crucial in modern intelligent transportation systems, with vision-based technologies playing a central role for scene perception and understanding. While existing surveys typically focus on isolated aspects of TSS, a comprehensive analysis bridging low-level and high-level perception tasks, particularly considering emerging technologies, remains lacking. This paper presents a systematic review of vision-based technologies in TSS, examining both low-level perception tasks (object detection, classification, and tracking) and high-level perception applications (parameter estimation, anomaly detection, and behavior understanding). Specifically, we first provide a detailed methodological categorization and comprehensive performance evaluation for each task. Our investigation reveals five fundamental limitations in current TSS: perceptual data degradation in complex scenarios, data-driven learning constraints, semantic understanding gaps, sensing coverage limitations and computational resource demands. To address these challenges, we systematically analyze five categories of potential solutions: advanced perception enhancement, efficient learning paradigms, knowledge-enhanced understanding, cooperative sensing frameworks and efficient computing frameworks. Furthermore, we evaluate the transformative potential of foundation models in TSS, demonstrating their unique capabilities in zero-shot learning, semantic understanding, and scene generation. This review provides a unified framework bridging low-level and high-level perception tasks, systematically analyzes current limitations and solutions, and presents a structured roadmap for integrating emerging technologies, particularly foundation models, to enhance TSS capabilities.
Published: 2024

41. TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension

Author: Qiu, Zipeng, Peng, You, He, Guangxin, Yuan, Binhang, and Wang, Chen
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Information Retrieval
Abstract: The advent of large language models (LLMs) has unlocked great opportunities in complex data management tasks, particularly in question answering (QA) over complicated multi-table relational data. Despite significant progress, systematically evaluating LLMs on multi-table QA remains a critical challenge due to the inherent complexity of analyzing heterogeneous table structures and potential large scale of serialized relational data. Existing benchmarks primarily focus on single-table QA, failing to capture the intricacies of reasoning across multiple relational tables, as required in real-world domains such as finance, healthcare, and e-commerce. To address this gap, we present TQA-Bench, a new multi-table QA benchmark designed to evaluate the capabilities of LLMs in tackling complex QA tasks over relational data. Our benchmark incorporates diverse relational database instances sourced from real-world public datasets and introduces a flexible sampling mechanism to create tasks with varying multi-table context lengths, ranging from 8K to 64K tokens. To ensure robustness and reliability, we integrate symbolic extensions into the evaluation framework, enabling the assessment of LLM reasoning capabilities beyond simple data retrieval or probabilistic pattern matching. We systematically evaluate a range of LLMs, both open-source and closed-source, spanning model scales from 7 billion to 70 billion parameters. Our extensive experiments reveal critical insights into the performance of LLMs in multi-table QA, highlighting both challenges and opportunities for advancing their application in complex, data-driven environments. Our benchmark implementation and results are available at https://github.com/Relaxed-System-Lab/TQA-Bench.
Published: 2024

42. Zero-Indexing Internet Search Augmented Generation for Large Language Models

Author: He, Guangxin, Dai, Zonghong, Zhu, Jiangcheng, Zhao, Binqiang, Hu, Qicheng, Li, Chenyue, Peng, You, Wang, Chen, and Yuan, Binhang
Subjects: Computer Science - Information Retrieval
Abstract: Retrieval augmented generation has emerged as an effective method to enhance large language model performance. This approach typically relies on an internal retrieval module that uses various indexing mechanisms to manage a static pre-processed corpus. However, such a paradigm often falls short when it is necessary to integrate the most up-to-date information that has not been updated into the corpus during generative inference time. In this paper, we explore an alternative approach that leverages standard search engine APIs to dynamically integrate the latest online information (without maintaining any index for any fixed corpus), thereby improving the quality of generated content. We design a collaborative LLM-based paradigm, where we include: (i) a parser-LLM that determines if the Internet augmented generation is demanded and extracts the search keywords if so with a single inference; (ii) a mixed ranking strategy that re-ranks the retrieved HTML files to eliminate bias introduced from the search engine API; and (iii) an extractor-LLM that can accurately and efficiently extract relevant information from the fresh content in each HTML file. We conduct extensive empirical studies to evaluate the performance of this Internet search augmented generation paradigm. The experimental results demonstrate that our method generates content with significantly improved quality. Our system has been successfully deployed in a production environment to serve 01.AI's generative inference requests.
Published: 2024

43. PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors

Author: Wei, Guangshun, Feng, Yuan, Ma, Long, Wang, Chen, Zhou, Yuanfeng, and Li, Changjian
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: This paper presents PCDreamer, a novel method for point cloud completion. Traditional methods typically extract features from partial point clouds to predict missing regions, but the large solution space often leads to unsatisfactory results. More recent approaches have started to use images as extra guidance, effectively improving performance, but obtaining paired data of images and partial point clouds is challenging in practice. To overcome these limitations, we harness the relatively view-consistent multi-view diffusion priors within large models, to generate novel views of the desired shape. The resulting image set encodes both global and local shape cues, which is especially beneficial for shape completion. To fully exploit the priors, we have designed a shape fusion module for producing an initial complete shape from multi-modality input (\ie, images and point clouds), and a follow-up shape consolidation module to obtain the final complete shape by discarding unreliable points introduced by the inconsistency from diffusion priors. Extensive experimental results demonstrate our superior performance, especially in recovering fine details.
Published: 2024

44. Searching radio signals from two magnetars and a high-magnetic field pulsar and the serendipitous discovery of a new radio pulsar PSR J1935+2200

Author: Xie, Lang, Han, J. L., Yang, Z. L., Jing, W. C., Zhou, D. J., Su, W. Q., Yan, Yi, Wang, Tao, Cai, N. N., Wang, P. F., and Wang, Chen
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: Magnetars are slowly rotating, highly magnetized young neutron stars that can show transient radio phenomena for radio pulses and fast radio bursts. We conducted radio observations of from two magnetars SGR$~$J1935+2154 and 3XMM$~$J185246.6+003317 and a high-magnetic field pulsar PSR$~$J1846$-$0258 using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). We performed single pulse and periodicity searches and did not detect radio signals from them. From the piggyback data recorded by other FAST telescope beams when we observed the magnetar SGR$~$1935+2154, we serendipitously discovered a new radio pulsar, PSR$~$J1935+2200. We carried out the follow-up observations and obtained the timing solution based on these new observations and the archive FAST data. PSR$~$J1935+2200 is an isolated old pulsar, with a spin period of $0.91$s, a spin-period derivative of $9.19 \times 10^{-15}$~s~s$^{-1}$, and a characteristic age of $1.57$ Myr. It is a weak pulsar with a flux density of 9.8 $\mu$Jy at 1.25 GHz. Discovery of a new pulsar from the long FAST observations of 30 minutes implies that there may be more weak older pulsars in the Galactic disk to be discovered., Comment: 7 pages, 3 figures and 3 tables. Published in RAA
Published: 2024
Full Text: View/download PDF

45. A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems

Author: Zhang, Dawen, Xu, Xiwei, Wang, Chen, Xing, Zhenchang, and Mao, Robert
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Multiagent Systems
Abstract: Significant efforts has been made to expand the use of Large Language Models (LLMs) beyond basic language tasks. While the generalizability and versatility of LLMs have enabled widespread adoption, evolving demands in application development often exceed their native capabilities. Meeting these demands may involve a diverse set of methods, such as enhancing creativity through either inference temperature adjustments or creativity-provoking prompts. Selecting the right approach is critical, as different methods lead to trade-offs in engineering complexity, scalability, and operational costs. This paper introduces a layered architecture that organizes LLM software system development into distinct layers, each characterized by specific attributes. By aligning capabilities with these layers, the framework encourages the systematic implementation of capabilities in effective and efficient ways that ultimately supports desired functionalities and qualities. Through practical case studies, we illustrate the utility of the framework. This work offers developers actionable insights for selecting suitable technologies in LLM-based software system development, promoting robustness and scalability.
Published: 2024

46. Membership Inference Attack against Long-Context Large Language Models

Author: Wang, Zixiong, Liu, Gaoyang, Yang, Yang, and Wang, Chen
Subjects: Computer Science - Computation and Language
Abstract: Recent advances in Large Language Models (LLMs) have enabled them to overcome their context window limitations, and demonstrate exceptional retrieval and reasoning capacities on longer context. Quesion-answering systems augmented with Long-Context Language Models (LCLMs) can automatically search massive external data and incorporate it into their contexts, enabling faithful predictions and reducing issues such as hallucinations and knowledge staleness. Existing studies targeting LCLMs mainly concentrate on addressing the so-called lost-in-the-middle problem or improving the inference effiencicy, leaving their privacy risks largely unexplored. In this paper, we aim to bridge this gap and argue that integrating all information into the long context makes it a repository of sensitive information, which often contains private data such as medical records or personal identities. We further investigate the membership privacy within LCLMs external context, with the aim of determining whether a given document or sequence is included in the LCLMs context. Our basic idea is that if a document lies in the context, it will exhibit a low generation loss or a high degree of semantic similarity to the contents generated by LCLMs. We for the first time propose six membership inference attack (MIA) strategies tailored for LCLMs and conduct extensive experiments on various popular models. Empirical results demonstrate that our attacks can accurately infer membership status in most cases, e.g., 90.66% attack F1-score on Multi-document QA datasets with LongChat-7b-v1.5-32k, highlighting significant risks of membership leakage within LCLMs input contexts. Furthermore, we examine the underlying reasons why LCLMs are susceptible to revealing such membership information.
Published: 2024

47. V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

Author: Yang, Lei, Zhang, Xinyu, Li, Jun, Wang, Chen, Song, Zhiying, Zhao, Tong, Song, Ziying, Wang, Li, Zhou, Mo, Shen, Yang, Wu, Kai, and Lv, Chen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Modern autonomous vehicle perception systems often struggle with occlusions and limited perception range. Previous studies have demonstrated the effectiveness of cooperative perception in extending the perception range and overcoming occlusions, thereby improving the safety of autonomous driving. In recent years, a series of cooperative perception datasets have emerged. However, these datasets only focus on camera and LiDAR, overlooking 4D Radar, a sensor employed in single-vehicle autonomous driving for robust perception in adverse weather conditions. In this paper, to bridge the gap of missing 4D Radar datasets in cooperative perception, we present V2X-Radar, the first large real-world multi-modal dataset featuring 4D Radar. Our V2X-Radar dataset is collected using a connected vehicle platform and an intelligent roadside unit equipped with 4D Radar, LiDAR, and multi-view cameras. The collected data includes sunny and rainy weather conditions, spanning daytime, dusk, and nighttime, as well as typical challenging scenarios. The dataset comprises 20K LiDAR frames, 40K camera images, and 20K 4D Radar data, with 350K annotated bounding boxes across five categories. To facilitate diverse research domains, we establish V2X-Radar-C for cooperative perception, V2X-Radar-I for roadside perception, and V2X-Radar-V for single-vehicle perception. We further provide comprehensive benchmarks of recent perception algorithms on the above three sub-datasets. The dataset and benchmark codebase will be available at \url{http://openmpd.com/column/V2X-Radar}., Comment: 11 pages, 5 figures
Published: 2024

48. Fully Dynamic Adversarially Robust Correlation Clustering in Polylogarithmic Update Time

Author: Braverman, Vladimir, Dharangutte, Prathamesh, Pai, Shreyas, Shah, Vihan, and Wang, Chen
Subjects: Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning
Abstract: We study the dynamic correlation clustering problem with $\textit{adaptive}$ edge label flips. In correlation clustering, we are given a $n$-vertex complete graph whose edges are labeled either $(+)$ or $(-)$, and the goal is to minimize the total number of $(+)$ edges between clusters and the number of $(-)$ edges within clusters. We consider the dynamic setting with adversarial robustness, in which the $\textit{adaptive}$ adversary could flip the label of an edge based on the current output of the algorithm. Our main result is a randomized algorithm that always maintains an $O(1)$-approximation to the optimal correlation clustering with $O(\log^{2}{n})$ amortized update time. Prior to our work, no algorithm with $O(1)$-approximation and $\text{polylog}{(n)}$ update time for the adversarially robust setting was known. We further validate our theoretical results with experiments on synthetic and real-world datasets with competitive empirical performances. Our main technical ingredient is an algorithm that maintains $\textit{sparse-dense decomposition}$ with $\text{polylog}{(n)}$ update time, which could be of independent interest.
Published: 2024

49. Non-Hermitian Effects in Dicke models

Author: Jiang, Bin, Li, Yi-Yang, Liu, Junjie, Wang, Chen, and Jiang, Jian-Hua
Subjects: Quantum Physics, Physics - Optics
Abstract: The Dicke model, which describes the collective interaction between an ensemble of atoms and a single-mode photon field, serves as a fundamental framework for studying light-matter interactions and quantum electrodynamic phenomena. In this work, we investigate the manifestation of non-Hermitian effects in a generalized Dicke model, where two dissipative atom ensembles interact with a single-mode photon field. By applying the Holstein-Primakoff transformation, we explore the system in the semiclassical limit as a non-Hermitian Dicke model, revealing rich exceptional points (EPs) and diabolic points in such a system. We find that, by introducing the nonlinear saturation gain into an atomic ensemble, higher-order EP can be induced, leading to intriguing properties. Furthermore, if the system is extended to a one-dimensional chain, then the band topology will interplay with the non-Hermitian effect. In the quantum regime, we explore the quantum signature of EPs, noting that the conditions for their emergence are influenced by discrete photon numbers. We further study the transition from photon anti-bunching to bunching at a steady state, driven by non-Hermitian dynamics. Our findings deepen the understanding of non-Hermitian physics in light-matter interaction which is instructive for the design of advanced photonic and quantum systems.
Published: 2024

50. Object-Centric Dexterous Manipulation from Human Motion Data

Author: Chen, Yuanpei, Wang, Chen, Yang, Yaodong, and Liu, C. Karen
Subjects: Computer Science - Robotics
Abstract: Manipulating objects to achieve desired goal states is a basic but important skill for dexterous manipulation. Human hand motions demonstrate proficient manipulation capability, providing valuable data for training robots with multi-finger hands. Despite this potential, substantial challenges arise due to the embodiment gap between human and robot hands. In this work, we introduce a hierarchical policy learning framework that uses human hand motion data for training object-centric dexterous robot manipulation. At the core of our method is a high-level trajectory generative model, learned with a large-scale human hand motion capture dataset, to synthesize human-like wrist motions conditioned on the desired object goal states. Guided by the generated wrist motions, deep reinforcement learning is further used to train a low-level finger controller that is grounded in the robot's embodiment to physically interact with the object to achieve the goal. Through extensive evaluation across 10 household objects, our approach not only demonstrates superior performance but also showcases generalization capability to novel object geometries and goal states. Furthermore, we transfer the learned policies from simulation to a real-world bimanual dexterous robot system, further demonstrating its applicability in real-world scenarios. Project website: https://cypypccpy.github.io/obj-dex.github.io/., Comment: 20 pages, 7 figures
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

48,720 results on '"WANG, Chen"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources