48,035 results on '"WANG, Chen"'
Search Results
2. Balance Divergence for Knowledge Distillation
- Author
-
Qi, Yafei, Wang, Chen, Zhang, Zhaoning, Liu, Yaping, and Zhang, Yongmin
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Knowledge distillation has been widely adopted in computer vision task processing, since it can effectively enhance the performance of lightweight student networks by leveraging the knowledge transferred from cumbersome teacher networks. Most existing knowledge distillation methods utilize Kullback-Leibler divergence to mimic the logit output probabilities between the teacher network and the student network. Nonetheless, these methods may neglect the negative parts of the teacher's ''dark knowledge'' because the divergence calculations may ignore the effect of the minute probabilities from the teacher's logit output. This deficiency may lead to suboptimal performance in logit mimicry during the distillation process and result in an imbalance of information acquired by the student network. In this paper, we investigate the impact of this imbalance and propose a novel method, named Balance Divergence Distillation. By introducing a compensatory operation using reverse Kullback-Leibler divergence, our method can improve the modeling of the extremely small values in the negative from the teacher and preserve the learning capacity for the positive. Furthermore, we test the impact of different temperature coefficients adjustments, which may conducted to further balance for knowledge transferring. We evaluate the proposed method on several computer vision tasks, including image classification and semantic segmentation. The evaluation results show that our method achieves an accuracy improvement of 1%~3% for lightweight students on both CIFAR-100 and ImageNet dataset, and a 4.55% improvement in mIoU for PSP-ResNet18 on the Cityscapes dataset. The experiments show that our method is a simple yet highly effective solution that can be smoothly applied to different knowledge distillation methods.
- Published
- 2025
3. LPRnet: A self-supervised registration network for LiDAR and photogrammetric point clouds
- Author
-
Wang, Chen, Gu, Yanfeng, and Li, Xian
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
LiDAR and photogrammetry are active and passive remote sensing techniques for point cloud acquisition, respectively, offering complementary advantages and heterogeneous. Due to the fundamental differences in sensing mechanisms, spatial distributions and coordinate systems, their point clouds exhibit significant discrepancies in density, precision, noise, and overlap. Coupled with the lack of ground truth for large-scale scenes, integrating the heterogeneous point clouds is a highly challenging task. This paper proposes a self-supervised registration network based on a masked autoencoder, focusing on heterogeneous LiDAR and photogrammetric point clouds. At its core, the method introduces a multi-scale masked training strategy to extract robust features from heterogeneous point clouds under self-supervision. To further enhance registration performance, a rotation-translation embedding module is designed to effectively capture the key features essential for accurate rigid transformations. Building upon the robust representations, a transformer-based architecture seamlessly integrates local and global features, fostering precise alignment across diverse point cloud datasets. The proposed method demonstrates strong feature extraction capabilities for both LiDAR and photogrammetric point clouds, addressing the challenges of acquiring ground truth at the scene level. Experiments conducted on two real-world datasets validate the effectiveness of the proposed method in solving heterogeneous point cloud registration problems., Comment: 12 pages, 9 figures, 5 tables
- Published
- 2025
4. Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation
- Author
-
Meng, Xuyi, Wang, Chen, Lei, Jiahui, Daniilidis, Kostas, Gu, Jiatao, and Liu, Lingjie
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent advances in 2D image generation have achieved remarkable quality,largely driven by the capacity of diffusion models and the availability of large-scale datasets. However, direct 3D generation is still constrained by the scarcity and lower fidelity of 3D datasets. In this paper, we introduce Zero-1-to-G, a novel approach that addresses this problem by enabling direct single-view generation on Gaussian splats using pretrained 2D diffusion models. Our key insight is that Gaussian splats, a 3D representation, can be decomposed into multi-view images encoding different attributes. This reframes the challenging task of direct 3D generation within a 2D diffusion framework, allowing us to leverage the rich priors of pretrained 2D diffusion models. To incorporate 3D awareness, we introduce cross-view and cross-attribute attention layers, which capture complex correlations and enforce 3D consistency across generated splats. This makes Zero-1-to-G the first direct image-to-3D generative model to effectively utilize pretrained 2D diffusion priors, enabling efficient training and improved generalization to unseen objects. Extensive experiments on both synthetic and in-the-wild datasets demonstrate superior performance in 3D object generation, offering a new approach to high-quality 3D generation.
- Published
- 2025
5. Recorder: Comprehensive Parallel I/O Tracing and Analysis
- Author
-
Wang, Chen, Yildirim, Izzet, Devarajan, Hariharan, Mohror, Kathryn, and Snir, Marc
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Performance - Abstract
This paper presents Recorder, a parallel I/O tracing tool designed to capture comprehensive I/O information on HPC applications. Recorder traces I/O calls across various I/O layers, storing all function parameters for each captured call. The volume of stored information scales linearly the application's execution scale. To address this, we present a sophisticated pattern-recognition-based compression algorithm. This algorithm identifies and compresses recurring I/O patterns both within individual processes and across multiple processes, significantly reducing space and time overheads. We evaluate the proposed compression algorithm using I/O benchmarks and real-world applications, demonstrating that Recorder can store more information while requiring approximately 12x less storage space compared to its predecessor. Notably, for applications with typical parallel I/O patterns, Recorder achieves a constant trace size regardless of execution scale. Additionally, a comparison with the profiling tool Darshan shows that Recorder captures detailed I/O information without incurring substantial overhead. The richer data collected by Recorder enables new insights and facilitates more in-depth I/O studies, offering valuable contributions to the I/O research community., Comment: 29 pages. Under Review. Submitted to the Journal of Supercomputing
- Published
- 2025
6. ProTracker: Probabilistic Integration for Robust and Accurate Point Tracking
- Author
-
Zhang, Tingyang, Wang, Chen, Dou, Zhiyang, Gao, Qingzhe, Lei, Jiahui, Chen, Baoquan, and Liu, Lingjie
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this paper, we propose ProTracker, a novel framework for robust and accurate long-term dense tracking of arbitrary points in videos. The key idea of our method is incorporating probabilistic integration to refine multiple predictions from both optical flow and semantic features for robust short-term and long-term tracking. Specifically, we integrate optical flow estimations in a probabilistic manner, producing smooth and accurate trajectories by maximizing the likelihood of each prediction. To effectively re-localize challenging points that disappear and reappear due to occlusion, we further incorporate long-term feature correspondence into our flow predictions for continuous trajectory generation. Extensive experiments show that ProTracker achieves the state-of-the-art performance among unsupervised and self-supervised approaches, and even outperforms supervised methods on several benchmarks. Our code and model will be publicly available upon publication., Comment: Project page: https://michaelszj.github.io/protracker
- Published
- 2025
7. Gravity potential determination based on China Space Station Dual-frequency microwave links frequency transfer
- Author
-
Zhang, Peng Fei, Wang, Chen Xiang, Li, Li Hong, Wang, Lei, Shen, Zi Yu, Xu, Rui, Ning, An, Ruby, Abdelrahim, and Shen, Wen-Bin
- Subjects
Physics - Geophysics ,General Relativity and Quantum Cosmology - Abstract
The China Space Station (CSS) is currently in orbit and carries the high-precision optical atomic clock with stability of approximately $2.0 \times 10^{-15} / \sqrt{\tau}$ in its experiment module. We have developed a model to determine the gravity potential (GP) based on the gravity frequency shift equation and have created both one-way and dual-frequency transfer models up to $c^{-4}$. These models consider effects from the troposphere, ionosphere, and solid Earth tides. The proposed model is suitable for measurements at the magnitude of $10^{-19}$. Based on the CSS mission, we conducted the simulation experiments. The results indicate that when processing the simulation frequency signal using the proposed model, we can obtain the GP with the accuracies of $ (1.13\pm0.71)\,\mathrm{m^2/s^2}$, $ (0.09\pm0.89)\,\mathrm{m^2/s^2}$, and $(0.66\pm1.18)\,\mathrm{m^2/s^2}$ for cutoff elevation angles of $5^{\circ}$, $10^{\circ}$ and $15^{\circ}$, respectively. With the high-precision optical atomic clock onboard the CSS, the proposed model enables us to measure the GP differences in the magnitude of centimeter-level accuracy.
- Published
- 2024
8. FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs
- Author
-
Zhou, Yuanchang, Hu, Siyu, Wang, Chen, Wang, Lin-Wang, Tan, Guangming, and Jia, Weile
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning - Abstract
Graph neural network universal interatomic potentials (GNN-UIPs) have demonstrated remarkable generalization and transfer capabilities in material discovery and property prediction. These models can accelerate molecular dynamics (MD) simulation by several orders of magnitude while maintaining \textit{ab initio} accuracy, making them a promising new paradigm in material simulations. One notable example is Crystal Hamiltonian Graph Neural Network (CHGNet), pretrained on the energies, forces, stresses, and magnetic moments from the MPtrj dataset, representing a state-of-the-art GNN-UIP model for charge-informed MD simulations. However, training the CHGNet model is time-consuming(8.3 days on one A100 GPU) for three reasons: (i) requiring multi-layer propagation to reach more distant atom information, (ii) requiring second-order derivatives calculation to finish weights updating and (iii) the implementation of reference CHGNet does not fully leverage the computational capabilities. This paper introduces FastCHGNet, an optimized CHGNet, with three contributions: Firstly, we design innovative Force/Stress Readout modules to decompose Force/Stress prediction. Secondly, we adopt massive optimizations such as kernel fusion, redundancy bypass, etc, to exploit GPU computation power sufficiently. Finally, we extend CHGNet to support multiple GPUs and propose a load-balancing technique to enhance GPU utilization. Numerical results show that FastCHGNet reduces memory footprint by a factor of 3.59. The final training time of FastCHGNet can be decreased to \textbf{1.53 hours} on 32 GPUs without sacrificing model accuracy.
- Published
- 2024
9. Parallel I/O Characterization and Optimization on Large-Scale HPC Systems: A 360-Degree Survey
- Author
-
Ather, Hammad, Bez, Jean Luca, Wang, Chen, Childs, Hank, Malony, Allen D., and Byna, Suren
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Performance - Abstract
Driven by artificial intelligence, data science, and high-resolution simulations, I/O workloads and hardware on high-performance computing (HPC) systems have become increasingly complex. This complexity can lead to large I/O overheads and overall performance degradation. These inefficiencies are often mitigated using tools and techniques for characterizing, analyzing, and optimizing the I/O behavior of HPC applications. That said, the myriad number of tools and techniques available makes it challenging to navigate to the best approach. In response, this paper surveys 131 papers from the ACM Digital Library, IEEE Xplore, and other reputable journals to provide a comprehensive analysis, synthesized in the form of a taxonomy, of the current landscape of parallel I/O characterization, analysis, and optimization of large-scale HPC systems. We anticipate that this taxonomy will serve as a valuable resource for enhancing I/O performance of HPC applications., Comment: 31 pages, 1 figure, 7 tables
- Published
- 2024
10. Triple real-emission contribution to the zero-jettiness soft function at N3LO in QCD
- Author
-
Baranowski, Daniel, Delto, Maximilian, Melnikov, Kirill, Pikelner, Andrey, and Wang, Chen-Yu
- Subjects
High Energy Physics - Phenomenology - Abstract
Recently, we have presented the result for the zero-jettiness soft function at next-to-next-to-next-to-leading order (N3LO) in perturbative QCD [arXiv:2409.11042], without providing technical details of the calculation. The goal of this paper is to describe the most important element of that computation, the triple real-emission contribution. We present a detailed discussion of the many technical aspects of the calculation, for which a number of methodological innovations was required. Although some elements of the calculation were discussed earlier [arXiv:2004.03285,arXiv:2206.12323,arXiv:2111.13594,arXiv:2204.09459,arXiv:2401.05245], this paper is intended to provide a complete summary of the methods used in the computation of the triple real-emission contribution to the soft function., Comment: 75 pages, 5 figures
- Published
- 2024
11. Transport resistance dominates the fill factor losses in record organic solar cells
- Author
-
Wang, Chen, MacKenzie, Roderick C. I., Würfel, Uli, Neher, Dieter, Kirchartz, Thomas, Deibel, Carsten, and Saladina, Maria
- Subjects
Condensed Matter - Materials Science - Abstract
Organic photovoltaics are a promising solar cell technology well-suited to mass production using roll-to-roll processes. The efficiency of lab-scale solar cells has exceeded 20% and considerable attention is currently being given to understanding and minimising the remaining loss mechanisms preventing higher efficiencies. While recent efficiency improvements are partly owed to reducing non-radiative recombination losses at open-circuit, the low fill factor due to a significant transport resistance is becoming the Achilles heel of organic photovoltaics. The term transport resistance refers to a voltage and light intensity dependent charge collection loss in low-mobility materials. In this Perspective, we demonstrate that even the highest efficiency organic solar cells reported to-date have significant performance losses that can be attributed to transport resistance and that lead to high fill factor losses. We provide a closer look at the transport resistance and the material properties influencing it. We describe how to experimentally characterise and quantify the transport resistance by providing easy to follow instructions. Furthermore, the causes and theory behind transport resistance are detailed. In particular, we integrate the relevant figures of merit and different viewpoints on the transport resistance. Finally, we outline strategies that can be followed to minimise these charge collection losses in future solar cells., Comment: Perspective (31 page, 13 figures)
- Published
- 2024
12. A2H: A UI Converter from Android to HarmonyOS Platform
- Author
-
Wang, Chen, Gong, Lina, Huang, Yujun, Cui, Di, and Wei, Mingqiang
- Subjects
Computer Science - Software Engineering - Abstract
With the growing importance of smartphones, developers face the challenge of creating separate applications for multiple platforms (e.g., Android, iOS, and HarmonyOS), leading to increased development costs and longer iteration cycles. One potential solution is to develop an app on one platform and then automatically convert it to other platforms, reducing the need for separate development efforts. However, migrating user interfaces (UIs) between platforms is particularly challenging due to significant differences in layout structures and development paradigms, such as the disparity between XML layout files in Android and ArkUI framework in HarmonyOS. Manual conversion of UIs is time-consuming, error-prone, and inefficient, necessitating an automated solution to streamline the process and enable seamless migration from Android to HarmonyOS. To address this challenge, we propose the A2H Converter, an automated tool for migrating Android UIs to HarmonyOS. The tool employs an large language model (LLM)-driven multi-agent framework to convert Android XML layouts into HarmonyOS ArkUI layouts. Using the RAG combing with decision rules, the system maps Android UI components to ArkUI equivalents, while a reflective mechanism continuously improves conversion accuracy. A2H Converter handles project-level layouts, ensuring consistency across multiple files and addressing complex UI logic. Experiments on six Android applications collected from GitHub demonstrate that our A2H Converter achieves a migration success rate of over 90.1%, 89.3%, and 89.2% at the component, page, and project levels, respectively. The demo video is available at. The tool is available at http://124.70.54.129:37860/., Comment: 5 pages
- Published
- 2024
13. Establishing a New Benchmark in Quantum Computational Advantage with 105-qubit Zuchongzhi 3.0 Processor
- Author
-
Gao, Dongxin, Fan, Daojin, Zha, Chen, Bei, Jiahao, Cai, Guoqing, Cai, Jianbin, Cao, Sirui, Zeng, Xiangdong, Chen, Fusheng, Chen, Jiang, Chen, Kefu, Chen, Xiawei, Chen, Xiqing, Chen, Zhe, Chen, Zhiyuan, Chen, Zihua, Chu, Wenhao, Deng, Hui, Deng, Zhibin, Ding, Pei, Ding, Xun, Ding, Zhuzhengqi, Dong, Shuai, Dong, Yupeng, Fan, Bo, Fu, Yuanhao, Gao, Song, Ge, Lei, Gong, Ming, Gui, Jiacheng, Guo, Cheng, Guo, Shaojun, Guo, Xiaoyang, He, Tan, Hong, Linyin, Hu, Yisen, Huang, He-Liang, Huo, Yong-Heng, Jiang, Tao, Jiang, Zuokai, Jin, Honghong, Leng, Yunxiang, Li, Dayu, Li, Dongdong, Li, Fangyu, Li, Jiaqi, Li, Jinjin, Li, Junyan, Li, Junyun, Li, Na, Li, Shaowei, Li, Wei, Li, Yuhuai, Li, Yuan, Liang, Futian, Liang, Xuelian, Liao, Nanxing, Lin, Jin, Lin, Weiping, Liu, Dailin, Liu, Hongxiu, Liu, Maliang, Liu, Xinyu, Liu, Xuemeng, Liu, Yancheng, Lou, Haoxin, Ma, Yuwei, Meng, Lingxin, Mou, Hao, Nan, Kailiang, Nie, Binghan, Nie, Meijuan, Ning, Jie, Niu, Le, Peng, Wenyi, Qian, Haoran, Rong, Hao, Rong, Tao, Shen, Huiyan, Shen, Qiong, Su, Hong, Su, Feifan, Sun, Chenyin, Sun, Liangchao, Sun, Tianzuo, Sun, Yingxiu, Tan, Yimeng, Tan, Jun, Tang, Longyue, Tu, Wenbing, Wan, Cai, Wang, Jiafei, Wang, Biao, Wang, Chang, Wang, Chen, Wang, Chu, Wang, Jian, Wang, Liangyuan, Wang, Rui, Wang, Shengtao, Wang, Xinzhe, Wei, Zuolin, Wei, Jiazhou, Wu, Dachao, Wu, Gang, Wu, Jin, Wu, Shengjie, Wu, Yulin, Xie, Shiyong, Xin, Lianjie, Xu, Yu, Xue, Chun, Yan, Kai, Yang, Weifeng, Yang, Xinpeng, Yang, Yang, Ye, Yangsen, Ye, Zhenping, Ying, Chong, Yu, Jiale, Yu, Qinjing, Yu, Wenhu, Zhan, Shaoyu, Zhang, Feifei, Zhang, Haibin, Zhang, Kaili, Zhang, Pan, Zhang, Wen, Zhang, Yiming, Zhang, Yongzhuo, Zhang, Lixiang, Zhao, Guming, Zhao, Peng, Zhao, Xianhe, Zhao, Xintao, Zhao, Youwei, Zhao, Zhong, Zheng, Luyuan, Zhou, Fei, Zhou, Liang, Zhou, Na, Zhou, Naibin, Zhou, Shifeng, Zhou, Shuang, Zhou, Zhengxiao, Zhu, Chengjun, Zhu, Qingling, Zou, Guihong, Zou, Haonan, Zhang, Qiang, Lu, Chao-Yang, Peng, Cheng-Zhi, Zhu, XiaoBo, and Pan, Jian-Wei
- Subjects
Quantum Physics - Abstract
In the relentless pursuit of quantum computational advantage, we present a significant advancement with the development of Zuchongzhi 3.0. This superconducting quantum computer prototype, comprising 105 qubits, achieves high operational fidelities, with single-qubit gates, two-qubit gates, and readout fidelity at 99.90%, 99.62% and 99.18%, respectively. Our experiments with an 83-qubit, 32-cycle random circuit sampling on Zuchongzhi 3.0 highlight its superior performance, achieving one million samples in just a few hundred seconds. This task is estimated to be infeasible on the most powerful classical supercomputers, Frontier, which would require approximately $6.4\times 10^9$ years to replicate the task. This leap in processing power places the classical simulation cost six orders of magnitude beyond Google's SYC-67 and SYC-70 experiments [Nature 634, 328(2024)], firmly establishing a new benchmark in quantum computational advantage. Our work not only advances the frontiers of quantum computing but also lays the groundwork for a new era where quantum processors play an essential role in tackling sophisticated real-world challenges.
- Published
- 2024
14. iKap: Kinematics-aware Planning with Imperative Learning
- Author
-
Li, Qihang, Chen, Zhuoqun, Zheng, Haoze, He, Haonan, Su, Shaoshu, Geng, Junyi, and Wang, Chen
- Subjects
Computer Science - Robotics - Abstract
Trajectory planning in robotics aims to generate collision-free pose sequences that can be reliably executed. Recently, vision-to-planning systems have garnered increasing attention for their efficiency and ability to interpret and adapt to surrounding environments. However, traditional modular systems suffer from increased latency and error propagation, while purely data-driven approaches often overlook the robot's kinematic constraints. This oversight leads to discrepancies between planned trajectories and those that are executable. To address these challenges, we propose iKap, a novel vision-to-planning system that integrates the robot's kinematic model directly into the learning pipeline. iKap employs a self-supervised learning approach and incorporates the state transition model within a differentiable bi-level optimization framework. This integration ensures the network learns collision-free waypoints while satisfying kinematic constraints, enabling gradient back-propagation for end-to-end training. Our experimental results demonstrate that iKap achieves higher success rates and reduced latency compared to the state-of-the-art methods. Besides the complete system, iKap offers a visual-to-planning network that seamlessly integrates kinematics into various controllers, providing a robust solution for robots navigating complex and dynamic environments., Comment: 6 pages, 6 figures
- Published
- 2024
15. CRAFTS for HI cosmology: I. data analysis and preliminary results
- Author
-
Yang, Wenxiu, Wolz, Laura, Li, Yichao, Hu, Wenkai, Cunnington, Steven, Grainge, Keith, Deng, Furen, Zuo, Shifan, Shu, Shuanghao, Zhao, Xinyang, Li, Di, Zheng, Zheng, Krčo, Marko, Zheng, Yinghui, Feng, Linjing, Zuo, Pei, Chen, Hao, Jiang, Xue-Jian, Wang, Chen, Wang, Pei, Miao, Chen-Chen, Wang, Yougang, and Chen, Xuelei
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
We present the results from calibrating the data of the Commensal Radio Astronomy FAST Survey (CRAFTS) for \HI intensity mapping by the Five-hundred-meter Aperture Spherical Radio Telescope (FAST). Using 70 hours of drift-scan observation with the L-band (1.05-1.45GHz) 19-beam receiver, we obtain the data covering $270\,\rm deg^2$ sky area. We employ both the pulsar backend and the spectrum backend to calibrate the spectral time-ordered-data (TOD) before projecting them onto HEALPix maps. We produce calibrated TOD with frequency resolution of 30 kHz and time resolution of 1 s and the map data-cube with frequency resolution of 30kHz and spatial resolution of $2.95\,\rm arcmin^2$. We carefully examine the pointing errors, noise overflow, RFI contamination and their effect on the data quality. The resulting noise level is $\sim$ 5.7 mJy for the calibrated TOD and 1.6 mJy for the map, which is consistent with the theoretical predictions within 5\% at RFI-free channels. We also validate the data by Principal Components Analysis (PCA) and find most foreground components are concentrated in the first 30 modes. We identify 447 isolated bright continuum sources in our data matching the NRAO-VLA Sky Survey (NVSS) catalog, with relative flux error of 8.3\% for TOD and 11.9\% for the map-level. We also measure the \HI emission of 90 galaxies with redshift $z<0.07$ and compare with \HI-MaNGA spectra from the Green Bank Telescope (GBT), yielding an overall relative error of the \HI integral flux of 16.7\%. Our results confirm the feasibility of conducting cosmological \HI signal detection with CRAFTS., Comment: 30 pages, 30 figures, and 3 tables
- Published
- 2024
16. An Efficient Scene Coordinate Encoding and Relocalization Method
- Author
-
Xu, Kuan, Jiang, Zeyu, Cao, Haozhi, Yuan, Shenghai, Wang, Chen, and Xie, Lihua
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Scene Coordinate Regression (SCR) is a visual localization technique that utilizes deep neural networks (DNN) to directly regress 2D-3D correspondences for camera pose estimation. However, current SCR methods often face challenges in handling repetitive textures and meaningless areas due to their reliance on implicit triangulation. In this paper, we propose an efficient scene coordinate encoding and relocalization method. Compared with the existing SCR methods, we design a unified architecture for both scene encoding and salient keypoint detection, enabling our system to focus on encoding informative regions, thereby significantly enhancing efficiency. Additionally, we introduce a mechanism that leverages sequential information during both map encoding and relocalization, which strengthens implicit triangulation, particularly in repetitive texture environments. Comprehensive experiments conducted across indoor and outdoor datasets demonstrate that the proposed system outperforms other state-of-the-art (SOTA) SCR methods. Our single-frame relocalization mode improves the recall rate of our baseline by 6.4% and increases the running speed from 56Hz to 90Hz. Furthermore, our sequence-based mode increases the recall rate by 11% while maintaining the original efficiency., Comment: 8 pages, 6 figures
- Published
- 2024
17. Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems
- Author
-
Li, Rui, Wang, Song, and Wang, Chen
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Mathematics - Numerical Analysis - Abstract
Preconditioning techniques are crucial for enhancing the efficiency of solving large-scale linear equation systems that arise from partial differential equation (PDE) discretization. These techniques, such as Incomplete Cholesky factorization (IC) and data-driven neural network methods, accelerate the convergence of iterative solvers like Conjugate Gradient (CG) by approximating the original matrices. This paper introduces a novel approach that integrates Graph Neural Network (GNN) with traditional IC, addressing the shortcomings of direct generation methods based on GNN and achieving significant improvements in computational efficiency and scalability. Experimental results demonstrate an average reduction in iteration counts by 24.8% compared to IC and a two-order-of-magnitude increase in training scale compared to previous methods. A three-dimensional static structural analysis utilizing finite element methods was validated on training sparse matrices of up to 5 million dimensions and inference scales of up to 10 million. Furthermore, the approach demon-strates robust generalization capabilities across scales, facilitating the effective acceleration of CG solvers for large-scale linear equations using small-scale data on modest hardware. The method's robustness and scalability make it a practical solution for computational science.
- Published
- 2024
18. Graph-Sequential Alignment and Uniformity: Toward Enhanced Recommendation Systems
- Author
-
Cao, Yuwei, Yang, Liangwei, Liu, Zhiwei, Liu, Yuqing, Wang, Chen, Liang, Yueqing, Peng, Hao, and Yu, Philip S.
- Subjects
Computer Science - Information Retrieval - Abstract
Graph-based and sequential methods are two popular recommendation paradigms, each excelling in its domain but lacking the ability to leverage signals from the other. To address this, we propose a novel method that integrates both approaches for enhanced performance. Our framework uses Graph Neural Network (GNN)-based and sequential recommenders as separate submodules while sharing a unified embedding space optimized jointly. To enable positive knowledge transfer, we design a loss function that enforces alignment and uniformity both within and across submodules. Experiments on three real-world datasets demonstrate that the proposed method significantly outperforms using either approach alone and achieves state-of-the-art results. Our implementations are publicly available at https://github.com/YuweiCao-UIC/GSAU.git., Comment: Under review
- Published
- 2024
19. Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
- Author
-
Zhou, Wei, Zhao, Lei, Zhang, Runyu, Cui, Yifan, Huang, Hongpu, Qie, Kun, and Wang, Chen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Traffic Surveillance Systems (TSS) have become increasingly crucial in modern intelligent transportation systems, with vision-based technologies playing a central role for scene perception and understanding. While existing surveys typically focus on isolated aspects of TSS, a comprehensive analysis bridging low-level and high-level perception tasks, particularly considering emerging technologies, remains lacking. This paper presents a systematic review of vision-based technologies in TSS, examining both low-level perception tasks (object detection, classification, and tracking) and high-level perception applications (parameter estimation, anomaly detection, and behavior understanding). Specifically, we first provide a detailed methodological categorization and comprehensive performance evaluation for each task. Our investigation reveals five fundamental limitations in current TSS: perceptual data degradation in complex scenarios, data-driven learning constraints, semantic understanding gaps, sensing coverage limitations and computational resource demands. To address these challenges, we systematically analyze five categories of potential solutions: advanced perception enhancement, efficient learning paradigms, knowledge-enhanced understanding, cooperative sensing frameworks and efficient computing frameworks. Furthermore, we evaluate the transformative potential of foundation models in TSS, demonstrating their unique capabilities in zero-shot learning, semantic understanding, and scene generation. This review provides a unified framework bridging low-level and high-level perception tasks, systematically analyzes current limitations and solutions, and presents a structured roadmap for integrating emerging technologies, particularly foundation models, to enhance TSS capabilities.
- Published
- 2024
20. TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension
- Author
-
Qiu, Zipeng, Peng, You, He, Guangxin, Yuan, Binhang, and Wang, Chen
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Information Retrieval - Abstract
The advent of large language models (LLMs) has unlocked great opportunities in complex data management tasks, particularly in question answering (QA) over complicated multi-table relational data. Despite significant progress, systematically evaluating LLMs on multi-table QA remains a critical challenge due to the inherent complexity of analyzing heterogeneous table structures and potential large scale of serialized relational data. Existing benchmarks primarily focus on single-table QA, failing to capture the intricacies of reasoning across multiple relational tables, as required in real-world domains such as finance, healthcare, and e-commerce. To address this gap, we present TQA-Bench, a new multi-table QA benchmark designed to evaluate the capabilities of LLMs in tackling complex QA tasks over relational data. Our benchmark incorporates diverse relational database instances sourced from real-world public datasets and introduces a flexible sampling mechanism to create tasks with varying multi-table context lengths, ranging from 8K to 64K tokens. To ensure robustness and reliability, we integrate symbolic extensions into the evaluation framework, enabling the assessment of LLM reasoning capabilities beyond simple data retrieval or probabilistic pattern matching. We systematically evaluate a range of LLMs, both open-source and closed-source, spanning model scales from 7 billion to 70 billion parameters. Our extensive experiments reveal critical insights into the performance of LLMs in multi-table QA, highlighting both challenges and opportunities for advancing their application in complex, data-driven environments. Our benchmark implementation and results are available at https://github.com/Relaxed-System-Lab/TQA-Bench.
- Published
- 2024
21. Zero-Indexing Internet Search Augmented Generation for Large Language Models
- Author
-
He, Guangxin, Dai, Zonghong, Zhu, Jiangcheng, Zhao, Binqiang, Hu, Qicheng, Li, Chenyue, Peng, You, Wang, Chen, and Yuan, Binhang
- Subjects
Computer Science - Information Retrieval - Abstract
Retrieval augmented generation has emerged as an effective method to enhance large language model performance. This approach typically relies on an internal retrieval module that uses various indexing mechanisms to manage a static pre-processed corpus. However, such a paradigm often falls short when it is necessary to integrate the most up-to-date information that has not been updated into the corpus during generative inference time. In this paper, we explore an alternative approach that leverages standard search engine APIs to dynamically integrate the latest online information (without maintaining any index for any fixed corpus), thereby improving the quality of generated content. We design a collaborative LLM-based paradigm, where we include: (i) a parser-LLM that determines if the Internet augmented generation is demanded and extracts the search keywords if so with a single inference; (ii) a mixed ranking strategy that re-ranks the retrieved HTML files to eliminate bias introduced from the search engine API; and (iii) an extractor-LLM that can accurately and efficiently extract relevant information from the fresh content in each HTML file. We conduct extensive empirical studies to evaluate the performance of this Internet search augmented generation paradigm. The experimental results demonstrate that our method generates content with significantly improved quality. Our system has been successfully deployed in a production environment to serve 01.AI's generative inference requests.
- Published
- 2024
22. PCDreamer: Point Cloud Completion Through Multi-view Diffusion Priors
- Author
-
Wei, Guangshun, Feng, Yuan, Ma, Long, Wang, Chen, Zhou, Yuanfeng, and Li, Changjian
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics - Abstract
This paper presents PCDreamer, a novel method for point cloud completion. Traditional methods typically extract features from partial point clouds to predict missing regions, but the large solution space often leads to unsatisfactory results. More recent approaches have started to use images as extra guidance, effectively improving performance, but obtaining paired data of images and partial point clouds is challenging in practice. To overcome these limitations, we harness the relatively view-consistent multi-view diffusion priors within large models, to generate novel views of the desired shape. The resulting image set encodes both global and local shape cues, which is especially beneficial for shape completion. To fully exploit the priors, we have designed a shape fusion module for producing an initial complete shape from multi-modality input (\ie, images and point clouds), and a follow-up shape consolidation module to obtain the final complete shape by discarding unreliable points introduced by the inconsistency from diffusion priors. Extensive experimental results demonstrate our superior performance, especially in recovering fine details.
- Published
- 2024
23. Searching radio signals from two magnetars and a high-magnetic field pulsar and the serendipitous discovery of a new radio pulsar PSR J1935+2200
- Author
-
Xie, Lang, Han, J. L., Yang, Z. L., Jing, W. C., Zhou, D. J., Su, W. Q., Yan, Yi, Wang, Tao, Cai, N. N., Wang, P. F., and Wang, Chen
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Magnetars are slowly rotating, highly magnetized young neutron stars that can show transient radio phenomena for radio pulses and fast radio bursts. We conducted radio observations of from two magnetars SGR J1935+2154 and 3XMM J185246.6+003317 and a high-magnetic field pulsar PSR J1846$-$0258 using the Five-hundred-meter Aperture Spherical radio Telescope (FAST). We performed single pulse and periodicity searches and did not detect radio signals from them. From the piggyback data recorded by other FAST telescope beams when we observed the magnetar SGR 1935+2154, we serendipitously discovered a new radio pulsar, PSR J1935+2200. We carried out the follow-up observations and obtained the timing solution based on these new observations and the archive FAST data. PSR J1935+2200 is an isolated old pulsar, with a spin period of $0.91$s, a spin-period derivative of $9.19 \times 10^{-15}$~s~s$^{-1}$, and a characteristic age of $1.57$ Myr. It is a weak pulsar with a flux density of 9.8 $\mu$Jy at 1.25 GHz. Discovery of a new pulsar from the long FAST observations of 30 minutes implies that there may be more weak older pulsars in the Galactic disk to be discovered., Comment: 6 pages, 3 figures and 3 tables. Submitted to RAA
- Published
- 2024
24. A Layered Architecture for Developing and Enhancing Capabilities in Large Language Model-based Software Systems
- Author
-
Zhang, Dawen, Xu, Xiwei, Wang, Chen, Xing, Zhenchang, and Mao, Robert
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Multiagent Systems - Abstract
Significant efforts has been made to expand the use of Large Language Models (LLMs) beyond basic language tasks. While the generalizability and versatility of LLMs have enabled widespread adoption, evolving demands in application development often exceed their native capabilities. Meeting these demands may involve a diverse set of methods, such as enhancing creativity through either inference temperature adjustments or creativity-provoking prompts. Selecting the right approach is critical, as different methods lead to trade-offs in engineering complexity, scalability, and operational costs. This paper introduces a layered architecture that organizes LLM software system development into distinct layers, each characterized by specific attributes. By aligning capabilities with these layers, the framework encourages the systematic implementation of capabilities in effective and efficient ways that ultimately supports desired functionalities and qualities. Through practical case studies, we illustrate the utility of the framework. This work offers developers actionable insights for selecting suitable technologies in LLM-based software system development, promoting robustness and scalability.
- Published
- 2024
25. Membership Inference Attack against Long-Context Large Language Models
- Author
-
Wang, Zixiong, Liu, Gaoyang, Yang, Yang, and Wang, Chen
- Subjects
Computer Science - Computation and Language - Abstract
Recent advances in Large Language Models (LLMs) have enabled them to overcome their context window limitations, and demonstrate exceptional retrieval and reasoning capacities on longer context. Quesion-answering systems augmented with Long-Context Language Models (LCLMs) can automatically search massive external data and incorporate it into their contexts, enabling faithful predictions and reducing issues such as hallucinations and knowledge staleness. Existing studies targeting LCLMs mainly concentrate on addressing the so-called lost-in-the-middle problem or improving the inference effiencicy, leaving their privacy risks largely unexplored. In this paper, we aim to bridge this gap and argue that integrating all information into the long context makes it a repository of sensitive information, which often contains private data such as medical records or personal identities. We further investigate the membership privacy within LCLMs external context, with the aim of determining whether a given document or sequence is included in the LCLMs context. Our basic idea is that if a document lies in the context, it will exhibit a low generation loss or a high degree of semantic similarity to the contents generated by LCLMs. We for the first time propose six membership inference attack (MIA) strategies tailored for LCLMs and conduct extensive experiments on various popular models. Empirical results demonstrate that our attacks can accurately infer membership status in most cases, e.g., 90.66% attack F1-score on Multi-document QA datasets with LongChat-7b-v1.5-32k, highlighting significant risks of membership leakage within LCLMs input contexts. Furthermore, we examine the underlying reasons why LCLMs are susceptible to revealing such membership information.
- Published
- 2024
26. V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception
- Author
-
Yang, Lei, Zhang, Xinyu, Li, Jun, Wang, Chen, Song, Zhiying, Zhao, Tong, Song, Ziying, Wang, Li, Zhou, Mo, Shen, Yang, Wu, Kai, and Lv, Chen
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Modern autonomous vehicle perception systems often struggle with occlusions and limited perception range. Previous studies have demonstrated the effectiveness of cooperative perception in extending the perception range and overcoming occlusions, thereby improving the safety of autonomous driving. In recent years, a series of cooperative perception datasets have emerged. However, these datasets only focus on camera and LiDAR, overlooking 4D Radar, a sensor employed in single-vehicle autonomous driving for robust perception in adverse weather conditions. In this paper, to bridge the gap of missing 4D Radar datasets in cooperative perception, we present V2X-Radar, the first large real-world multi-modal dataset featuring 4D Radar. Our V2X-Radar dataset is collected using a connected vehicle platform and an intelligent roadside unit equipped with 4D Radar, LiDAR, and multi-view cameras. The collected data includes sunny and rainy weather conditions, spanning daytime, dusk, and nighttime, as well as typical challenging scenarios. The dataset comprises 20K LiDAR frames, 40K camera images, and 20K 4D Radar data, with 350K annotated bounding boxes across five categories. To facilitate diverse research domains, we establish V2X-Radar-C for cooperative perception, V2X-Radar-I for roadside perception, and V2X-Radar-V for single-vehicle perception. We further provide comprehensive benchmarks of recent perception algorithms on the above three sub-datasets. The dataset and benchmark codebase will be available at \url{http://openmpd.com/column/V2X-Radar}., Comment: 11 pages, 5 figures
- Published
- 2024
27. Fully Dynamic Adversarially Robust Correlation Clustering in Polylogarithmic Update Time
- Author
-
Braverman, Vladimir, Dharangutte, Prathamesh, Pai, Shreyas, Shah, Vihan, and Wang, Chen
- Subjects
Computer Science - Data Structures and Algorithms ,Computer Science - Machine Learning - Abstract
We study the dynamic correlation clustering problem with $\textit{adaptive}$ edge label flips. In correlation clustering, we are given a $n$-vertex complete graph whose edges are labeled either $(+)$ or $(-)$, and the goal is to minimize the total number of $(+)$ edges between clusters and the number of $(-)$ edges within clusters. We consider the dynamic setting with adversarial robustness, in which the $\textit{adaptive}$ adversary could flip the label of an edge based on the current output of the algorithm. Our main result is a randomized algorithm that always maintains an $O(1)$-approximation to the optimal correlation clustering with $O(\log^{2}{n})$ amortized update time. Prior to our work, no algorithm with $O(1)$-approximation and $\text{polylog}{(n)}$ update time for the adversarially robust setting was known. We further validate our theoretical results with experiments on synthetic and real-world datasets with competitive empirical performances. Our main technical ingredient is an algorithm that maintains $\textit{sparse-dense decomposition}$ with $\text{polylog}{(n)}$ update time, which could be of independent interest.
- Published
- 2024
28. Non-Hermitian Effects in Dicke models
- Author
-
Jiang, Bin, Li, Yi-Yang, Liu, Junjie, Wang, Chen, and Jiang, Jian-Hua
- Subjects
Quantum Physics ,Physics - Optics - Abstract
The Dicke model, which describes the collective interaction between an ensemble of atoms and a single-mode photon field, serves as a fundamental framework for studying light-matter interactions and quantum electrodynamic phenomena. In this work, we investigate the manifestation of non-Hermitian effects in a generalized Dicke model, where two dissipative atom ensembles interact with a single-mode photon field. By applying the Holstein-Primakoff transformation, we explore the system in the semiclassical limit as a non-Hermitian Dicke model, revealing rich exceptional points (EPs) and diabolic points in such a system. We find that, by introducing the nonlinear saturation gain into an atomic ensemble, higher-order EP can be induced, leading to intriguing properties. Furthermore, if the system is extended to a one-dimensional chain, then the band topology will interplay with the non-Hermitian effect. In the quantum regime, we explore the quantum signature of EPs, noting that the conditions for their emergence are influenced by discrete photon numbers. We further study the transition from photon anti-bunching to bunching at a steady state, driven by non-Hermitian dynamics. Our findings deepen the understanding of non-Hermitian physics in light-matter interaction which is instructive for the design of advanced photonic and quantum systems.
- Published
- 2024
29. Object-Centric Dexterous Manipulation from Human Motion Data
- Author
-
Chen, Yuanpei, Wang, Chen, Yang, Yaodong, and Liu, C. Karen
- Subjects
Computer Science - Robotics - Abstract
Manipulating objects to achieve desired goal states is a basic but important skill for dexterous manipulation. Human hand motions demonstrate proficient manipulation capability, providing valuable data for training robots with multi-finger hands. Despite this potential, substantial challenges arise due to the embodiment gap between human and robot hands. In this work, we introduce a hierarchical policy learning framework that uses human hand motion data for training object-centric dexterous robot manipulation. At the core of our method is a high-level trajectory generative model, learned with a large-scale human hand motion capture dataset, to synthesize human-like wrist motions conditioned on the desired object goal states. Guided by the generated wrist motions, deep reinforcement learning is further used to train a low-level finger controller that is grounded in the robot's embodiment to physically interact with the object to achieve the goal. Through extensive evaluation across 10 household objects, our approach not only demonstrates superior performance but also showcases generalization capability to novel object geometries and goal states. Furthermore, we transfer the learned policies from simulation to a real-world bimanual dexterous robot system, further demonstrating its applicability in real-world scenarios. Project website: https://cypypccpy.github.io/obj-dex.github.io/., Comment: 20 pages, 7 figures
- Published
- 2024
30. Revisit Many-body Interaction Heat Current and Thermal Conductivity Calculation in Moment Tensor Potential/LAMMPS Interface
- Author
-
Tai, Siu Ting, Wang, Chen, Cheng, Ruihuan, and Chen, Yue
- Subjects
Condensed Matter - Materials Science - Abstract
The definition of heat current operator for systems for non-pairwise additive interactions and its impact on related lattice thermal conductivity ($\kappa_{L}$) via molecular dynamics simulation (MD) are ambiguous and controversial when migrating from conventional empirical potential models to machine learning potential (MLP) models. Empirical model descriptions are often limited to three- to four-body interaction while a sophisticated representation of the many-body physics could be resembled in MLPs. Herein, we study and compare the significance of many-body interaction to the heat current computation in one of the most popular MLP models, the Moment Tensor Potential (MTP). Non-equilibrium MD simulations and equilibrium MD simulations among four different materials, $PbTe$, amorphous $Sc_{0.2}Sb_{2}Te_{3}$, graphene, and $BAs$, were performed. We found inconsistency between the simulation thermostat and its implemented heat current operator in our non-equilibrium MD results which violate law of energy conservation and suggest a need for revision. We revisit the virial stress tensor expression within the calculator and identified the lack of a generalised many-body heat current description in it. We uncover the influence of the modified heat current formula that could alter the $\kappa_{L}$ results 29% to 64% using the equilibrium MD computational approach. Our work demonstrates the importance of a many-body description during thermal analysis in MD simulations when MLPs are in concern. This work sheds light on a better understanding of the relationship between interatomic interaction and its heat transport mechanism., Comment: 9 pages, 9 figures
- Published
- 2024
31. LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation
- Author
-
Li, Bowen, Li, Zhaoyu, Du, Qiwei, Luo, Jinqi, Wang, Wenshan, Xie, Yaqi, Stepputtis, Simon, Wang, Chen, Sycara, Katia P., Ravikumar, Pradeep Kumar, Gray, Alexander G., Si, Xujie, and Scherer, Sebastian
- Subjects
Computer Science - Artificial Intelligence - Abstract
Recent years have witnessed the rapid development of Neuro-Symbolic (NeSy) AI systems, which integrate symbolic reasoning into deep neural networks. However, most of the existing benchmarks for NeSy AI fail to provide long-horizon reasoning tasks with complex multi-agent interactions. Furthermore, they are usually constrained by fixed and simplistic logical rules over limited entities, making them far from real-world complexities. To address these crucial gaps, we introduce LogiCity, the first simulator based on customizable first-order logic (FOL) for an urban-like environment with multiple dynamic agents. LogiCity models diverse urban elements using semantic and spatial concepts, such as IsAmbulance(X) and IsClose(X, Y). These concepts are used to define FOL rules that govern the behavior of various agents. Since the concepts and rules are abstractions, they can be universally applied to cities with any agent compositions, facilitating the instantiation of diverse scenarios. Besides, a key feature of LogiCity is its support for user-configurable abstractions, enabling customizable simulation complexities for logical reasoning. To explore various aspects of NeSy AI, LogiCity introduces two tasks, one features long-horizon sequential decision-making, and the other focuses on one-step visual reasoning, varying in difficulty and agent behaviors. Our extensive evaluation reveals the advantage of NeSy frameworks in abstract reasoning. Moreover, we highlight the significant challenges of handling more complex abstractions in long-horizon multi-agent scenarios or under high-dimensional, imbalanced data. With its flexible design, various features, and newly raised challenges, we believe LogiCity represents a pivotal step forward in advancing the next generation of NeSy AI. All the code and data are open-sourced at our website., Comment: 25 pages, 8 figures
- Published
- 2024
32. Bridging the Gap: GRB 230812B -- A Three-Second Supernova-Associated Burst Detected by the GRID Mission
- Author
-
Wang, Chen-Yu, Yin, Yi-Han Iris, Zhang, Bin-Bin, Feng, Hua, Zeng, Ming, Xiong, Shao-Lin, Pan, Xiao-Fan, Yang, Jun, Zhang, Yan-Qiu, Li, Chen, Yan, Zhen-Yu, Wang, Chen-Wei, Zheng, Xu-Tao, Liu, Jia-Cong, Wang, Qi-Dong, Yang, Zi-Rui, Li, Long-Hao, Liu, Qi-Ze, Zhao, Zheng-Yang, Hu, Bo, Liu, Yi-Qi, Lu, Si-Yuan, Luo, Zi-You, Cang, Ji-Rong, Cao, De-Zhi, Han, Wen-Tao, Jia, Li-Ping, Pan, Xing-Yu, Tian, Yang, Xu, Ben-Da, Yang, Xiao, and Zeng, Zhi
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
GRB 230812B, detected by the Gamma-Ray Integrated Detectors (GRID) constellation mission, is an exceptionally bright gamma-ray burst (GRB) with a duration of only 3 seconds. Sitting near the traditional boundary ($\sim$ 2 s) between long and short GRBs, GRB 230812B is notably associated with a supernova (SN), indicating a massive star progenitor. This makes it a rare example of a short-duration GRB resulting from stellar collapse. Our analysis, using a time-evolving synchrotron model, suggests that the burst has an emission radius of approximately $10^{14.5}$~cm. We propose that the short duration of GRB 230812B is due to the combined effects of the central engine's activity time and the time required for the jet to break through the stellar envelope. Our findings provide another case that challenges the conventional view that short-duration GRBs originate exclusively from compact object mergers, demonstrating that a broader range of durations exists for GRBs arising from the collapse of massive stars., Comment: 10 pages, 3 tables, 11 figures
- Published
- 2024
33. Advancing Gamma-Ray Burst Identification through Transfer Learning with Convolutional Neural Networks
- Author
-
Zhang, Peng, Li, Bing, Gui, Ren-zhou, Xiong, Shao-lin, Wang, Yu, Zhang, Yan-qiu, Wang, Chen-wei, Liu, Jia-cong, Xue, Wang-chen, Zheng, Chao, Yu, Zheng-hang, and Zhang, Wen-long
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Instrumentation and Methods for Astrophysics - Abstract
The Rapid and accurate identification of Gamma-Ray Bursts (GRBs) is crucial for unraveling their origins. However, current burst search algorithms frequently miss low-threshold signals or lack universality for observations. In this study, we propose a novel approach utilizing transfer learning experiment based on convolutional neural network (CNN) to establish a universal GRB identification method, which validated successfully using GECAM-B data. By employing data augmentation techniques, we enhance the diversity and quantity of the GRB sample. We develop a 1D CNN model with a multi-scale feature cross fusion module (MSCFM) to extract features from samples and perform classification. The comparative results demonstrated significant performance improvements following pre-training and transferring on a large-scale dataset. Our optimal model achieved an impressive accuracy of 96.41% on the source dataset of GECAM-B, and identified three previously undiscovered GRBs by contrast with manual analysis of GECAM-B observations. These innovative transfer learning and data augmentation methods presented in this work hold promise for applications in multi-satellite exploration scenarios characterized by limited data sets and a scarcity of labeled samples in high-energy astronomy., Comment: 17 pages, 7 figures
- Published
- 2024
34. Norfloxacin Sub-Inhibitory Concentration Affects Streptococcus suis Biofilm Formation and Virulence Gene Expression
- Author
-
Liu, Baobao, Yi, Li., Li, Jinpeng, Gong, Shenglong, Dong, Xiao, Wang, Chen, and Wang, Yang
- Published
- 2020
- Full Text
- View/download PDF
35. Biological validation of peak-width of skeletonized mean diffusivity as a VCID biomarker: The MarkVCID Consortium.
- Author
-
Luckey, Alison, Ghosh, Saptaparni, Wang, Chen-Pin, Beiser, Alexa, Bernal, Rebecca, Li, Zhiguang, Mbangdadji, Djass, Fadaee, Elyas, Snoussi, Haykel, Dediós, Angel, Trevino, Hector, Goss, Monica, Hillmer, Laura, Bauer, Christopher, Staffaroni, Adam, Stables, Lara, Albert, Marilyn, Himali, Jayandra, Mosley, Thomas, Forsberg, Lars, Guðnason, Vilmundur, Singh, Baljeet, Singh, Herpreet, Schwab, Kristin, Kramer, Joel, Rosenberg, Gary, Helmer, Karl, Greenberg, Steven, Habes, Mohamad, Wang, Danny, Gold, Brian, Lu, Hanzhang, Caprihan, Arvind, Fornage, Myriam, Launer, Lenore, Arfanakis, Konstantinos, Seshadri, Sudha, Decarli, Charles, Maillard, Pauline, and Satizabal, Claudia
- Subjects
biomarker ,cognitive impairment ,diffusion tensor imaging ,peak‐width of skeletonized mean diffusivity ,small vessel disease ,vascular contributions to cognitive impairment and dementia ,Humans ,Female ,Male ,Aged ,Diffusion Tensor Imaging ,Biomarkers ,Cognitive Dysfunction ,Cerebral Small Vessel Diseases ,Dementia ,Vascular ,Middle Aged ,Brain ,Neuroimaging ,Aged ,80 and over - Abstract
BACKGROUND: Peak-width of skeletonized mean diffusivity (PSMD), a neuroimaging marker of cerebral small vessel disease (SVD), has shown excellent instrumental properties. Here, we extend our work to perform a biological validation of PSMD. METHODS: We included 396 participants from the Biomarkers for Vascular Contributions to Cognitive Impairment and Dementia (MarkVCID-1) Consortium and three replication samples (Cohorts for Heart and Aging Research in Genomic Epidemiology = 6172, Rush University Medical Center = 287, University of California Davis Alzheimers Disease Research Center = 567). PSMD was derived from diffusion tensor imaging using an automated algorithm. We related PSMD to a composite measure of general cognitive function using linear regression models adjusting for confounders. RESULTS: Higher PSMD was associated with lower general cognition in MarkVCID-1 independent of age, sex, education, and intracranial volume (Beta [95% confidence interval], -0.8 [-1.2, -0.4], P
- Published
- 2024
36. Handling delayed or missed direct oral anticoagulant doses: model-informed individual remedial dosing.
- Author
-
Liu, Xiao-Qin, Li, Ziran, Wang, Chen-Yu, and Jiao, Zheng
- Subjects
Humans ,Administration ,Oral ,Anticoagulants ,Models ,Biological - Abstract
Nonadherence to direct oral anticoagulant (DOAC) pharmacotherapy may increase the risk of thromboembolism or bleeding, and delayed or missed doses are the most common types of nonadherence. Current recommendations from regulatory agencies or guidelines regarding this issue lack evidence and fail to consider individual differences. This study aimed to develop individual remedial dosing strategies when the dose was delayed or missed for DOACs, including rivaroxaban, apixaban, edoxaban, and dabigatran etexilate. Remedial dosing regimens based on population pharmacokinetic (PK)-pharmacodynamic (PD) modeling and simulation strategies were developed to expeditiously restore drug concentration or PD biomarkers within the therapeutic range. Population PK-PD characteristics of DOACs were retrieved from previously published literature. The effects of factors that influence PK and PD parameters were assessed for their impact on remedial dosing regimens. A web-based dashboard was established with R-shiny to recommend remedial dosing regimens based on patient traits, dosing schedules, and delay duration. Addressing delayed or missed doses relies on the delay time and specific DOACs involved. Additionally, age, body weight, renal function, and polypharmacy may marginally affect remedial strategies. The proposed remedial dosing strategies surpass current recommendations, with less deviation time beyond the therapeutic range. The online dashboard offers quick and convenient solutions for addressing missed or delayed DOACs, enabling individualized remedial dosing strategies based on patient characteristics to mitigate the risks of bleeding and thrombosis.
- Published
- 2024
37. The human milk oligosaccharide 3′sialyllactose reduces low-grade inflammation and atherosclerosis development in mice
- Author
-
Pessentheiner, Ariane R, Spann, Nathanael J, Autran, Chloe A, Oh, Tae Gyu, Grunddal, Kaare V, Coker, Joanna KC, Painter, Chelsea D, Ramms, Bastian, Chiang, Austin WT, Wang, Chen-Yi, Hsiao, Jason, Wang, Yiwen, Quach, Anthony, Booshehri, Laela M, Hammond, Alexandra, Tognaccini, Chiara, Latasiewicz, Joanna, Willemsen, Lisa, Zengler, Karsten, de Winther, Menno PJ, Hoffman, Hal M, Philpott, Martin, Cribbs, Adam P, Oppermann, Udo, Lewis, Nathan E, Witztum, Joseph L, Yu, Ruth, Atkins, Annette R, Downes, Michael, Evans, Ron M, Glass, Christopher K, Bode, Lars, and Gordts, Philip LSM
- Subjects
Biomedical and Clinical Sciences ,Genetics ,Cardiovascular ,Nutrition ,Atherosclerosis ,2.1 Biological and endogenous factors ,1.1 Normal biological development and functioning ,Good Health and Well Being ,Animals ,Milk ,Human ,Mice ,Oligosaccharides ,Humans ,Inflammation ,Macrophages ,Disease Models ,Animal ,Female ,Toll-Like Receptor 4 ,Mice ,Inbred C57BL ,Male ,Cell biology ,Epigenetics ,Biomedical and clinical sciences ,Health sciences - Abstract
Macrophages contribute to the induction and resolution of inflammation and play a central role in chronic low-grade inflammation in cardiovascular diseases caused by atherosclerosis. Human milk oligosaccharides (HMOs) are complex unconjugated glycans unique to human milk that benefit infant health and act as innate immune modulators. Here, we identify the HMO 3'sialyllactose (3'SL) as a natural inhibitor of TLR4-induced low-grade inflammation in macrophages and endothelium. Transcriptome analysis in macrophages revealed that 3'SL attenuates mRNA levels of a selected set of inflammatory genes and promotes the activity of liver X receptor (LXR) and sterol regulatory element binding protein-1 (SREBP1). These acute antiinflammatory effects of 3'SL were associated with reduced histone H3K27 acetylation at a subset of LPS-inducible enhancers distinguished by preferential enrichment for CCCTC-binding factor (CTCF), IFN regulatory factor 2 (IRF2), B cell lymphoma 6 (BCL6), and other transcription factor recognition motifs. In a murine atherosclerosis model, both s.c. and oral administration of 3'SL significantly reduced atherosclerosis development and the associated inflammation. This study provides evidence that 3'SL attenuates inflammation by a transcriptional mechanism to reduce atherosclerosis development in the context of cardiovascular disease.
- Published
- 2024
38. EgoMimic: Scaling Imitation Learning via Egocentric Video
- Author
-
Kareer, Simar, Patel, Dhruv, Punamiya, Ryan, Mathur, Pranay, Cheng, Shuo, Wang, Chen, Hoffman, Judy, and Xu, Danfei
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
The scale and diversity of demonstration data required for imitation learning is a significant challenge. We present EgoMimic, a full-stack framework which scales manipulation via human embodiment data, specifically egocentric human videos paired with 3D hand tracking. EgoMimic achieves this through: (1) a system to capture human embodiment data using the ergonomic Project Aria glasses, (2) a low-cost bimanual manipulator that minimizes the kinematic gap to human data, (3) cross-domain data alignment techniques, and (4) an imitation learning architecture that co-trains on human and robot data. Compared to prior works that only extract high-level intent from human videos, our approach treats human and robot data equally as embodied demonstration data and learns a unified policy from both data sources. EgoMimic achieves significant improvement on a diverse set of long-horizon, single-arm and bimanual manipulation tasks over state-of-the-art imitation learning methods and enables generalization to entirely new scenes. Finally, we show a favorable scaling trend for EgoMimic, where adding 1 hour of additional hand data is significantly more valuable than 1 hour of additional robot data. Videos and additional information can be found at https://egomimic.github.io/
- Published
- 2024
39. Observation of Anderson localization transitions in a two-dimensional conjugated metal-organic framework
- Author
-
Cheng, Jinhao, Wang, Chen, He, Wenxue, Wang, Jiaojiao, Pang, Yifan, Yang, Fan, Ding, Shuaishuai, Ren, Hechen, and Hu, Wenping
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Materials Science - Abstract
Anderson localization transitions are a universal quantum phenomenon sensitive to the disorder and dimensionality of electronic systems. Over the past decades, this intriguing topic has inspired overwhelmingly more theoretical studies than experimental verifications due to the difficulty of controlling a material's disorder or dimensionality without modifying its fundamental electronic properties. Organic crystals with their rich disorders would be terrific playgrounds to investigate such disorder-driven phase transitions except for their low conductivities which usually prohibit low-temperature measurements. Here, we conduct systematic transport experiments in mesoscopic devices made with copper benzenehexathiol thin films across a wide range of thicknesses. We find metal-insulator transitions both among three-dimensional samples with different disorder strengths and between three-dimensional and quasi-two-dimensional samples. Temperature-dependence analysis of the conductivities corroborates the dimensionality crossover. Moreover, our theoretical modeling provides a basis for understanding both types of metal-insulator transitions within the framework of Anderson localization transitions. Our findings establish for the first time that organic crystals such as conductive metal-organic frameworks can exhibit such quantum interference effects. With organic materials' versatile chemical designs and crystalline structures, our work opens new avenues to search for novel quantum phenomena in organic material platforms.
- Published
- 2024
40. Revisiting the Velocity Dispersion-Size Relation in Molecular Cloud Structures
- Author
-
Feng, Haoran, Chen, Zhiwei, Jiang, Zhibo, Ma, Yuehui, Yang, Yang, Yu, Shuling, Ge, Dongqing, Zhou, Wei, Du, Fujun, Wang, Chen, Zhang, Shiyu, Su, Yang, and Yang, Ji
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Structures in molecular ISM are observed to follow a power-law relation between the velocity dispersion and spatial size, known as Larson's first relation, which is often attributed to the turbulent nature of molecular ISM and imprints the dynamics of molecular cloud structures. Using the ${}^{13}\mathrm{CO}~(J=1-0)$ data from the Milky Way Imaging Scroll Painting survey, we built a sample with 360 structures having relatively accurate distances obtained from either the reddened background stars with Gaia parallaxes or associated maser parallaxes, spanning from $0.4$ to $\sim 15~\mathrm{kpc}$. Using this sample and about 0.3 million pixels, we analyzed the correlations between velocity dispersion, surface/column density, and spatial scales. Our structure-wise results show power-law indices smaller than 0.5 in both the $\sigma_v$-$R_{\mathrm{eff}}$ and $\sigma_v$-$R_{\mathrm{eff}} \cdot \Sigma$ relations. In the pixel-wise results, the $\sigma_v^{\mathrm{pix}}$ is statistically scaling with the beam physical size ($R_{\mathrm{s}} \equiv \Theta D/2$) in form of $\sigma_v^{\mathrm{pix}} \propto R_{\mathrm{s}}^{0.43 \pm 0.03}$. Meanwhile, $\sigma_v^{\mathrm{pix}}$ in the inner Galaxy is statistically larger than the outer side. We also analyzed correlations between $\sigma_v^{\mathrm{pix}}$ and the $\mathrm{H_2}$ column density $N(\mathrm{H_2})$, finding that $\sigma_v^{\mathrm{pix}}$ stops increasing with $N(\mathrm{H_2})$ after $\gtrsim 10^{22}~{\mathrm{cm^{-2}}}$. The structures with and without high-column-density ($> 10^{22}~\mathrm{cm^{-2}}$) pixels show different $\sigma_v^{\mathrm{pix}} \propto N(\mathrm{H_2})^{\xi}$ relations, where the mean (std) $\xi$ values are $0.38~(0.14)$ and $0.62~(0.27)$, respectively., Comment: 23 pages, 12 figures, accepted for publication in Research in Astronomy and Astrophysics
- Published
- 2024
41. Solving Prior Distribution Mismatch in Diffusion Models via Optimal Transport
- Author
-
Wang, Zhanpeng, Li, Shenghao, Wang, Chen, Cao, Shuting, Lei, Na, and Luo, Zhongxuan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In recent years, the knowledge surrounding diffusion models(DMs) has grown significantly, though several theoretical gaps remain. Particularly noteworthy is prior error, defined as the discrepancy between the termination distribution of the forward process and the initial distribution of the reverse process. To address these deficiencies, this paper explores the deeper relationship between optimal transport(OT) theory and DMs with discrete initial distribution. Specifically, we demonstrate that the two stages of DMs fundamentally involve computing time-dependent OT. However, unavoidable prior error result in deviation during the reverse process under quadratic transport cost. By proving that as the diffusion termination time increases, the probability flow exponentially converges to the gradient of the solution to the classical Monge-Amp\`ere equation, we establish a vital link between these fields. Therefore, static OT emerges as the most intrinsic single-step method for bridging this theoretical potential gap. Additionally, we apply these insights to accelerate sampling in both unconditional and conditional generation scenarios. Experimental results across multiple image datasets validate the effectiveness of our approach.
- Published
- 2024
42. Do LLMs Have the Generalization Ability in Conducting Causal Inference?
- Author
-
Wang, Chen, Zhao, Dongming, Wang, Bo, He, Ruifang, and Hou, Yuexian
- Subjects
Computer Science - Computation and Language - Abstract
In causal inference, generalization capability refers to the ability to conduct causal inference methods on new data to estimate the causal-effect between unknown phenomenon, which is crucial for expanding the boundaries of knowledge. Studies have evaluated the causal inference capabilities of Large Language Models (LLMs) concerning known phenomena, yet the generalization capabilities of LLMs concerning unseen phenomena remain unexplored. In this paper, we selected four tasks: Causal Path Discovery (CP), Backdoor Adjustment (BA), Factual Inference (FI), and Counterfactual Inference (CI) as representatives of causal inference tasks. To generate evaluation questions about previously unseen phenomena in new data on the four tasks, we propose a benchmark generation framework, which employs randomly generated graphs and node names to formulate questions within hypothetical new causal scenarios. Based on this framework, we compile a benchmark dataset of varying levels of question complexity. We extensively tested the generalization capabilities of five leading LLMs across four tasks. Experiment results reveal that while LLMs exhibit good generalization performance in solving simple CP, FI, and complex CI questions, they encounter difficulties when tackling BA questions and face obvious performance fluctuations as the problem complexity changes. Furthermore, when the names of phenomena incorporate existing terms, even if these names are entirely novel, their generalization performance can still be hindered by interference from familiar terms.
- Published
- 2024
43. Blue straggler stars
- Author
-
Wang, Chen and Ryu, Taeho
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Blue straggler stars are unique main-sequence stars that appear more luminous, hotter, and therefore younger, than their coeval counterparts. In star clusters, these stars are located above the cluster turn-off in the Hertzsprung-Russell diagram or color-magnitude diagram. First identified in the 1950s, these stars are found across diverse environments, from sparse galactic fields to dense star clusters. They are crucial for understanding stellar and binary evolution and star cluster dynamics. Despite extensive research, many challenges concerning their properties and origin mechanisms remain unresolved. This chapter delves into the properties and origins of blue stragglers, examining how theoretical tools are employed to study them and the implications of each proposed formation mechanism. We assess how contemporary observational data either support or challenge these theoretical predictions. Continued theoretical and observational efforts are essential for advancing our understanding of these enigmatic stars., Comment: This is a pre-print of a chapter for the Encyclopedia of Astrophysics (edited by I. Mandel, section editor F.R.N. Schneider) to be published by Elsevier as a Reference Module
- Published
- 2024
44. Stripped helium-star and compact object binaries in coeval populations -- predictions based on detailed binary evolution models
- Author
-
Wang, Chen, Bodensteiner, Julia, Xu, Xiao-Tian, de Mink, Selma E., Langer, Norbert, Laplace, Eva, Vigna-Gómez, Alejandro, Justham, Stephen, Klencki, Jakub, Olejak, Aleksandra, Valli, Ruggero, and Schootemeijer, Abel
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Massive stars mainly form in close binaries, where their mutual interactions can profoundly alter their evolutionary paths. Evolved binaries consisting of a massive OB-type main-sequence star with a stripped helium star or a compact companion represent a crucial stage in the evolution towards double compact objects, whose mergers are (potentially) detectable via gravitational waves. The recent detection of X-ray quiet OB+black hole binaries and OB+stripped helium star binaries has set the stage for discovering more of these systems in the near future. In this work, based on 3670 detailed binary-evolution models and using empirical distributions of initial binary parameters, we compute the expected population of such evolved massive binaries in coeval stellar populations, including stars in star clusters and in galaxies with starburst activities, for ages up to 100 Myr. Our results are vividly illustrated in an animation that shows the evolution of these binaries in the color-magnitude diagram over time. We find that the number of OB+black hole binaries peaks around 10 Myr, and OB+neutron star binaries are most abundant at approximately 20 Myr. Both black holes and neutron stars can potentially be found in populations with ages up to 90 Myr. Additionally, we analyze the properties of such binaries at specific ages. We find that OB+helium stars and OB+black hole binaries are likely to be identifiable as single-lined spectroscopic binaries. Our research serves as a guide for future observational efforts to discover such binaries in young star clusters and starburst environments., Comment: 32 pages, 15 figures. Accepted for Publication in Astrophysical Journal Letters (ApJL)
- Published
- 2024
45. Language Imbalance Driven Rewarding for Multilingual Self-improving
- Author
-
Yang, Wen, Wu, Junhong, Wang, Chen, Zong, Chengqing, and Zhang, Jiajun
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large Language Models (LLMs) have achieved state-of-the-art performance across numerous tasks. However, these advancements have predominantly benefited "first-class" languages such as English and Chinese, leaving many other languages underrepresented. This imbalance, while limiting broader applications, generates a natural preference ranking between languages, offering an opportunity to bootstrap the multilingual capabilities of LLM in a self-improving manner. Thus, we propose $\textit{Language Imbalance Driven Rewarding}$, where the inherent imbalance between dominant and non-dominant languages within LLMs is leveraged as a reward signal. Iterative DPO training demonstrates that this approach not only enhances LLM performance in non-dominant languages but also improves the dominant language's capacity, thereby yielding an iterative reward signal. Fine-tuning Meta-Llama-3-8B-Instruct over two iterations of this approach results in continuous improvements in multilingual performance across instruction-following and arithmetic reasoning tasks, evidenced by an average improvement of 7.46% win rate on the X-AlpacaEval leaderboard and 13.9% accuracy on the MGSM benchmark. This work serves as an initial exploration, paving the way for multilingual self-improvement of LLMs., Comment: Work in progress
- Published
- 2024
46. ARCap: Collecting High-quality Human Demonstrations for Robot Learning with Augmented Reality Feedback
- Author
-
Chen, Sirui, Wang, Chen, Nguyen, Kaden, Fei-Fei, Li, and Liu, C. Karen
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
Recent progress in imitation learning from human demonstrations has shown promising results in teaching robots manipulation skills. To further scale up training datasets, recent works start to use portable data collection devices without the need for physical robot hardware. However, due to the absence of on-robot feedback during data collection, the data quality depends heavily on user expertise, and many devices are limited to specific robot embodiments. We propose ARCap, a portable data collection system that provides visual feedback through augmented reality (AR) and haptic warnings to guide users in collecting high-quality demonstrations. Through extensive user studies, we show that ARCap enables novice users to collect robot-executable data that matches robot kinematics and avoids collisions with the scenes. With data collected from ARCap, robots can perform challenging tasks, such as manipulation in cluttered environments and long-horizon cross-embodiment manipulation. ARCap is fully open-source and easy to calibrate; all components are built from off-the-shelf products. More details and results can be found on our website: https://stanford-tml.github.io/ARCap, Comment: 8 pages, 8 Figures, submitted to ICRA 2025
- Published
- 2024
47. Automated Creation of Digital Cousins for Robust Policy Learning
- Author
-
Dai, Tianyuan, Wong, Josiah, Jiang, Yunfan, Wang, Chen, Gokmen, Cem, Zhang, Ruohan, Wu, Jiajun, and Fei-Fei, Li
- Subjects
Computer Science - Robotics - Abstract
Training robot policies in the real world can be unsafe, costly, and difficult to scale. Simulation serves as an inexpensive and potentially limitless source of training data, but suffers from the semantics and physics disparity between simulated and real-world environments. These discrepancies can be minimized by training in digital twins, which serve as virtual replicas of a real scene but are expensive to generate and cannot produce cross-domain generalization. To address these limitations, we propose the concept of digital cousins, a virtual asset or scene that, unlike a digital twin, does not explicitly model a real-world counterpart but still exhibits similar geometric and semantic affordances. As a result, digital cousins simultaneously reduce the cost of generating an analogous virtual environment while also facilitating better robustness during sim-to-real domain transfer by providing a distribution of similar training scenes. Leveraging digital cousins, we introduce a novel method for their automated creation, and propose a fully automated real-to-sim-to-real pipeline for generating fully interactive scenes and training robot policies that can be deployed zero-shot in the original scene. We find that digital cousin scenes that preserve geometric and semantic affordances can be produced automatically, and can be used to train policies that outperform policies trained on digital twins, achieving 90% vs. 25% success rates under zero-shot sim-to-real transfer. Additional details are available at https://digital-cousins.github.io/., Comment: CoRL 2024
- Published
- 2024
48. Item Cluster-aware Prompt Learning for Session-based Recommendation
- Author
-
Yang, Wooseong, Wang, Chen, Song, Zihe, Zhang, Weizhi, and Yu, Philip S.
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Session-based recommendation (SBR) aims to capture dynamic user preferences by analyzing item sequences within individual sessions. However, most existing approaches focus mainly on intra-session item relationships, neglecting the connections between items across different sessions (inter-session relationships), which limits their ability to fully capture complex item interactions. While some methods incorporate inter-session information, they often suffer from high computational costs, leading to longer training times and reduced efficiency. To address these challenges, we propose the CLIP-SBR (Cluster-aware Item Prompt learning for Session-Based Recommendation) framework. CLIP-SBR is composed of two modules: 1) an item relationship mining module that builds a global graph to effectively model both intra- and inter-session relationships, and 2) an item cluster-aware prompt learning module that uses soft prompts to integrate these relationships into SBR models efficiently. We evaluate CLIP-SBR across eight SBR models and three benchmark datasets, consistently demonstrating improved recommendation performance and establishing CLIP-SBR as a robust solution for session-based recommendation tasks., Comment: 9 pages
- Published
- 2024
49. The Observed O VI is Just the Tip of the Iceberg: Estimating the Hidden Material in Circumgalactic and Intergalactic Clouds
- Author
-
Goetz, Eric, Wang, Chen, and Shelton, Robin
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
This paper proposes a new method for estimating the total quantity of material in moving circumgalactic and intergalactic clouds from O VI measurements. We simulate high-velocity clouds (HVCs) with the FLASH hydrodynamic code and track the ionization and recombination of all ionization levels of oxygen as a function of time. We calculate the O VI/oxygen ratio ($f_{\rm O VI}$) in our dynamic NEI clouds, finding that it differs significantly from that in static gas. We find that O VI exists in cool, medium, and hot gas in the clouds. As such, it traces all of the hydrogen rather than merely the ionized hydrogen. The total quantity of hydrogen along a typical observed line of sight through a cloud can be estimated from the observed O VI column density, metallicity, and our $f_{\rm O VI}$. We provide the simulations' $f_{\rm O VI}$, a prescription for finding $f_{\rm O VI}$ for observed dynamic clouds, and a methodology for calculating the total hydrogen column density from this $f_{\rm O VI}$ and an observed O VI column density. As examples, we use our $f_{\rm O VI}$ to estimate the total hydrogen column densities along various observed sight lines through two HVCs, Complex C and the Magellanic Stream, finding that these clouds contain more material than the previous lower limits. We also extend this analysis to {low-redshift} intergalactic O VI clouds, finding that they contain several times more baryonic material than previously thought and therefore may account for a significant fraction of the Universe's baryons., Comment: 15 pages, 9 figures
- Published
- 2024
- Full Text
- View/download PDF
50. Multimodal Alignment of Histopathological Images Using Cell Segmentation and Point Set Matching for Integrative Cancer Analysis
- Author
-
Jiang, Jun, Moore, Raymond, Novotny, Brenna, Liu, Leo, Fogarty, Zachary, Guo, Ray, Svetomir, Markovic, and Wang, Chen
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Quantitative Biology - Quantitative Methods - Abstract
Histopathological imaging is vital for cancer research and clinical practice, with multiplexed Immunofluorescence (MxIF) and Hematoxylin and Eosin (H&E) providing complementary insights. However, aligning different stains at the cell level remains a challenge due to modality differences. In this paper, we present a novel framework for multimodal image alignment using cell segmentation outcomes. By treating cells as point sets, we apply Coherent Point Drift (CPD) for initial alignment and refine it with Graph Matching (GM). Evaluated on ovarian cancer tissue microarrays (TMAs), our method achieves high alignment accuracy, enabling integration of cell-level features across modalities and generating virtual H&E images from MxIF data for enhanced clinical interpretation., Comment: initial version
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.