213,344 results on '"WANG, WEI"'
Search Results
2. Association between single nucleotide polymorphisms in TYW5 locus and beef amino acids content in shuxuan cattle
- Author
-
Jia, Xianbo, Chen, Shiyi, Wang, Jie, Fu, Maozhong, Yi, Jun, Fang, Donghui, Wang, Wei, and Lai, Songjia
- Published
- 2022
- Full Text
- View/download PDF
3. Body Size Traits and Association with the Genetic Polymorphism of Melatonin Receptor 1A (MTNR1A) Gene in Shuxuan Cattle
- Author
-
Wang, Wei, Jia, Xian-bo, Gan, Jia, Fang, Dong-hui, Shi, Yi, He, Fang, and Yi, Jun
- Published
- 2021
- Full Text
- View/download PDF
4. Large Language Models Are Innate Crystal Structure Generators
- Author
-
Gan, Jingru, Zhong, Peichen, Du, Yuanqi, Zhu, Yanqiao, Duan, Chenru, Wang, Haorui, Gomes, Carla P., Persson, Kristin A., Schwalbe-Koda, Daniel, and Wang, Wei
- Subjects
Condensed Matter - Materials Science ,Computer Science - Machine Learning - Abstract
Crystal structure generation is fundamental to materials discovery, enabling the prediction of novel materials with desired properties. While existing approaches leverage Large Language Models (LLMs) through extensive fine-tuning on materials databases, we show that pre-trained LLMs can inherently generate stable crystal structures without additional training. Our novel framework MatLLMSearch integrates pre-trained LLMs with evolutionary search algorithms, achieving a 78.38% metastable rate validated by machine learning interatomic potentials and 31.7% DFT-verified stability via quantum mechanical calculations, outperforming specialized models such as CrystalTextLLM. Beyond crystal structure generation, we further demonstrate that our framework can be readily adapted to diverse materials design tasks, including crystal structure prediction and multi-objective optimization of properties such as deformation energy and bulk modulus, all without fine-tuning. These results establish pre-trained LLMs as versatile and effective tools for materials discovery, opening up new venues for crystal structure generation with reduced computational overhead and broader accessibility., Comment: Preprint, 18 pages
- Published
- 2025
5. Finding Local Diffusion Schr\'odinger Bridge using Kolmogorov-Arnold Network
- Author
-
Qiu, Xingyu, Yang, Mengying, Ma, Xinghua, Li, Fanding, Liang, Dong, Luo, Gongning, Wang, Wei, Wang, Kuanquan, and Li, Shuo
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In image generation, Schr\"odinger Bridge (SB)-based methods theoretically enhance the efficiency and quality compared to the diffusion models by finding the least costly path between two distributions. However, they are computationally expensive and time-consuming when applied to complex image data. The reason is that they focus on fitting globally optimal paths in high-dimensional spaces, directly generating images as next step on the path using complex networks through self-supervised training, which typically results in a gap with the global optimum. Meanwhile, most diffusion models are in the same path subspace generated by weights $f_A(t)$ and $f_B(t)$, as they follow the paradigm ($x_t = f_A(t)x_{Img} + f_B(t)\epsilon$). To address the limitations of SB-based methods, this paper proposes for the first time to find local Diffusion Schr\"odinger Bridges (LDSB) in the diffusion path subspace, which strengthens the connection between the SB problem and diffusion models. Specifically, our method optimizes the diffusion paths using Kolmogorov-Arnold Network (KAN), which has the advantage of resistance to forgetting and continuous output. The experiment shows that our LDSB significantly improves the quality and efficiency of image generation using the same pre-trained denoising network and the KAN for optimising is only less than 0.1MB. The FID metric is reduced by \textbf{more than 15\%}, especially with a reduction of 48.50\% when NFE of DDIM is $5$ for the CelebA dataset. Code is available at https://github.com/Qiu-XY/LDSB., Comment: 16 pages, 10 figures, to be published in CVPR 2025
- Published
- 2025
6. CLIP-SENet: CLIP-based Semantic Enhancement Network for Vehicle Re-identification
- Author
-
Lu, Liping, Fu, Zihao, Chu, Duanfeng, Wang, Wei, and Xu, Bingrong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Vehicle re-identification (Re-ID) is a crucial task in intelligent transportation systems (ITS), aimed at retrieving and matching the same vehicle across different surveillance cameras. Numerous studies have explored methods to enhance vehicle Re-ID by focusing on semantic enhancement. However, these methods often rely on additional annotated information to enable models to extract effective semantic features, which brings many limitations. In this work, we propose a CLIP-based Semantic Enhancement Network (CLIP-SENet), an end-to-end framework designed to autonomously extract and refine vehicle semantic attributes, facilitating the generation of more robust semantic feature representations. Inspired by zero-shot solutions for downstream tasks presented by large-scale vision-language models, we leverage the powerful cross-modal descriptive capabilities of the CLIP image encoder to initially extract general semantic information. Instead of using a text encoder for semantic alignment, we design an adaptive fine-grained enhancement module (AFEM) to adaptively enhance this general semantic information at a fine-grained level to obtain robust semantic feature representations. These features are then fused with common Re-ID appearance features to further refine the distinctions between vehicles. Our comprehensive evaluation on three benchmark datasets demonstrates the effectiveness of CLIP-SENet. Our approach achieves new state-of-the-art performance, with 92.9% mAP and 98.7% Rank-1 on VeRi-776 dataset, 90.4% Rank-1 and 98.7% Rank-5 on VehicleID dataset, and 89.1% mAP and 97.9% Rank-1 on the more challenging VeRi-Wild dataset.
- Published
- 2025
7. Hyper-active repeating fast radio bursts from rotation modulated starquakes on magnetars
- Author
-
Luo, Jia-Wei, Niu, Jia-Rui, Wang, Wei-Yang, Zhang, Yong-Kun, Zhou, De-Jiang, Xu, Heng, Wang, Pei, Niu, Chen-Hui, Zhang, Zhen-Hui, Zhang, Shuai, Cai, Ce, Han, Jin-Lin, Li, Di, Lee, Ke-Jia, Zhu, Wei-Wei, and Zhang, Bing
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
The non-detection of periodicity related to rotation challenges the magnetar model for fast radio bursts (FRBs). Moreover, a bimodal distribution of the burst waiting times is widely observed in hyper-active FRBs, a significant deviation from the exponential distribution expected from stationary Poisson processes. By combining the epidemic-type aftershock sequence (ETAS) earthquake model and the rotating vector model (RVM) involving the rotation of the magnetar and orientations of the spin and magnetic axes, we find that starquake events modulated by the rotation of FRB-emitting magnetar can explain the bimodal distribution of FRB waiting times, as well as the non-detection of periodicity in active repeating FRBs. We analyze data from multiple FRB sources, demonstrating that differences in waiting time distributions and observed energies can be explained by varying parameters related to magnetar properties and starquake dynamics. Our results suggest that rotation-modulated starquakes on magnetars can possibly be a unified source for FRBs. Notably, we find that active repeaters tend to have small magnetic inclination angles in order to hide their periodicity. We also show that our model can reproduce the waiting time distribution of a pulsar phase of the galactic magnetar SGR J1935+2154 with a larger inclination angle than the active repeaters, which could explain the detection of spin period and the relatively low observed energy for FRBs from the magnetar. The spin periods of active repeaters are not well constrained, but most likely fall in the valley region between the two peaks of the waiting time distributions., Comment: 27 Pages, 18 Figures, 2 Tables. Submitted to ApJ
- Published
- 2025
8. A Joint Learning Framework for Bridging Defect Prediction and Interpretation
- Author
-
Xu, Guifang, Zhu, Zhiling, Guo, Xingcheng, and Wang, Wei
- Subjects
Computer Science - Software Engineering - Abstract
Over the past fifty years, numerous software defect prediction (SDP) approaches have been proposed. However, the ability to explain why predictors make certain predictions remains limited. Explainable SDP has emerged as a promising solution by using explainable artificial intelligence (XAI) methods to clarify the decision-making processes of predictors. Despite this progress, there is still significant potential to enhance the reliability of existing approaches. To address this limitation, we treat defect prediction and the corresponding interpretation as two distinct but closely related tasks and propose a joint learning framework that allows for the simultaneous training of the predictor and its interpreter. The novelty of our approach lies in two main aspects: 1. We design feedback loops that convey the decision-making logic from the predictor to the interpreter. This ensures a high level of conciseness in decision logic and feature engineering for both the predictor and the interpreter, enabling the interpreter to achieve reliable local and global interpretability. 2. We incorporate the interpretation results as a penalty term in the loss function of the joint-learning framework. This not only improves the accuracy of the predictor but also imposes a stronger constraint on the reliability of the interpreter. We validated our proposed method against several existing explainable SDPs across multiple datasets. The results demonstrate its effectiveness in both interpretation and defect prediction. The source code for the proposed method is available at: https://github.com/BugPredictor/software-defect-prediction.git, Comment: This paper is currently under review and being revised in response to reviewer comments. The current version includes revisions addressing reviewer comments
- Published
- 2025
9. Concept Corrector: Erase concepts on the fly for text-to-image diffusion models
- Author
-
Meng, Zheling, Peng, Bo, Jin, Xiaochuan, Lyu, Yueming, Wang, Wei, and Dong, Jing
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Text-to-image diffusion models have demonstrated the underlying risk of generating various unwanted content, such as sexual elements. To address this issue, the task of concept erasure has been introduced, aiming to erase any undesired concepts that the models can generate. Previous methods, whether training-based or training-free, have primarily focused on the input side, i.e. texts. However, they often suffer from incomplete erasure due to limitations in the generalization from limited prompts to diverse image content. In this paper, motivated by the notion that concept erasure on the output side, i.e. generated images, may be more direct and effective, we propose to check concepts based on intermediate-generated images and correct them in the remainder of the generation process. Two key challenges are identified, i.e. determining the presence of target concepts during generation and replacing them on the fly. Leveraging the generation mechanism of diffusion models, we present the Concept Corrector, which incorporates the Generation Check Mechanism and the Concept Removal Attention. This method can identify the generated features associated with target concepts and replace them using pre-defined negative prompts, thereby achieving concept erasure. It requires no changes to model parameters and only relies on a given concept name and its replacement content. To the best of our knowledge, this is the first erasure method based on intermediate-generated images. The experiments on various concepts demonstrate its impressive erasure performance. Code: https://github.com/RichardSunnyMeng/ConceptCorrector.
- Published
- 2025
10. Protein Large Language Models: A Comprehensive Survey
- Author
-
Xiao, Yijia, Zhao, Wanjia, Zhang, Junkai, Jin, Yiqiao, Zhang, Han, Ren, Zhicheng, Sun, Renliang, Wang, Haixin, Wan, Guancheng, Lu, Pan, Luo, Xiao, Zhang, Yu, Zou, James, Sun, Yizhou, and Wang, Wei
- Subjects
Quantitative Biology - Biomolecules ,Computer Science - Artificial Intelligence ,Computer Science - Computational Engineering, Finance, and Science ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Protein-specific large language models (Protein LLMs) are revolutionizing protein science by enabling more efficient protein structure prediction, function annotation, and design. While existing surveys focus on specific aspects or applications, this work provides the first comprehensive overview of Protein LLMs, covering their architectures, training datasets, evaluation metrics, and diverse applications. Through a systematic analysis of over 100 articles, we propose a structured taxonomy of state-of-the-art Protein LLMs, analyze how they leverage large-scale protein sequence data for improved accuracy, and explore their potential in advancing protein engineering and biomedical research. Additionally, we discuss key challenges and future directions, positioning Protein LLMs as essential tools for scientific discovery in protein science. Resources are maintained at https://github.com/Yijia-Xiao/Protein-LLM-Survey., Comment: 24 pages, 4 figures, 5 tables
- Published
- 2025
11. Constructing a Norm for Children's Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models
- Author
-
Zhang, Yi, Wei, Fan, Li, Jingyi, Wang, Yan, Yu, Yanyan, Chen, Jianli, Cai, Zipo, Liu, Xinyu, Wang, Wei, Wang, Peng, and Wang, Zhong
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
The use of children's drawings to examining their conceptual understanding has been proven to be an effective method, but there are two major problems with previous research: 1. The content of the drawings heavily relies on the task, and the ecological validity of the conclusions is low; 2. The interpretation of drawings relies too much on the subjective feelings of the researchers. To address this issue, this study uses the Large Language Model (LLM) to identify 1420 children's scientific drawings (covering 9 scientific themes/concepts), and uses the word2vec algorithm to calculate their semantic similarity. The study explores whether there are consistent drawing representations for children on the same theme, and attempts to establish a norm for children's scientific drawings, providing a baseline reference for follow-up children's drawing research. The results show that the representation of most drawings has consistency, manifested as most semantic similarity greater than 0.8. At the same time, it was found that the consistency of the representation is independent of the accuracy (of LLM's recognition), indicating the existence of consistency bias. In the subsequent exploration of influencing factors, we used Kendall rank correlation coefficient to investigate the effects of Sample Size, Abstract Degree, and Focus Points on drawings, and used word frequency statistics to explore whether children represented abstract themes/concepts by reproducing what was taught in class.
- Published
- 2025
12. Bridging the Gap: Transforming Natural Language Questions into SQL Queries via Abstract Query Pattern and Contextual Schema Markup
- Author
-
Kong, Yonghui, Hu, Hongbing, Zhang, Dan, Chai, Siyuan, Zhang, Fan, and Wang, Wei
- Subjects
Computer Science - Computation and Language - Abstract
Large language models have demonstrated excellent performance in many tasks, including Text-to-SQL, due to their powerful in-context learning capabilities. They are becoming the mainstream approach for Text-to-SQL. However, these methods still have a significant gap compared to human performance, especially on complex questions. As the complexity of questions increases, the gap between questions and SQLs increases. We identify two important gaps: the structural mapping gap and the lexical mapping gap. To tackle these two gaps, we propose PAS-SQL, an efficient SQL generation pipeline based on LLMs, which alleviates gaps through Abstract Query Pattern (AQP) and Contextual Schema Markup (CSM). AQP aims to obtain the structural pattern of the question by removing database-related information, which enables us to find structurally similar demonstrations. CSM aims to associate database-related text span in the question with specific tables or columns in the database, which alleviates the lexical mapping gap. Experimental results on the Spider and BIRD datasets demonstrate the effectiveness of our proposed method. Specifically, PAS-SQL + GPT-4o sets a new state-of-the-art on the Spider benchmark with an execution accuracy of 87.9\%, and achieves leading results on the BIRD dataset with an execution accuracy of 64.67\%.
- Published
- 2025
13. Joint Waveform and Beamforming Design in RIS-ISAC Systems: A Model-Driven Learning Approach
- Author
-
Jiang, Peng, Li, Ming, Liu, Rang, Wang, Wei, and Liu, Qian
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Integrated Sensing and Communication (ISAC) has emerged as a key enabler for future wireless systems. The recently developed symbol-level precoding (SLP) technique holds significant potential for ISAC waveform design, as it leverages both temporal and spatial degrees of freedom (DoFs) to enhance multi-user communication and radar sensing capabilities. Concurrently, reconfigurable intelligent surfaces (RIS) offer additional controllable propagation paths, further amplifying interest in their application. However, previous studies have encountered substantial computational challenges due to the complexity of jointly designing SLP-based waveforms and RIS passive beamforming. In this paper, we propose a novel model-driven learning approach that jointly optimizes waveform and beamforming by unfolding the iterative alternative direction method of multipliers (ADMM) algorithm. Two joint design algorithms are developed for radar target detection and direction-of-arrival (DoA) estimation tasks in a cluttered RIS-ISAC system. While ensuring the communication quality-of-service (QoS) requirements, our objectives are: 1) to maximize the radar output signal-to-interference-plus-noise ratio (SINR) for target detection, and 2) to minimize the Cram\'{e}r-Rao bound (CRB) for DoA estimation. Simulation results verify that our proposed model-driven learning algorithms achieve satisfactory communication and sensing performance, while also offering a substantial reduction in computational complexity, as reflected by the average execution time., Comment: Accepted by IEEE Transactions on Communications
- Published
- 2025
14. MaskPrune: Mask-based LLM Pruning for Layer-wise Uniform Structures
- Author
-
Qin, Jiayu, Tan, Jianchao, Zhang, Kefeng, Cai, Xunliang, and Wang, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
The remarkable performance of large language models (LLMs) in various language tasks has attracted considerable attention. However, the ever-increasing size of these models presents growing challenges for deployment and inference. Structured pruning, an effective model compression technique, is gaining increasing attention due to its ability to enhance inference efficiency. Nevertheless, most previous optimization-based structured pruning methods sacrifice the uniform structure across layers for greater flexibility to maintain performance. The heterogeneous structure hinders the effective utilization of off-the-shelf inference acceleration techniques and impedes efficient configuration for continued training. To address this issue, we propose a novel masking learning paradigm based on minimax optimization to obtain the uniform pruned structure by optimizing the masks under sparsity regularization. Extensive experimental results demonstrate that our method can maintain high performance while ensuring the uniformity of the pruned model structure, thereby outperforming existing SOTA methods.
- Published
- 2025
15. DataSciBench: An LLM Agent Benchmark for Data Science
- Author
-
Zhang, Dan, Zhoubian, Sining, Cai, Min, Li, Fengzu, Yang, Lekang, Wang, Wei, Dong, Tianjiao, Hu, Ziniu, Tang, Jie, and Yue, Yisong
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
This paper presents DataSciBench, a comprehensive benchmark for evaluating Large Language Model (LLM) capabilities in data science. Recent related benchmarks have primarily focused on single tasks, easily obtainable ground truth, and straightforward evaluation metrics, which limits the scope of tasks that can be evaluated. In contrast, DataSciBench is constructed based on a more comprehensive and curated collection of natural and challenging prompts for uncertain ground truth and evaluation metrics. We develop a semi-automated pipeline for generating ground truth (GT) and validating evaluation metrics. This pipeline utilizes and implements an LLM-based self-consistency and human verification strategy to produce accurate GT by leveraging collected prompts, predefined task types, and aggregate functions (metrics). Furthermore, we propose an innovative Task - Function - Code (TFC) framework to assess each code execution outcome based on precisely defined metrics and programmatic rules. Our experimental framework involves testing 6 API-based models, 8 open-source general models, and 9 open-source code generation models using the diverse set of prompts we have gathered. This approach aims to provide a more comprehensive and rigorous evaluation of LLMs in data science, revealing their strengths and weaknesses. Experimental results demonstrate that API-based models outperform open-sourced models on all metrics and Deepseek-Coder-33B-Instruct achieves the highest score among open-sourced models. We release all code and data at https://github.com/THUDM/DataSciBench., Comment: 40 pages, 7 figures, 6 tables
- Published
- 2025
16. Quark Transverse Spin-Momentum Correlation of the Nucleon from Lattice QCD: The Boer-Mulders Function
- Author
-
Ma, Lingquan, Hua, Jun, Schäfer, Andreas, Shu, Hai-Tao, Su, Yushan, Sun, Peng, Walter, Lisa, Wang, Wei, Xiong, Xiaonu, Yang, Yi-Bo, Zhang, Jian-Hui, and Zhang, Qi-An
- Subjects
High Energy Physics - Lattice ,High Energy Physics - Phenomenology - Abstract
We present the first lattice QCD calculation of the quark transverse spin-momentum correlation, i.e., the naive time-reversal-odd Boer-Mulders function, of the nucleon, using large-momentum effective theory (LaMET). The calculation is carried out on an ensemble with lattice spacing $a=0.098$ fm and pion mass $338$ MeV, at various proton momenta up to $2.11$ GeV. We have implemented perturbative matching up to the next-to-next-to-leading order together with a renormalization-group resummation improvement. The result exhibits a decay behavior with increasing transverse separation $b_\perp$. We also compare the results in the nucleon and pion., Comment: 18 pages, 14 figures, 2 tables
- Published
- 2025
17. Revisiting the charge-density-wave superlattice of 1$T$-TiSe$_2$
- Author
-
Wang, Wei, Liu, Patrick, Wu, Lijun, Tao, Jing, Gu, Genda, Zong, Alfred, and Zhu, Yimei
- Subjects
Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Materials Science - Abstract
A number of intriguing phenomena, including exciton condensation, orbital ordering, and emergence of chirality, have been proposed to accompany charge-density-wave (CDW) formation in the layered transition metal dichalcogenide 1$T$-TiSe$_2$. Explaining these effects relies on knowledge of the atomic displacement pattern underlying the CDW, yet structural proposals based on spatially-averaging bulk crystal diffraction and surface-dependent scanning tunneling microscopy have remained inconsistent. Here, we revisit the CDW superlattice structure with selected-area electron diffraction, a bulk-sensitive probe capable of capturing sub-micrometer spatial variations while maintaining high momentum resolution. We resolved two distinct, spatially separated CDW phases characterized by different interlayer ordering. In both phases, previously reported atomic displacement patterns fail to account for the observed extinction rules. Instead, our analysis reveals a new superlattice structure, which features a large number of nearly degenerate CDW domains. These findings not only provide a new basis for understanding the gyrotropic electronic order and metastability in 1$T$-TiSe$_2$, they also underscore the importance of bulk-sensitive mesoscopic techniques in investigating materials that host unconventional superlattices.
- Published
- 2025
18. Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems
- Author
-
Wang, Zhao, Moriyama, Sota, Wang, Wei-Yao, Gangopadhyay, Briti, and Takamatsu, Shingo
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Multiagent Systems - Abstract
Recent advancements in LLM-based multi-agent (LLM-MA) systems have shown promise, yet significant challenges remain in managing communication and refinement when agents collaborate on complex tasks. In this paper, we propose \textit{Talk Structurally, Act Hierarchically (TalkHier)}, a novel framework that introduces a structured communication protocol for context-rich exchanges and a hierarchical refinement system to address issues such as incorrect outputs, falsehoods, and biases. \textit{TalkHier} surpasses various types of SoTA, including inference scaling model (OpenAI-o1), open-source multi-agent models (e.g., AgentVerse), and majority voting strategies on current LLM and single-agent baselines (e.g., ReAct, GPT4o), across diverse tasks, including open-domain question answering, domain-specific selective questioning, and practical advertisement text generation. These results highlight its potential to set a new standard for LLM-MA systems, paving the way for more effective, adaptable, and collaborative multi-agent frameworks. The code is available https://github.com/sony/talkhier.
- Published
- 2025
19. A Survey of Large Language Models in Psychotherapy: Current Landscape and Future Directions
- Author
-
Na, Hongbin, Hua, Yining, Wang, Zimu, Shen, Tao, Yu, Beibei, Wang, Lilin, Wang, Wei, Torous, John, and Chen, Ling
- Subjects
Computer Science - Computation and Language - Abstract
Mental health remains a critical global challenge, with increasing demand for accessible, effective interventions. Large language models (LLMs) offer promising solutions in psychotherapy by enhancing the assessment, diagnosis, and treatment of mental health conditions through dynamic, context-aware interactions. This survey provides a comprehensive overview of the current landscape of LLM applications in psychotherapy, highlighting the roles of LLMs in symptom detection, severity estimation, cognitive assessment, and therapeutic interventions. We present a novel conceptual taxonomy to organize the psychotherapy process into three core components: assessment, diagnosis, and treatment, and examine the challenges and advancements in each area. The survey also addresses key research gaps, including linguistic biases, limited disorder coverage, and underrepresented therapeutic models. Finally, we discuss future directions to integrate LLMs into a holistic, end-to-end psychotherapy framework, addressing the evolving nature of mental health conditions and fostering more inclusive, personalized care., Comment: in progress
- Published
- 2025
20. Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
- Author
-
He, Yuwen, Wang, Wei, Wang, Wanyu, and Jiang, Kui
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Lens flares arise from light reflection and refraction within sensor arrays, whose diverse types include glow, veiling glare, reflective flare and so on. Existing methods are specialized for one specific type only, and overlook the simultaneous occurrence of multiple typed lens flares, which is common in the real-world, e.g. coexistence of glow and displacement reflections from the same light source. These co-occurring lens flares cannot be effectively resolved by the simple combination of individual flare removal methods, since these coexisting flares originates from the same light source and are generated simultaneously within the same sensor array, exhibit a complex interdependence rather than simple additive relation. To model this interdependent flare relationship, our Nighttime Lens Flare Formation model is the first attempt to learn the intrinsic physical relationship between flares on the imaging plane. Building on this physical model, we introduce a solution to this joint flare removal task named Self-supervised Generation-based Lens Flare Removal Network (SGLFR-Net), which is self-supervised without pre-training. Specifically, the nighttime glow is detangled in PSF Rendering Network(PSFR-Net) based on PSF Rendering Prior, while the reflective flare is modelled in Texture Prior Based Reflection Flare Removal Network (TPRR-Net). Empirical evaluations demonstrate the effectiveness of the proposed method in both joint and individual glare removal tasks., Comment: 9 pages,Accepted by AAAI2025
- Published
- 2025
21. Realistic Evaluation of Deep Partial-Label Learning Algorithms
- Author
-
Wang, Wei, Wu, Dong-Dong, Wang, Jindong, Niu, Gang, Zhang, Min-Ling, and Sugiyama, Masashi
- Subjects
Computer Science - Machine Learning - Abstract
Partial-label learning (PLL) is a weakly supervised learning problem in which each example is associated with multiple candidate labels and only one is the true label. In recent years, many deep PLL algorithms have been developed to improve model performance. However, we find that some early developed algorithms are often underestimated and can outperform many later algorithms with complicated designs. In this paper, we delve into the empirical perspective of PLL and identify several critical but previously overlooked issues. First, model selection for PLL is non-trivial, but has never been systematically studied. Second, the experimental settings are highly inconsistent, making it difficult to evaluate the effectiveness of the algorithms. Third, there is a lack of real-world image datasets that can be compatible with modern network architectures. Based on these findings, we propose PLENCH, the first Partial-Label learning bENCHmark to systematically compare state-of-the-art deep PLL algorithms. We investigate the model selection problem for PLL for the first time, and propose novel model selection criteria with theoretical guarantees. We also create Partial-Label CIFAR-10 (PLCIFAR10), an image dataset of human-annotated partial labels collected from Amazon Mechanical Turk, to provide a testbed for evaluating the performance of PLL algorithms in more realistic scenarios. Researchers can quickly and conveniently perform a comprehensive and fair evaluation and verify the effectiveness of newly developed algorithms based on PLENCH. We hope that PLENCH will facilitate standardized, fair, and practical evaluation of PLL algorithms in the future., Comment: ICLR 2025 Spotlight
- Published
- 2025
22. {\lambda}Scale: Enabling Fast Scaling for Serverless Large Language Model Inference
- Author
-
Yu, Minchen, Yang, Rui, Jia, Chaobo, Su, Zhaoyuan, Yao, Sheng, Lan, Tingfeng, Yang, Yuchen, Cheng, Yue, Wang, Wei, Wang, Ao, and Chen, Ruichuan
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Serverless computing has emerged as a compelling solution for cloud-based model inference. However, as modern large language models (LLMs) continue to grow in size, existing serverless platforms often face substantial model startup overhead. This poses a significant challenge in efficiently scaling model instances to accommodate dynamic, bursty workloads commonly observed in real-world inference services. In this paper, we introduce {\lambda}Scale, an efficient serverless inference system to achieve fast model scaling. The key idea behind {\lambda}Scale is to leverage high-speed RDMA networks between GPU nodes for fast model multicast, while enabling distributed inference execution during model transmission -- referred to as "execute-while-load". {\lambda}Scale proposes an efficient model scaling scheme, {\lambda}Pipe, which supports adaptive model multicast and dynamically constructs execution pipelines across receiving nodes for collaborative, distributed inference. Additionally, {\lambda}Scale supports efficient model management across GPU and host memory, allowing fast scaling for models across different storage tiers. Evaluation results show that {\lambda}Scale enables fast model scaling and effectively handles load spikes, achieving up to 5x tail-latency improvement and 31.3% cost reduction compared to state-of-the-art solutions on real-world LLM inference traces.
- Published
- 2025
23. CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
- Author
-
Wang, Xintao, Wang, Heng, Zhang, Yifei, Yuan, Xinfeng, Xu, Rui, Huang, Jen-tse, Yuan, Siyu, Guo, Haoran, Chen, Jiangjie, Wang, Wei, Xiao, Yanghua, and Zhou, Shuchang
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Role-playing language agents (RPLAs) have emerged as promising applications of large language models (LLMs). However, simulating established characters presents a challenging task for RPLAs, due to the lack of authentic character datasets and nuanced evaluation methods using such data. In this paper, we present CoSER, a collection of a high-quality dataset, open models, and an evaluation protocol towards effective RPLAs of established characters. The CoSER dataset covers 17,966 characters from 771 renowned books. It provides authentic dialogues with real-world intricacies, as well as diverse data types such as conversation setups, character experiences and internal thoughts. Drawing from acting methodology, we introduce given-circumstance acting for training and evaluating role-playing LLMs, where LLMs sequentially portray multiple characters in book scenes. Using our dataset, we develop CoSER 8B and CoSER 70B, i.e., advanced open role-playing LLMs built on LLaMA-3.1 models. Extensive experiments demonstrate the value of the CoSER dataset for RPLA training, evaluation and retrieval. Moreover, CoSER 70B exhibits state-of-the-art performance surpassing or matching GPT-4o on our evaluation and three existing benchmarks, i.e., achieving 75.80% and 93.47% accuracy on the InCharacter and LifeChoice benchmarks respectively.
- Published
- 2025
24. Deep Optical Images of the Ejecta Nebula Around the Wolf-Rayet Star WR 8 (HD 62910)
- Author
-
Fesen, Robert A., Patnaude, Daniel, Wang, Wei-Hao, Chu, You-Hua, Sun, Jason, Peitsch, Manuel C., Pugh, Martin, Garrod, Scott, Selby, Michael, and Woronow, Alex
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
We report the results of deep H-alpha and [O III] images of the bright WN7/WC4 Wolf-Rayet star WR 8 (HD 62910). These data show considerably more surrounding nebulosity than seen in prior imaging. The brighter portions of the nebula span 6' in diameter and exhibit considerable fine-scale structure including numerous emission clumps and bright head-tail like features presumably due to the effects of the WR star's stellar winds. Due to the overlap of a relatively bright band of unrelated foreground diffuse interstellar H-alpha emission, WR 8's nebula is best viewed via its [O III] emission. A faint 9' x 13' diffuse outer nebulosity is detected surrounding the nebula's main ring of emission. The nebula's optical structure is substantially different from that of its thermal continuum dust emission seen in WISE 22 micron infrared images which show a smaller and sharply defined emission shell., Comment: 8 pages, 6 figures, 1 table
- Published
- 2025
25. Position reconstruction and surface background model for the PandaX-4T detector
- Author
-
Qian, Zhicheng, Gu, Linhui, Cheng, Chen, Bo, Zihao, Chen, Wei, Chen, Xun, Chen, Yunhua, Cheng, Zhaokan, Cui, Xiangyi, Fan, Yingjie, Fang, Deqing, Gao, Zhixing, Geng, Lisheng, Giboni, Karl, Guo, Xunan, Guo, Xuyuan, Guo, Zichao, Han, Chencheng, Han, Ke, He, Changda, He, Jinrong, Huang, Di, Huang, Houqi, Huang, Junting, Hou, Ruquan, Hou, Yu, Ji, Xiangdong, Ji, Xiangpan, Ju, Yonglin, Li, Chenxiang, Li, Jiafu, Li, Mingchuan, Li, Shuaijie, Li, Tao, Li, Zhiyuan, Lin, Qing, Liu, Jianglai, Lu, Congcong, Lu, Xiaoying, Luo, Lingyin, Luo, Yunyang, Ma, Wenbo, Ma, Yugang, Mao, Yajun, Meng, Yue, Ning, Xuyang, Pang, Binyu, Qi, Ningchun, Ren, Xiangxiang, Shan, Dong, Shang, Xiaofeng, Shao, Xiyuan, Shen, Guofang, Shen, Manbin, Sun, Wenliang, Tao, Yi, Wang, Anqing, Wang, Guanbo, Wang, Hao, Wang, Jiamin, Wang, Lei, Wang, Meng, Wang, Qiuhong, Wang, Shaobo, Wang, Siguang, Wang, Wei, Wang, Xiuli, Wang, Xu, Wang, Zhou, Wei, Yuehuan, Wu, Weihao, Wu, Yuan, Xiao, Mengjiao, Xiao, Xiang, Xiong, Kaizhi, Xu, Yifan, Yao, Shunyu, Yan, Binbin, Yan, Xiyu, Yang, Yong, Ye, Peihua, Yu, Chunxu, Yuan, Ying, Yuan, Zhe, Yun, Youhui, Zeng, Xinning, Zhang, Minzhen, Zhang, Peng, Zhang, Shibo, Zhang, Shu, Zhang, Tao, Zhang, Wei, Zhang, Yang, Zhang, Yingxin, Zhang, Yuanyuan, Zhao, Li, Zhou, Jifang, Zhou, Jiaxu, Zhou, Jiayi, Zhou, Ning, Zhou, Xiaopeng, Zhou, Yubo, and Zhou, Zhizhen
- Subjects
Physics - Instrumentation and Detectors ,High Energy Physics - Experiment - Abstract
We report the position reconstruction methods and surface background model for the PandaX-4T dark matter direct search experiment. This work develops two position reconstruction algorithms: template matching (TM) method and photon acceptance function (PAF) method. Both methods determine the horizontal position of events based on the light pattern of secondary scintillation collected by the light sensors. After a comprehensive evaluation of resolution, uniformity, and robustness, the PAF method was selected for position reconstruction, while the TM method was employed for verification. The PAF method achieves a bulk event resolution of 1.0 mm and a surface event resolution of 4.4 mm for a typical $S2$ signal with a bottom charge of 1500 PE (about 14 keV). The uniformity is around 20\%. Robustness studies reveal average deviations of 5.1 mm and 8.8 mm for the commissioning run (Run0) and the first science run (Run1), respectively, due to the deactivation of certain PMTs. A data-driven surface background model is developed based on the PAF method. The surface background is estimated to be $0.09 \pm 0.06$ events for Run0 (0.54 tonne$\cdot$year) and $0.17 \pm 0.11$ events for Run1 (1.00 tonne$\cdot$year)., Comment: 22 pages, 15 figures, 2 tables
- Published
- 2025
26. Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment
- Author
-
Zhang, Long, Zhang, Meng, Wang, Wei Lin, and Luo, Yu
- Subjects
Computer Science - Computers and Society - Abstract
The advancement of Artificial Intelligence (AI) has created opportunities for e-learning, particularly in automated assessment systems that reduce educators' workload and provide timely feedback to students. However, developing effective AI-based assessment tools remains challenging due to the substantial resources required for collecting and annotating real student data. This study investigates the potential and gap of simulative data to address this limitation. Through a two-phase experimental study, we examined the effectiveness and gap of Large Language Model generated synthetic data in training educational assessment systems. Our findings reveal that while simulative data demonstrates promising results in training automated assessment models, outperforming state-of-the-art GPT-4o in most question types, its effectiveness has notable limitations. Specifically, models trained on synthetic data show excellent performance in simulated environment but need progress when applied to real-world scenarios. This performance gap highlights the limitations of only using synthetic data in controlled experimental settings for AI training. The absence of real-world noise and biases, which are also present in over-processed real-world data, contributes to this limitation. We recommend that future development of automated assessment agents and other AI tools should incorporate a mixture of synthetic and real-world data, or introduce more realistic noise and biases patterns, rather than relying solely on synthetic or over-processed data.
- Published
- 2025
27. HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates
- Author
-
Lu, Lei, Li, Yize, Wang, Yanzhi, Wang, Wei, and Jiang, Wei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Multimedia - Abstract
Image compression under ultra-low bitrates remains challenging for both conventional learned image compression (LIC) and generative vector-quantized (VQ) modeling. Conventional LIC suffers from severe artifacts due to heavy quantization, while generative VQ modeling gives poor fidelity due to the mismatch between learned generative priors and specific inputs. In this work, we propose Hybrid-Diffusion Image Compression (HDCompression), a dual-stream framework that utilizes both generative VQ-modeling and diffusion models, as well as conventional LIC, to achieve both high fidelity and high perceptual quality. Different from previous hybrid methods that directly use pre-trained LIC models to generate low-quality fidelity-preserving information from heavily quantized latent, we use diffusion models to extract high-quality complimentary fidelity information from the ground-truth input, which can enhance the system performance in several aspects: improving indices map prediction, enhancing the fidelity-preserving output of the LIC stream, and refining conditioned image reconstruction with VQ-latent correction. In addition, our diffusion model is based on a dense representative vector (DRV), which is lightweight with very simple sampling schedulers. Extensive experiments demonstrate that our HDCompression outperforms the previous conventional LIC, generative VQ-modeling, and hybrid frameworks in both quantitative metrics and qualitative visualization, providing balanced robust compression performance at ultra-low bitrates., Comment: Under Review
- Published
- 2025
28. CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories
- Author
-
Xiao, Yijia, Wang, Runhui, Kong, Luyang, Golac, Davor, and Wang, Wei
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
The increasing complexity of computer science research projects demands more effective tools for deploying code repositories. Large Language Models (LLMs), such as Anthropic Claude and Meta Llama, have demonstrated significant advancements across various fields of computer science research, including the automation of diverse software engineering tasks. To evaluate the effectiveness of LLMs in handling complex code development tasks of research projects, particularly for NLP/CV/AI/ML/DM topics, we introduce CSR-Bench, a benchmark for Computer Science Research projects. This benchmark assesses LLMs from various aspects including accuracy, efficiency, and deployment script quality, aiming to explore their potential in conducting computer science research autonomously. We also introduce a novel framework, CSR-Agents, that utilizes multiple LLM agents to automate the deployment of GitHub code repositories of computer science research projects. Specifically, by checking instructions from markdown files and interpreting repository structures, the model generates and iteratively improves bash commands that set up the experimental environments and deploy the code to conduct research tasks. Preliminary results from CSR-Bench indicate that LLM agents can significantly enhance the workflow of repository deployment, thereby boosting developer productivity and improving the management of developmental workflows.
- Published
- 2025
29. Physics-Conditioned Diffusion Models for Lattice Gauge Theory
- Author
-
Zhu, Qianteng, Aarts, Gert, Wang, Wei, Zhou, Kai, and Wang, Lingxiao
- Subjects
High Energy Physics - Lattice ,Computer Science - Machine Learning - Abstract
We develop diffusion models for simulating lattice gauge theories, where stochastic quantization is explicitly incorporated as a physical condition for sampling. We demonstrate the applicability of this novel sampler to U(1) gauge theory in two spacetime dimensions and find that a model trained at a small inverse coupling constant can be extrapolated to larger inverse coupling regions without encountering the topological freezing problem. Additionally, the trained model can be employed to sample configurations on different lattice sizes without requiring further training. The exactness of the generated samples is ensured by incorporating Metropolis-adjusted Langevin dynamics into the generation process. Furthermore, we demonstrate that this approach enables more efficient sampling of topological quantities compared to traditional algorithms such as Hybrid Monte Carlo and Langevin simulations., Comment: 25 pages, 10 figures, comments are welcome! Codes are available at: https://github.com/zzzqt/DM4U1
- Published
- 2025
30. DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
- Author
-
Deng, Yihe, Yang, Yu, Zhang, Junkai, Wang, Wei, and Li, Bo
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
The rapid advancement of large language models (LLMs) has increased the need for guardrail models to ensure responsible use, particularly in detecting unsafe and illegal content. While substantial safety data exist in English, multilingual guardrail modeling remains underexplored due to the scarcity of open-source safety data in other languages. To address this gap, we propose a novel two-player Reinforcement Learning (RL) framework, where a generator and a guardrail model co-evolve adversarially to produce high-quality synthetic data for multilingual guardrail training. We theoretically formalize this interaction as a two-player game, proving convergence to a Nash equilibrium. Empirical evaluations show that our model \ours outperforms state-of-the-art models, achieving nearly 10% improvement over LlamaGuard3 (8B) on English benchmarks while being 4.5x faster at inference with a significantly smaller model (0.5B). We achieve substantial advancements in multilingual safety tasks, particularly in addressing the imbalance for lower-resource languages in a collected real dataset. Ablation studies emphasize the critical role of synthetic data generation in bridging the imbalance in open-source data between English and other languages. These findings establish a scalable and efficient approach to synthetic data generation, paving the way for improved multilingual guardrail models to enhance LLM safety. Code, model, and data will be open-sourced at https://github.com/yihedeng9/DuoGuard., Comment: 24 pages, 9 figures, 5 tables
- Published
- 2025
31. Online Robot Motion Planning Methodology Guided by Group Social Proxemics Feature
- Author
-
Mu, Xuan, Liu, Xiaorui, Guo, Shuai, Chi, Wenzheng, Wang, Wei, and Ge, Shuzhi Sam
- Subjects
Computer Science - Robotics ,Electrical Engineering and Systems Science - Systems and Control - Abstract
Nowadays robot is supposed to demonstrate human-like perception, reasoning and behavior pattern in social or service application. However, most of the existing motion planning methods are incompatible with above requirement. A potential reason is that the existing navigation algorithms usually intend to treat people as another kind of obstacle, and hardly take the social principle or awareness into consideration. In this paper, we attempt to model the proxemics of group and blend it into the scenario perception and navigation of robot. For this purpose, a group clustering method considering both social relevance and spatial confidence is introduced. It can enable robot to identify individuals and divide them into groups. Next, we propose defining the individual proxemics within magnetic dipole model, and further established the group proxemics and scenario map through vector-field superposition. On the basis of the group clustering and proxemics modeling, we present the method to obtain the optimal observation positions (OOPs) of group. Once the OOPs grid and scenario map are established, a heuristic path is employed to generate path that guide robot cruising among the groups for interactive purpose. A series of experiments are conducted to validate the proposed methodology on the practical robot, the results have demonstrated that our methodology has achieved promising performance on group recognition accuracy and path-generation efficiency. This concludes that the group awareness evolved as an important module to make robot socially behave in the practical scenario., Comment: 14 pages,14 figures
- Published
- 2025
32. DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection
- Author
-
Yan, Mingxuan, Zhang, Ruijie, Xiao, Xuedou, and Wang, Wei
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
While MPEG-standardized video-based point cloud compression (VPCC) achieves high compression efficiency for human perception, it struggles with a poor trade-off between bitrate savings and detection accuracy when supporting 3D object detectors. This limitation stems from VPCC's inability to prioritize regions of different importance within point clouds. To address this issue, we propose DetVPCC, a novel method integrating region-of-interest (RoI) encoding with VPCC for efficient point cloud sequence compression while preserving the 3D object detection accuracy. Specifically, we augment VPCC to support RoI-based compression by assigning spatially non-uniform quality levels. Then, we introduce a lightweight RoI detector to identify crucial regions that potentially contain objects. Experiments on the nuScenes dataset demonstrate that our approach significantly improves the detection accuracy. The code and demo video are available in supplementary materials.
- Published
- 2025
33. PDStream: Slashing Long-Tail Delay in Interactive Video Streaming via Pseudo-Dual Streaming
- Author
-
Xiao, Xuedou, Zuo, Yingying, Yan, Mingxuan, Liu, Kezhong, and Wang, Wei
- Subjects
Computer Science - Multimedia ,Computer Science - Networking and Internet Architecture - Abstract
End-to-end (E2E) delay is critical for interactive video streaming (IVS) experiences, but remains unsatisfactory for its long-tail distribution caused by periodic large keyframes. Conventional optimization strategies, such as jitter buffer, bitrate adaptation, and customized encoding, either sacrifice clarity, average delay, or compatibility. To address this issue, we propose PDStream, a novel pseudo-dual streaming algorithm, aimed at minimizing E2E delay while maintaining video clarity. The core idea is to split the two functions, delay-sensitive playback and delay-tolerant reference, on keyframes through dual streaming. Specifically, the playback function is held by a second parallel stream, which comprises much smaller non-keyframes and is allocated more immediate bandwidth for real-time performance. The reference function is ensured by the first stream with keyframe preservation, allocated more subsequent bandwidth to smooth out bursty traffic. Additionally, ``pseudo'' minimizes computational and transmission overheads by restricting dual streams to brief activation only when keyframes appear, supported by corresponding dual-stream bitrate allocation and adaptation to ensure delay and clarity. We implement PDStream on a WebRTC-based IVS testbed with real-world network traces. Results show that PDStream significantly outperforms prior algorithms, reducing average E2E delay by 17.5\% and slashing its 97th percentile by 33.3\%, while keeping clarity under varying bandwidth., Comment: IEEE INFOCOM 2025
- Published
- 2025
34. Search for Double Beta Decay of $^{136}$Xe to the $0^+_1$ Excited State of $^{136}$Ba with PandaX-4T
- Author
-
PandaX Collaboration, Luo, Lingyin, Bo, Zihao, Chen, Wei, Chen, Xun, Chen, Yunhua, Cheng, Zhaokan, Cui, Xiangyi, Fang, Yingji, Fang, Deqing, Gao, Zhixing, Geng, Lisheng, Giboni, Karl, Guo, Xunan, Guo, Xuyuan, Guo, Zichao, Han, Chencheng, Han, Ke, He, Changda, He, Jinrong, Huang, Di, Huang, Houqi, Huang, Junting, Hou, Ruquan, Hou, Yu, Ji, Xiangdong, Ji, Xiangpan, Ju, Yonglin, Li, Chenxiang, Li, Tao, Li, Jiafu, Li, Mingchuan, Li, Shuaijie, Li, Zhiyuan, Lin, Qing, Liu, Jianglai, Lu, Congcong, Lu, Xiaoying, Luo, Yunyang, Ma, Wenbo, Ma, Yugang, Mao, Yajun, Meng, Yue, Ning, Xuyang, Pang, Binyu, Qi, Ningchun, Qian, Zhicheng, Ren, Xiangxiang, Shan, Dong, Shang, Xiaofeng, Shao, Xiyuan, Shen, Guofang, Shen, Manbin, Sun, Wenliang, Tao, Yi, Wang, Anqing, Wang, Guanbo, Wang, Hao, Wang, Jiamin, Wang, Lei, Wang, Meng, Wang, Qiuhong, Wang, Shaobo, Wang, Siguang, Wang, Wei, Wang, Xiuli, Wang, Xu, Wang, Zhou, Wei, Yuehuan, Wu, Weihao, Wu, Yuan, Xiao, Mengjiao, Xiao, Xiang, Xiong, Kaizhi, Xu, Yifan, Yao, Shunyu, Yan, Binbin, Yan, Xiyu, Yang, Yong, Ye, Peihua, Yu, Chunxu, Yuan, Ying, Yuan, Zhe, Yun, Youhui, Zeng, Xinning, Zhang, Minzhen, Zhang, Peng, Zhang, Shibo, Zhang, Shu, Zhang, Tao, Zhang, Wei, Zhang, Yang, Zhang, Yingxin, Zhang, Yuanyuan, Zhao, Li, Zhou, Jifang, Zhou, Jiaxu, Zhou, Ning, Zhou, Xiaopeng, Zhou, Yubo, and Zhou, Zhizhen
- Subjects
Nuclear Experiment - Abstract
We perform a search of double beta decay of $^{136}$Xe to the excited state, $0^+_1$, of $^{136}$Ba (2$\nu \beta \beta$-0$_1^+$), using the dual-phase xenon detector of PandaX-4T with the first 94.9-day commissioning data. The multi-site events are reconstructed up to the MeV energy scale, which helps to improve the background model significantly. The background contribution from the stainless steel platform outside PandaX-4T cryostat is evaluated for the first time. No significant evidence for 2$\nu \beta \beta$-0$_1^+$ is observed, resulting in a lower limit of $T_{1/2}^{2 \nu \beta \beta-0_1^+} > 7.5 \times 10^{22}$ yr at the 90% confidence level. This is the first experimental limit on such a rare decay in a natural xenon-based detector.
- Published
- 2025
35. A Retrospective Systematic Study on Hierarchical Sparse Query Transformer-assisted Ultrasound Screening for Early Hepatocellular Carcinoma
- Author
-
She, Chaoyin, Lu, Ruifang, He, Danni, Lv, Jiayi, Lin, Yadan, Cheng, Meiqing, Huang, Hui, Chen, Lida, Wang, Wei, and Huang, Qinghua
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Hepatocellular carcinoma (HCC) ranks as the third leading cause of cancer-related mortality worldwide, with early detection being crucial for improving patient survival rates. However, early screening for HCC using ultrasound suffers from insufficient sensitivity and is highly dependent on the expertise of radiologists for interpretation. Leveraging the latest advancements in artificial intelligence (AI) in medical imaging, this study proposes an innovative Hierarchical Sparse Query Transformer (HSQformer) model that combines the strengths of Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance the accuracy of HCC diagnosis in ultrasound screening. The HSQformer leverages sparse latent space representations to capture hierarchical details at various granularities without the need for complex adjustments, and adopts a modular, plug-and-play design philosophy, ensuring the model's versatility and ease of use. The HSQformer's performance was rigorously tested across three distinct clinical scenarios: single-center, multi-center, and high-risk patient testing. In each of these settings, it consistently outperformed existing state-of-the-art models, such as ConvNext and SwinTransformer. Notably, the HSQformer even matched the diagnostic capabilities of senior radiologists and comprehensively surpassed those of junior radiologists. The experimental results from this study strongly demonstrate the effectiveness and clinical potential of AI-assisted tools in HCC screening. The full code is available at https://github.com/Asunatan/HSQformer.
- Published
- 2025
36. The Stellar Abundances and Galactic Evolution Survey (SAGES). II. Machine Learning-Based Stellar parameters for 21 million stars from the First Data Release
- Author
-
Gu, Hongrui, Fan, Zhou, Zhao, Gang, Huang, Yang, Beers, Timothy C., Wang, Wei, Zheng, Jie, Zhao, Jingkun, Li, Chun, Chen, Yuqin, Yuan, Haibo, Li, Haining, Tan, Kefeng, Song, Yihan, Luo, Ali, Song, Nan, and Liu, Yujuan
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Stellar parameters for large samples of stars play a crucial role in constraining the nature of stars and stellar populations in the Galaxy. An increasing number of medium-band photometric surveys are presently used in estimating stellar parameters. In this study, we present a machine-learning approach to derive estimates of stellar parameters, including [Fe/H], logg, and Teff, based on a combination of medium-band and broad-band photometric observations. Our analysis employs data primarily sourced from the SAGE Survey , which aims to observe much of the Northern Hemisphere. We combine the $uv$-band data from SAGES DR1 with photometric and astrometric data from Gaia EDR3, and apply the random forest method to estimate stellar parameters for approximately 21 million stars. We are able to obtain precisions of 0.09 dex for [Fe/H], 0.12 dex for logg, and 70 K for Teff. Furthermore, by incorporating 2MASS and WISE infrared photometric and GALEX ultraviolet data, we are able to achieve even higher precision estimates for over 2.2 million stars. These results are applicable to both giant and dwarf stars. Building upon this mapping, we construct a foundational dataset for research on metal-poor stars, the structure of the Milky Way, and beyond. With the forthcoming release of additional bands from SAGE Survey such DDO51 and H-alpha, this versatile machine learning approach is poised to play an important role in upcoming surveys featuring expanded filter sets, Comment: Accepted by ApJS.12 pages, 12 figures, 3 tables
- Published
- 2025
37. Propagation-induced Frequency-dependent Polarization Properties of Fast Radio Burst
- Author
-
Wang, Wei-Yang, Liu, Xiaohui, Li, Dongzi, Zhang, Bing, Niu, Chen-Hui, Liu, Jifeng, Xu, Renxin, and Zhu, Weiwei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Frequency-dependent polarization properties provide crucial insights into the radiation mechanisms and magnetic environments of fast radio bursts (FRBs). We explore an analytical solution of radiative transfer of the polarization properties of FRBs as a strong incoming wave propagates in a homogeneous magnetized plasma. The case of a thermal plasma is studied in more detail. The rotational axis of the polarization spectrum undergoes precession with frequency on the Poincar\'e sphere when the medium has both strong Faraday rotation and conversion. Such precession on the Poincar\'e sphere could occur in hot or cold plasma with a strong magnetic field component perpendicular to the line of sight. The analytical solution with the mixing Faraday case offers a more physical description of the physical properties of the magnetic environment of FRBs than the empirical ``generalized Faraday rotation'' method commonly adopted in the literature. Significant absorption can exist in a dense plasma medium, which may give rise to a highly circularly polarized outgoing wave. The frequency-dependent Stokes parameters may be associated with reversing rotation measures or the presence of a persistent radio source around an FRB., Comment: 20 pagers, 11 figures, AAAS journal submitted
- Published
- 2025
38. Scalable Higher Resolution Polar Sea Ice Classification and Freeboard Calculation from ICESat-2 ATL03 Data
- Author
-
Iqrah, Jurdana Masuma, Koo, Younghyun, Wang, Wei, Xie, Hongjie, and Prasad, Sushil K.
- Subjects
Computer Science - Machine Learning - Abstract
ICESat-2 (IS2) by NASA is an Earth-observing satellite that measures high-resolution surface elevation. The IS2's ATL07 and ATL10 sea ice elevation and freeboard products of 10m-200m segments which aggregated 150 signal photons from the raw ATL03 (geolocated photon) data. These aggregated products can potentially overestimate local sea surface height, thus underestimating the calculations of freeboard (sea ice height above sea surface). To achieve a higher resolution of sea surface height and freeboard information, in this work we utilize a 2m window to resample the ATL03 data. Then, we classify these 2m segments into thick sea ice, thin ice, and open water using deep learning methods (Long short-term memory and Multi-layer perceptron models). To obtain labeled training data for our deep learning models, we use segmented Sentinel-2 (S2) multi-spectral imagery overlapping with IS2 tracks in space and time to auto-label IS2 data, followed by some manual corrections in the regions of transition between different ice/water types or cloudy regions. We employ a parallel workflow for this auto-labeling using PySpark to scale, and we achieve 9-fold data loading and 16.25-fold map-reduce speedup. To train our models, we employ a Horovod-based distributed deep-learning workflow on a DGX A100 8 GPU cluster, achieving a 7.25-fold speedup. Next, we calculate the local sea surface heights based on the open water segments. Finally, we scale the freeboard calculation using the derived local sea level and achieve 8.54-fold data loading and 15.7-fold map-reduce speedup. Compared with the ATL07 (local sea level) and ATL10 (freeboard) data products, our results show higher resolutions and accuracy (96.56%).
- Published
- 2025
39. Neural Preconditioning Operator for Efficient PDE Solves
- Author
-
Li, Zhihao, Xiao, Di, Lai, Zhilu, and Wang, Wei
- Subjects
Computer Science - Computational Engineering, Finance, and Science - Abstract
We introduce the Neural Preconditioning Operator (NPO), a novel approach designed to accelerate Krylov solvers in solving large, sparse linear systems derived from partial differential equations (PDEs). Unlike classical preconditioners that often require extensive tuning and struggle to generalize across different meshes or parameters, NPO employs neural operators trained via condition and residual losses. This framework seamlessly integrates with existing neural network models, serving effectively as a preconditioner to enhance the performance of Krylov subspace methods. Further, by melding algebraic multigrid principles with a transformer-based architecture, NPO significantly reduces iteration counts and runtime for solving Poisson, Diffusion, and Linear Elasticity problems on both uniform and irregular meshes. Our extensive numerical experiments demonstrate that NPO outperforms traditional methods and contemporary neural approaches across various resolutions, ensuring robust convergence even on grids as large as 4096, far exceeding its initial training limits. These findings underscore the potential of data-driven preconditioning to transform the computational efficiency of high-dimensional PDE applications.
- Published
- 2025
40. Preference Leakage: A Contamination Problem in LLM-as-a-judge
- Author
-
Li, Dawei, Sun, Renliang, Huang, Yue, Zhong, Ming, Jiang, Bohan, Han, Jiawei, Zhang, Xiangliang, Wang, Wei, and Liu, Huan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Large Language Models (LLMs) as judges and LLM-based data synthesis have emerged as two fundamental LLM-driven data annotation methods in model development. While their combination significantly enhances the efficiency of model training and evaluation, little attention has been given to the potential contamination brought by this new model development paradigm. In this work, we expose preference leakage, a contamination problem in LLM-as-a-judge caused by the relatedness between the synthetic data generators and LLM-based evaluators. To study this issue, we first define three common relatednesses between data generator LLM and judge LLM: being the same model, having an inheritance relationship, and belonging to the same model family. Through extensive experiments, we empirically confirm the bias of judges towards their related student models caused by preference leakage across multiple LLM baselines and benchmarks. Further analysis suggests that preference leakage is a pervasive issue that is harder to detect compared to previously identified biases in LLM-as-a-judge scenarios. All of these findings imply that preference leakage is a widespread and challenging problem in the area of LLM-as-a-judge. We release all codes and data at: https://github.com/David-Li0406/Preference-Leakage., Comment: 17 pages, 8 figures
- Published
- 2025
41. Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection
- Author
-
Ma, Mingyu Derek, Ding, Yanna, Huang, Zijie, Gao, Jianxi, Sun, Yizhou, and Wang, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Generative Language Models rely on autoregressive decoding to produce the output sequence token by token. Many tasks such as preference optimization, require the model to produce task-level output consisting of multiple tokens directly by selecting candidates from a pool as predictions. Determining a task-level prediction from candidates using the ordinary token-level decoding mechanism is constrained by time-consuming decoding and interrupted gradients by discrete token selection. Existing works have been using decoding-free candidate selection methods to obtain candidate probability from initial output logits over vocabulary. Though these estimation methods are widely used, they are not systematically evaluated, especially on end tasks. We introduce an evaluation of a comprehensive collection of decoding-free candidate selection approaches on a comprehensive set of tasks, including five multiple-choice QA tasks with a small candidate pool and four clinical decision tasks with a massive amount of candidates, some with 10k+ options. We evaluate the estimation methods paired with a wide spectrum of foundation LMs covering different architectures, sizes and training paradigms. The results and insights from our analysis inform the future model design.
- Published
- 2025
42. Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction
- Author
-
Ma, Mingyu Derek, Wang, Xiaoxuan, Xiao, Yijia, Cuturrufo, Anthony, Nori, Vijay S, Halperin, Eran, and Wang, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Clinical diagnosis prediction models, when provided with a patient's medical history, aim to detect potential diseases early, facilitating timely intervention and improving prognostic outcomes. However, the inherent scarcity of patient data and large disease candidate space often pose challenges in developing satisfactory models for this intricate task. The exploration of leveraging Large Language Models (LLMs) for encapsulating clinical decision processes has been limited. We introduce MERA, a clinical diagnosis prediction model that bridges pertaining natural language knowledge with medical practice. We apply hierarchical contrastive learning on a disease candidate ranking list to alleviate the large decision space issue. With concept memorization through fine-tuning, we bridge the natural language clinical knowledge with medical codes. Experimental results on MIMIC-III and IV datasets show that MERA achieves the state-of-the-art diagnosis prediction performance and dramatically elevates the diagnosis prediction capabilities of generative LMs., Comment: To appear at AAAI 2025
- Published
- 2025
43. Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound: The TDSC-ABUS Challenge
- Author
-
Luo, Gongning, Xu, Mingwang, Chen, Hongyu, Liang, Xinjie, Tao, Xing, Ni, Dong, Jeong, Hyunsu, Kim, Chulhong, Stock, Raphael, Baumgartner, Michael, Kirchhoff, Yannick, Rokuss, Maximilian, Maier-Hein, Klaus, Yang, Zhikai, Fan, Tianyu, Boutry, Nicolas, Tereshchenko, Dmitry, Moine, Arthur, Charmetant, Maximilien, Sauer, Jan, Du, Hao, Bai, Xiang-Hui, Raikar, Vipul Pai, Montoya-del-Angel, Ricardo, Marti, Robert, Luna, Miguel, Lee, Dongmin, Qayyum, Abdul, Mazher, Moona, Guo, Qihui, Wang, Changyan, Awasthi, Navchetan, Zhao, Qiaochu, Wang, Wei, Wang, Kuanquan, Wang, Qiucheng, and Dong, Suyu
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Breast cancer is one of the most common causes of death among women worldwide. Early detection helps in reducing the number of deaths. Automated 3D Breast Ultrasound (ABUS) is a newer approach for breast screening, which has many advantages over handheld mammography such as safety, speed, and higher detection rate of breast cancer. Tumor detection, segmentation, and classification are key components in the analysis of medical images, especially challenging in the context of 3D ABUS due to the significant variability in tumor size and shape, unclear tumor boundaries, and a low signal-to-noise ratio. The lack of publicly accessible, well-labeled ABUS datasets further hinders the advancement of systems for breast tumor analysis. Addressing this gap, we have organized the inaugural Tumor Detection, Segmentation, and Classification Challenge on Automated 3D Breast Ultrasound 2023 (TDSC-ABUS2023). This initiative aims to spearhead research in this field and create a definitive benchmark for tasks associated with 3D ABUS image analysis. In this paper, we summarize the top-performing algorithms from the challenge and provide critical analysis for ABUS image examination. We offer the TDSC-ABUS challenge as an open-access platform at https://tdsc-abus2023.grand-challenge.org/ to benchmark and inspire future developments in algorithmic research.
- Published
- 2025
44. Observations of the X-ray Millihertz Quasiperiodic Oscillations in Hercules X-1
- Author
-
Yang, Wen and Wang, Wei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
With a systematic timing investigation of the persistent X-ray binary pulsar Her X-1 based on a large number of Insight-HXMT observations between 2017 to 2019, we confirm the presence of X-ray millihertz quasi-periodic oscillations (mHz QPOs) at $\sim 0.01$ Hz. By applying wavelet analysis in our data analysis procedures, we firstly identified $\sim 0.005-0.009$ Hz QPOs coexisting with the $\sim 0.01$ Hz QPOs. Wavelet analysis suggests that these QPO features show transient behaviors, frequencies of mHz QPOs evolved in short time scales. There exists a positive relation between QPO centroid frequency (from $\sim 0.005-0.009$ Hz) and the X-ray luminosity, while the 10 mHz QPO frequencies keep nearly constant for different luminosities, which suggests different physical mechanisms for two types of mHz QPOs. The 10 mHz QPOs in both X-ray and UV bands would have the same origin related to the beat frequency where the Alfv$\acute{e}$n radius is close to the corotation radius, and the 5 mHz QPOs may originate from magnetic disk precession., Comment: 9 pages, 5 figures, 2 tables, accept for the publication in ApJ
- Published
- 2025
45. Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome
- Author
-
Wang, Wei, Chen, Zhaoxi, Zhang, Wenyu, Wang, Zetao, Zhao, Xiang, Li, Chenyang, Guan, Jian, Yin, Shankai, and Li, Gang
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Objective: The aim of the study is to develop a novel method for improved diagnosis of obstructive sleep apnea-hypopnea syndrome (OSAHS) in clinical or home settings, with the focus on achieving diagnostic performance comparable to the gold-standard polysomnography (PSG) with significantly reduced monitoring burden. Methods: We propose a method using millimeter-wave radar and pulse oximeter for OSAHS diagnosis (ROSA). It contains a sleep apnea-hypopnea events (SAE) detection network, which directly predicts the temporal localization of SAE, and a sleep staging network, which predicts the sleep stages throughout the night, based on radar signals. It also fuses oxygen saturation (SpO2) information from the pulse oximeter to adjust the score of SAE detected by radar. Results: Experimental results on a real-world dataset (>800 hours of overnight recordings, 100 subjects) demonstrated high agreement (ICC=0.9870) on apnea-hypopnea index (AHI) between ROSA and PSG. ROSA also exhibited excellent diagnostic performance, exceeding 90% in accuracy across AHI diagnostic thresholds of 5, 15 and 30 events/h. Conclusion: ROSA improves diagnostic accuracy by fusing millimeter-wave radar and pulse oximeter data. It provides a reliable and low-burden solution for OSAHS diagnosis. Significance: ROSA addresses the limitations of high complexity and monitoring burden associated with traditional PSG. The high accuracy and low burden of ROSA show its potential to improve the accessibility of OSAHS diagnosis among population.
- Published
- 2025
46. Estimating the Black Hole Spin for the X-Ray Binary MAXI J1727-203 Based on Insight-HXMT
- Author
-
Zhu, Haifan and Wang, Wei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics ,High Energy Physics - Phenomenology - Abstract
We constrain the spin of the black hole (BH) candidate MAXI J1727-203 using Insight-HXMT data. Due to limited HXMT observations covering only part of the outburst, NICER data were used to analyze the full outburst's state transitions, we identified two of three HXMT observations in the high soft state and applied the continuum-fitting method to measure the spin. Based on previous estimates and continuum spectral fittings, we explored the parameter space and found that the best-fitting values were $(D, i, M) \approx (6\ \text{kpc}, 30^\circ, 12 M_{\odot})$. We also tested the variation of these parameters using Monte Carlo simulations, sampling over 3000 sets within the parameter ranges: $5.9 \text{kpc}< D<7 \text{kpc}$, $24^\circ
- Published
- 2025
47. Top Ten Challenges Towards Agentic Neural Graph Databases
- Author
-
Bai, Jiaxin, Wang, Zihao, Zhou, Yukun, Yin, Hang, Fei, Weizhi, Hu, Qi, Deng, Zheye, Cheng, Jiayang, Zheng, Tianshi, Tsang, Hong Ting, Gao, Yisen, Xie, Zhongwei, Li, Yufei, Fan, Lixin, Yuan, Binhang, Wang, Wei, Chen, Lei, Zhou, Xiaofang, and Song, Yangqiu
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Databases ,Computer Science - Machine Learning - Abstract
Graph databases (GDBs) like Neo4j and TigerGraph excel at handling interconnected data but lack advanced inference capabilities. Neural Graph Databases (NGDBs) address this by integrating Graph Neural Networks (GNNs) for predictive analysis and reasoning over incomplete or noisy data. However, NGDBs rely on predefined queries and lack autonomy and adaptability. This paper introduces Agentic Neural Graph Databases (Agentic NGDBs), which extend NGDBs with three core functionalities: autonomous query construction, neural query execution, and continuous learning. We identify ten key challenges in realizing Agentic NGDBs: semantic unit representation, abductive reasoning, scalable query execution, and integration with foundation models like large language models (LLMs). By addressing these challenges, Agentic NGDBs can enable intelligent, self-improving systems for modern data-driven applications, paving the way for adaptable and autonomous data management solutions., Comment: 12 Pages
- Published
- 2025
48. A Survey on Multimodal Recommender Systems: Recent Advances and Future Directions
- Author
-
Xu, Jinfeng, Chen, Zheyu, Yang, Shuo, Li, Jinze, Wang, Wei, Hu, Xiping, Hoi, Steven, and Ngai, Edith
- Subjects
Computer Science - Information Retrieval ,Computer Science - Multimedia - Abstract
Acquiring valuable data from the rapidly expanding information on the internet has become a significant concern, and recommender systems have emerged as a widely used and effective tool for helping users discover items of interest. The essence of recommender systems lies in their ability to predict users' ratings or preferences for various items and subsequently recommend the most relevant ones based on historical interaction data and publicly available information. With the advent of diverse multimedia services, including text, images, video, and audio, humans can perceive the world through multiple modalities. Consequently, a recommender system capable of understanding and interpreting different modal data can more effectively refer to individual preferences. Multimodal Recommender Systems (MRS) not only capture implicit interaction information across multiple modalities but also have the potential to uncover hidden relationships between these modalities. The primary objective of this survey is to comprehensively review recent research advancements in MRS and to analyze the models from a technical perspective. Specifically, we aim to summarize the general process and main challenges of MRS from a technical perspective. We then introduce the existing MRS models by categorizing them into four key areas: Feature Extraction, Encoder, Multimodal Fusion, and Loss Function. Finally, we further discuss potential future directions for developing and enhancing MRS. This survey serves as a comprehensive guide for researchers and practitioners in MRS field, providing insights into the current state of MRS technology and identifying areas for future research. We hope to contribute to developing a more sophisticated and effective multimodal recommender system. To access more details of this paper, we open source a repository: https://github.com/Jinfeng-Xu/Awesome-Multimodal-Recommender-Systems.
- Published
- 2025
49. A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment
- Author
-
Hu, Bo, Wang, Wei, Li, Chunyi, He, Lihuo, Li, Leida, and Gao, Xinbo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Wide-angle video is favored for its wide viewing angle and ability to capture a large area of scenery, making it an ideal choice for sports and adventure recording. However, wide-angle video is prone to deformation, exposure and other distortions, resulting in poor video quality and affecting the perception and experience, which may seriously hinder its application in fields such as competitive sports. Up to now, few explorations focus on the quality assessment issue of wide-angle video. This deficiency primarily stems from the absence of a specialized dataset for wide-angle videos. To bridge this gap, we construct the first Multi-annotated and multi-modal Wide-angle Video quality assessment (MWV) dataset. Then, the performances of state-of-the-art video quality methods on the MWV dataset are investigated by inter-dataset testing and intra-dataset testing. Experimental results show that these methods impose significant limitations on their applicability.
- Published
- 2025
50. A White Dwarf Binary Candidate Discovered by LAMOST Using Dynamical Method
- Author
-
Zhu, Haifan, Wang, Wei, Lib, Xue, Li, Jia-jia, and Tian, Pengfu
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
We present the discovery of a binary system containing a white dwarf candidate using data from the LAMOST. Our analysis of the radial velocity data allowed us to determine an orbital period of approximately 0.953 days and a mass function of 0.129 $M_\odot$. Through spectral energy distribution (SED) fitting, we obtained the stellar parameters of the visible star. By combining these results with the mass function, we established a relationship between the mass of the invisible star and the system's inclination angle, along with the Roche lobe radius. We find that the mass of the invisible star is below the Chandrasekhar limit when the inclination angle exceeds $35^\circ$. Given that systems with large variations in radial velocity typically have high inclination angles, we classify the invisible star as a white dwarf candidate. The Roche lobe radius exceeds the physical radius of the visible star, indicating that no mass transfer occurs, which results in a weak ellipsoidal modulation effect. Additionally, we obtained light curves from the TESS, ASAS-SN, and CRTS surveys. The light curves also exhibit a periodicity of approximately 0.95 days, with ellipsoidal modulation only in the 2019 TESS observations. Coupled with the strong $\rm H_{\alpha}$ emission line observed in the LAMOST MRS spectrum, we infer that the surface of the visible star contains significant hot spots. This obscures the system's inherently weak ellipsoidal modulation, resulting in a manifestation of rotational variables. Furthermore, an analysis of the dynamical characteristics of this system indicates that it has a high inclination angle ($>60$ degrees) and its orbital properties are consistent with those of typical thin disk stars, supporting the hypothesis that the invisible object is a white dwarf., Comment: 13pages, 10figures. Accepted by the Journal of High Energy Astrophysics
- Published
- 2025
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.