30,589 results on '"Wang, Xi"'
Search Results
2. Look a Group at Once: Multi-Slide Modeling for Survival Prediction
- Author
-
Li, Xinyang, Zhang, Yi, Xie, Yi, Yang, Jianfei, Wang, Xi, Chen, Hao, and Zhang, Haixian
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Survival prediction is a critical task in pathology. In clinical practice, pathologists often examine multiple cases, leveraging a broader spectrum of cancer phenotypes to enhance pathological assessment. Despite significant advancements in deep learning, current solutions typically model each slide as a sample, struggling to effectively capture comparable and slide-agnostic pathological features. In this paper, we introduce GroupMIL, a novel framework inspired by the clinical practice of collective analysis, which models multiple slides as a single sample and organizes groups of patches and slides sequentially to capture cross-slide prognostic features. We also present GPAMamba, a model designed to facilitate intra- and inter-slide feature interactions, effectively capturing local micro-environmental characteristics within slide-level graphs while uncovering essential prognostic patterns across an extended patch sequence within the group framework. Furthermore, we develop a dual-head predictor that delivers comprehensive survival risk and probability assessments for each patient. Extensive empirical evaluations demonstrate that our model significantly outperforms state-of-the-art approaches across five datasets from The Cancer Genome Atlas.
- Published
- 2024
3. Responsible AI in Construction Safety: Systematic Evaluation of Large Language Models and Prompt Engineering
- Author
-
Sammour, Farouq, Xu, Jia, Wang, Xi, Hu, Mo, and Zhang, Zhenyu
- Subjects
Computer Science - Artificial Intelligence - Abstract
Construction remains one of the most hazardous sectors. Recent advancements in AI, particularly Large Language Models (LLMs), offer promising opportunities for enhancing workplace safety. However, responsible integration of LLMs requires systematic evaluation, as deploying them without understanding their capabilities and limitations risks generating inaccurate information, fostering misplaced confidence, and compromising worker safety. This study evaluates the performance of two widely used LLMs, GPT-3.5 and GPT-4o, across three standardized exams administered by the Board of Certified Safety Professionals (BCSP). Using 385 questions spanning seven safety knowledge areas, the study analyzes the models' accuracy, consistency, and reliability. Results show that both models consistently exceed the BCSP benchmark, with GPT-4o achieving an accuracy rate of 84.6% and GPT-3.5 reaching 73.8%. Both models demonstrate strengths in safety management systems and hazard identification and control, but exhibit weaknesses in science, mathematics, emergency response, and fire prevention. An error analysis identifies four primary limitations affecting LLM performance: lack of knowledge, reasoning flaws, memory issues, and calculation errors. Our study also highlights the impact of prompt engineering strategies, with variations in accuracy reaching 13.5% for GPT-3.5 and 7.9% for GPT-4o. However, no single prompt configuration proves universally effective. This research advances knowledge in three ways: by identifying areas where LLMs can support safety practices and where human oversight remains essential, by offering practical insights into improving LLM implementation through prompt engineering, and by providing evidence-based direction for future research and development. These contributions support the responsible integration of AI in construction safety management toward achieving zero injuries., Comment: 29 pages, 5 figures
- Published
- 2024
4. Lambda-pure global dimension of Grothendieck categories and some applications
- Author
-
Wang, Xi, Yao, Hailou, and Shen, Lei
- Subjects
Mathematics - Category Theory - Abstract
We study the $\lambda$-pure global dimension of a Grothendieck category $\cal A$, and provide two different applications about this dimension. We obtain that if the $\lambda$-pure global dimension $\plgldA<\infty$, then (1) The ordinary bounded derived category (where $\cal A$ has enough projective objects) and the bounded $\lambda$-pure one differ only by a homotopy category; (2) The $\lambda$-pure singularity category $\DlsgA =0$. At last, we explore the reason why the general construction of classic Buchweitz-Happel Theorem is not feasible for $\lambda$-pure one.
- Published
- 2024
5. Augmenting the Veracity and Explanations of Complex Fact Checking via Iterative Self-Revision with LLMs
- Author
-
Zhang, Xiaocheng, Wang, Xi, Lu, Yifei, Ye, Zhuangzhuang, Wang, Jianing, Bao, Mengjiao, Yan, Peng, and Su, Xiaohong
- Subjects
Computer Science - Computation and Language - Abstract
Explanation generation plays a more pivotal role than fact verification in producing interpretable results and facilitating comprehensive fact-checking, which has recently garnered considerable attention. However, previous studies on explanation generation has shown several limitations, such as being confined to English scenarios, involving overly complex inference processes, and not fully unleashing the potential of the mutual feedback between veracity labels and explanation texts. To address these issues, we construct two complex fact-checking datasets in the Chinese scenarios: CHEF-EG and TrendFact. These datasets involve complex facts in areas such as health, politics, and society, presenting significant challenges for fact verification methods. In response to these challenges, we propose a unified framework called FactISR (Augmenting Fact-Checking via Iterative Self-Revision) to perform mutual feedback between veracity and explanations by leveraging the capabilities of large language models(LLMs). FactISR uses a single model to address tasks such as fact verification and explanation generation. Its self-revision mechanism can further revision the consistency between veracity labels, explanation texts, and evidence, as well as eliminate irrelevant noise. We conducted extensive experiments with baselines and FactISR on the proposed datasets. The experimental results demonstrate the effectiveness of our method.
- Published
- 2024
6. LEAD: Latent Realignment for Human Motion Diffusion
- Author
-
Andreou, Nefeli, Wang, Xi, Abrevaya, Victoria Fernández, Cani, Marie-Paule, Chrysanthou, Yiorgos, and Kalogeiton, Vicky
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Graphics - Abstract
Our goal is to generate realistic human motion from natural language. Modern methods often face a trade-off between model expressiveness and text-to-motion alignment. Some align text and motion latent spaces but sacrifice expressiveness; others rely on diffusion models producing impressive motions, but lacking semantic meaning in their latent space. This may compromise realism, diversity, and applicability. Here, we address this by combining latent diffusion with a realignment mechanism, producing a novel, semantically structured space that encodes the semantics of language. Leveraging this capability, we introduce the task of textual motion inversion to capture novel motion concepts from a few examples. For motion synthesis, we evaluate LEAD on HumanML3D and KIT-ML and show comparable performance to the state-of-the-art in terms of realism, diversity, and text-motion consistency. Our qualitative analysis and user study reveal that our synthesized motions are sharper, more human-like and comply better with the text compared to modern methods. For motion textual inversion, our method demonstrates improved capacity in capturing out-of-distribution characteristics in comparison to traditional VAEs.
- Published
- 2024
7. Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation
- Author
-
Xiang, Xuezhi, Wang, Xi, Zhang, Lei, Ombati, Denis, Himu, Himaloy, and Zhen, Xiantong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Scene flow estimation aims to generate the 3D motion field of points between two consecutive frames of point clouds, which has wide applications in various fields. Existing point-based methods ignore the irregularity of point clouds and have difficulty capturing long-range dependencies due to the inefficiency of point-level computation. Voxel-based methods suffer from the loss of detail information. In this paper, we propose a point-voxel fusion method, where we utilize a voxel branch based on sparse grid attention and the shifted window strategy to capture long-range dependencies and a point branch to capture fine-grained features to compensate for the information loss in the voxel branch. In addition, since xyz coordinates are difficult to describe the geometric structure of complex 3D objects in the scene, we explicitly encode the local surface information of the point cloud through the umbrella surface feature extraction (USFE) module. We verify the effectiveness of our method by conducting experiments on the Flyingthings3D and KITTI datasets. Our method outperforms all other self-supervised methods and achieves highly competitive results compared to fully supervised methods. We achieve improvements in all metrics, especially EPE, which is reduced by 8.51% and 10.52% on the KITTIo and KITTIs datasets, respectively., Comment: The paper is under consideration at 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
- Published
- 2024
8. KBLaM: Knowledge Base augmented Language Model
- Author
-
Wang, Xi, Mikaelyan, Liana, Isazawa, Taketomo, and Hensman, James
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
In this paper, we propose Knowledge Base augmented Language Model (KBLaM), a new method for augmenting Large Language Models (LLMs) with external knowledge. KBLaM works with a knowledge base (KB) constructed from a corpus of documents, transforming each piece of knowledge in the KB into continuous key-value vector pairs via pre-trained sentence encoders with linear adapters and integrating them into pre-trained LLMs via a specialized rectangular attention mechanism. Unlike Retrieval-Augmented Generation, KBLaM eliminates external retrieval modules, and unlike in-context learning, its computational overhead scales linearly with KB size rather than quadratically. Our approach enables integrating a large KB of more than 10K triples into an 8B pre-trained LLM of only 8K context window on one single A100 80GB GPU and allows for dynamic updates without model fine-tuning or retraining. Experiments demonstrate KBLaM's effectiveness in various tasks, including question-answering and open-ended reasoning, while providing interpretable insights into its use of the augmented knowledge.
- Published
- 2024
9. Formation of Anisotropic Polarons in Antimony Selenide
- Author
-
Shi, Yijie, Wang, Xi, Wang, Zhong, Zhang, Zheng, Hua, Fuyong, Chen, Chao, Hu, Chunlong, Tang, Jiang, and Liang, Wenxi
- Subjects
Condensed Matter - Materials Science ,Physics - Optics - Abstract
Antimony Selenide (Sb$_2$Se$_3$) is an attractive candidate of photovoltaics with not yet satisfying efficiency. Beside defects, polaron formation originated from lattice distortion was proposed to account for trapping free carriers, and the subsequent photoexcitation dynamics and optoelectronic properties, but such a mechanism is still lack of structural observations. Here we directly track the pathways of carrier and lattice evolutions after photoexcitation through optical and electron diffraction pump-probe methods, revealing the temporal correlations between dynamics of both degrees of freedom. The observed opposite separation changes of Se2-Sb2 and Sb2-Sb1 atom pairs in a few picoseconds, and the intermediate state induced by local structural distortions lasting several tens of picoseconds, coinciding with the optical phonons population and coupling, and the trapping process of carriers, respectively, together with the analyses of modulation on diffuse scattering by the atomic displacement fields of polaron model, indicate the formation of anisotropic polarons with large size. Our findings provide carrier and structural information for helping the elucidation of polaron scenario in Sb2Se3, and probably in materials with anisotropic structure and soft lattice which are popular in developing novel optoelectronics.
- Published
- 2024
10. Active nonreciprocal cloaking for pseudo-Hermitian magnons
- Author
-
Schulz, Dominik, Berakdar, Jamal, and Wang, Xi-guang
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Cloaking has important applications but entails sophisticated control of signal propagation and scattering characteristics. Here, we show that invisibility for magnon signals is achievable in a non-reciprocal and electrically controlled way by engineering the magnonic channels such that they exhibit PT-symmetry. This is accomplished by attaching current-carrying heavy metal contacts to the magnon waveguides and exerting fields from an attached bias layer. Tuning the current density in the metal layer, the magnons in this setup experience electrically controlled, compensated gain and loss due to spin-orbit torque which renders the setup PT-symmetric. The magnon dynamics is then shown to be pseudo-Hermitian with exceptional points (EPs) determined actively by an external electric field. We analyze the magnon scattering from single and periodic PT-symmetric regions and identify the conditions necessary for the formation of unidirectional invisibility which can be steered by specific combinations of bias layers and current amplitudes in the heavy metal as to reach the EP. The unidirectional invisibility at EP is found to be extended for a periodic PT-symmetric region. Intrinsic damping on PT-symmetric unidirectional invisibility is shown to be marginal confirming the experimental feasibility. It is shown how the unidirectional magnons can be utilized to amplify and generate magnonic orbital angular momentum states in coupled magnetic rings demonstrating a new path for manipulating magnon propagation and processing., Comment: 25 pages, 6 figures
- Published
- 2024
11. Design, manufacturing, and inverse dynamic modeling of soft parallel robots actuated by dielectric elastomer actuators
- Author
-
Chang, Jung-Che, Wang, Xi, Axinte, Dragos, and Dong, Xin
- Subjects
Computer Science - Robotics ,Electrical Engineering and Systems Science - Systems and Control - Abstract
Soft parallel robots with their manipulation safety and low commercial cost show a promising future for delicate operations and safe human-robot interactions. However, promoting the use of electroactive polymers (EAPs) is still challenging due to the under-improving quality of the product and the dynamic modelling of the collaborations between multiple actuators. This article presents the design, fabrication, modelling and control of a parallel kinematics Delta robot actuated by dielectric elastomer actuators (DEAs). The trade-off between the actuation force and stroke is retaken by an angular stroke amplification mechanism, and the weight of the robot frame is reduced by utilizing 3D puzzling strip structures. A generic way of constructing a high-stability conductive paint on a silicon-based film has been achieved by laser scanning the DE-film and then sandwiching a conductive particle-based electrode with a paint which is mixed by the particles and photosensitive resin. Compared to the wildly used carbon grease, the fabricated electrode shows a higher consistency in its dynamic behaviour before and after the on-stand test. Finally, to predict the output force and inverse motion of the robot end effector, we constructed the inverse dynamic model by introducing an expanded Bergstrom-Boyce model to the constitutive behavior of the dielectric film. The experimental results show a prediction of robot output force with RSME of 12.4% when the end effector remains stationary, and a well-followed trajectory with less than RSME 2.5%., Comment: 17 pages, 12 figures
- Published
- 2024
12. Bi-stable thin soft robot for in-plane locomotion in narrow space
- Author
-
Wang, Xi, Chang, Jung-che, Wang, Feiran, Axinte, Dragos, and Dong, Xin
- Subjects
Computer Science - Robotics ,Physics - Classical Physics - Abstract
Dielectric elastomer actuators (DEAs), also recognized as artificial muscle, have been widely developed for the soft locomotion robot. With the complaint skeleton and miniaturized dimension, they are well suited for the narrow space inspection. In this work, we propose a novel low profile (1.1mm) and lightweight (1.8g) bi-stable in-plane DEA (Bi-DEA) constructed by supporting a dielectric elastomer onto a flat bi-stable mechanism. It has an amplified displacement and output force compared with the in-plane DEA (I-DEA) without the bi-stable mechanism. Then, the Bi-DEA is applied to a thin soft robot, using three electrostatic adhesive pads (EA-Pads) as anchoring elements. This robot is capable of crawling and climbing to access millimetre-scale narrow gaps. A theoretical model of the bi-stable mechanism and the DEA are presented. The enhanced performance of the Bi-DEA induced by the mechanism is experimentally validated. EA-Pad provides the adhesion between the actuator and the locomotion substrate, allowing crawling and climbing on various surfaces, i.e., paper and acrylic. The thin soft robot has been demonstrated to be capable of crawling through a 4mm narrow gap with a speed up to 3.3mm/s (0.07 body length per second and 2.78 body thickness per second)., Comment: 8 pages, 12 figures
- Published
- 2024
13. Distinguishing black holes with and without spontaneous scalarization in Einstein-scalar-Gauss-Bonnet theories via optical features
- Author
-
Wang, Xi-Jing, Meng, Yuan, Kuang, Xiao-Mei, and Liao, Kai
- Subjects
General Relativity and Quantum Cosmology - Abstract
Spontaneous scalarization in Einstein-scalar-Gauss-Bonnet theory admits both vacuum-general relativity (GR) and scalarized hairy black holes as valid solutions, which provides a distinctive signature of new physics in strong gravity regime. In this paper, we shall examine the optical features of Gauss-Bonnet black holes with spontaneous scalarization, which is governed by the coupling parameter $\lambda$. We find that the photon sphere, critical impact parameter and innermost stable circular orbit all decrease as the increasing of $\lambda$. Using observable data from Event Horizon Telescope, we establish the upper limit for $\lambda$. Then we construct the optical appearances of the scalarized black holes illuminated by various thin accretions. Our findings reveal that the scalarized black holes consistently exhibit smaller shadow sizes and reduced brightness compared to Schwarzschild black holes. Notably, in the case of thin spherical accretion, the shadow of the scalarized black hole is smaller, but the surrounding bright ring is more pronounced. Our results highlight the observable features of the scalarized black holes, providing a distinguishable probe from their counterpart in GR in strong gravity regime., Comment: 18 pages, 10 figures
- Published
- 2024
14. Report on the Workshop on Simulations for Information Access (Sim4IA 2024) at SIGIR 2024
- Author
-
Breuer, Timo, Kreutz, Christin Katharina, Fuhr, Norbert, Balog, Krisztian, Schaer, Philipp, Bernard, Nolwenn, Frommholz, Ingo, Gohsen, Marcel, Ji, Kaixin, Jones, Gareth J. F., Keller, Jüri, Liu, Jiqun, Mladenov, Martin, Pasi, Gabriella, Trippas, Johanne, Wang, Xi, Zerhoudi, Saber, and Zhai, ChengXiang
- Subjects
Computer Science - Information Retrieval - Abstract
This paper is a report of the Workshop on Simulations for Information Access (Sim4IA) workshop at SIGIR 2024. The workshop had two keynotes, a panel discussion, nine lightning talks, and two breakout sessions. Key takeaways were user simulation's importance in academia and industry, the possible bridging of online and offline evaluation, and the issues of organizing a companion shared task around user simulations for information access. We report on how we organized the workshop, provide a brief overview of what happened at the workshop, and summarize the main topics and findings of the workshop and future work., Comment: Preprint of a SIGIR Forum submission for Vol. 58 No. 2 - December 2024
- Published
- 2024
15. Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking
- Author
-
Wang, Xi, Chen, Tianxing, Yu, Qiaojun, Xu, Tianling, Chen, Zanxin, Fu, Yiting, Lu, Cewu, Mu, Yao, and Luo, Ping
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Graphics ,Computer Science - Machine Learning - Abstract
Articulated object manipulation requires precise object interaction, where the object's axis must be carefully considered. Previous research employed interactive perception for manipulating articulated objects, but typically, open-loop approaches often suffer from overlooking the interaction dynamics. To address this limitation, we present a closed-loop pipeline integrating interactive perception with online axis estimation from segmented 3D point clouds. Our method leverages any interactive perception technique as a foundation for interactive perception, inducing slight object movement to generate point cloud frames of the evolving dynamic scene. These point clouds are then segmented using Segment Anything Model 2 (SAM2), after which the moving part of the object is masked for accurate motion online axis estimation, guiding subsequent robotic actions. Our approach significantly enhances the precision and efficiency of manipulation tasks involving articulated objects. Experiments in simulated environments demonstrate that our method outperforms baseline approaches, especially in tasks that demand precise axis-based control. Project Page: https://hytidel.github.io/video-tracking-for-axis-estimation/., Comment: Project Page: https://hytidel.github.io/video-tracking-for-axis-estimation/
- Published
- 2024
16. MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation
- Author
-
He, Junqing, Zhu, Liang, Wang, Rui, Wang, Xi, Haffari, Reza, and Zhang, Jiaxing
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Long-term memory is important for chatbots and dialogue systems (DS) to create consistent and human-like conversations, evidenced by numerous developed memory-augmented DS (MADS). To evaluate the effectiveness of such MADS, existing commonly used evaluation metrics, like retrieval accuracy and perplexity (PPL), mainly focus on query-oriented factualness and language quality assessment. However, these metrics often lack practical value. Moreover, the evaluation dimensions are insufficient for human-like assessment in DS. Regarding memory-recalling paradigms, current evaluation schemes only consider passive memory retrieval while ignoring diverse memory recall with rich triggering factors, e.g., emotions and surroundings, which can be essential in emotional support scenarios. To bridge the gap, we construct a novel Memory-Augmented Dialogue Benchmark (MADail-Bench) covering various memory-recalling paradigms based on cognitive science and psychology theories. The benchmark assesses two tasks separately: memory retrieval and memory recognition with the incorporation of both passive and proactive memory recall data. We introduce new scoring criteria to the evaluation, including memory injection, emotion support (ES) proficiency, and intimacy, to comprehensively assess generated responses. Results from cutting-edge embedding models and large language models on this benchmark indicate the potential for further advancement. Extensive testing further reveals correlations between memory injection, ES proficiency, and intimacy., Comment: Submitted to NAACL 2025
- Published
- 2024
17. Location is Key: Leveraging Large Language Model for Functional Bug Localization in Verilog
- Author
-
Yao, Bingkun, Wang, Ning, Zhou, Jie, Wang, Xi, Gao, Hong, Jiang, Zhe, and Guan, Nan
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Artificial Intelligence - Abstract
Bug localization in Verilog code is a crucial and time-consuming task during the verification of hardware design. Since introduction, Large Language Models (LLMs) have showed their strong programming capabilities. However, no work has yet considered using LLMs for bug localization in Verilog code. This paper presents Location-is-Key, an opensource LLM solution to locate functional errors in Verilog snippets. LiK achieves high localization accuracy, with a pass@1 localization accuracy of 93.3% on our test dataset based on RTLLM, surpassing GPT-4's 77.9% and comparable to Claude-3.5's 90.8%. Additionally, the bug location obtained by LiK significantly improves GPT-3.5's bug repair efficiency (Functional pass@1 increased from 40.39% to 58.92%), highlighting the importance of bug localization in LLM-based Verilog debugging. Compared to existing methods, LiK only requires the design specification and the erroneous code snippet, without the need for testbenches, assertions, or any other EDA tools. This research demonstrates the feasibility of using LLMs for Verilog error localization, thus providing a new direction for automatic Verilog code debugging.
- Published
- 2024
18. Self-Updating Vehicle Monitoring Framework Employing Distributed Acoustic Sensing towards Real-World Settings
- Author
-
Wang, Xi, Liu, Xin, Zhu, Songming, Li, Zhanwen, and Gao, Lina
- Subjects
Physics - Geophysics ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Signal Processing - Abstract
The recent emergence of Distributed Acoustic Sensing (DAS) technology has facilitated the effective capture of traffic-induced seismic data. The traffic-induced seismic wave is a prominent contributor to urban vibrations and contain crucial information to advance urban exploration and governance. However, identifying vehicular movements within massive noisy data poses a significant challenge. In this study, we introduce a real-time semi-supervised vehicle monitoring framework tailored to urban settings. It requires only a small fraction of manual labels for initial training and exploits unlabeled data for model improvement. Additionally, the framework can autonomously adapt to newly collected unlabeled data. Before DAS data undergo object detection as two-dimensional images to preserve spatial information, we leveraged comprehensive one-dimensional signal preprocessing to mitigate noise. Furthermore, we propose a novel prior loss that incorporates the shapes of vehicular traces to track a single vehicle with varying speeds. To evaluate our model, we conducted experiments with seismic data from the Stanford 2 DAS Array. The results showed that our model outperformed the baseline model Efficient Teacher and its supervised counterpart, YOLO (You Only Look Once), in both accuracy and robustness. With only 35 labeled images, our model surpassed YOLO's mAP 0.5:0.95 criterion by 18% and showed a 7% increase over Efficient Teacher. We conducted comparative experiments with multiple update strategies for self-updating and identified an optimal approach. This approach surpasses the performance of non-overfitting training conducted with all data in a single pass.
- Published
- 2024
19. Toward satisfactory public accessibility: A crowdsourcing approach through online reviews to inclusive urban design
- Author
-
Li, Lingyao, Hu, Songhua, Dai, Yinpei, Deng, Min, Momeni, Parisa, Laverghetta, Gabriel, Fan, Lizhou, Ma, Zihui, Wang, Xi, Ma, Siyuan, Ligatti, Jay, and Hemphill, Libby
- Subjects
Computer Science - Social and Information Networks - Abstract
As urban populations grow, the need for accessible urban design has become urgent. Traditional survey methods for assessing public perceptions of accessibility are often limited in scope. Crowdsourcing via online reviews offers a valuable alternative to understanding public perceptions, and advancements in large language models can facilitate their use. This study uses Google Maps reviews across the United States and fine-tunes Llama 3 model with the Low-Rank Adaptation technique to analyze public sentiment on accessibility. At the POI level, most categories -- restaurants, retail, hotels, and healthcare -- show negative sentiments. Socio-spatial analysis reveals that areas with higher proportions of white residents and greater socioeconomic status report more positive sentiment, while areas with more elderly, highly-educated residents exhibit more negative sentiment. Interestingly, no clear link is found between the presence of disabilities and public sentiments. Overall, this study highlights the potential of crowdsourcing for identifying accessibility challenges and providing insights for urban planners.
- Published
- 2024
20. Learning to Discover Forgery Cues for Face Forgery Detection
- Author
-
Tian, Jiahe, Chen, Peng, Yu, Cai, Fu, Xiaomeng, Wang, Xi, Dai, Jiao, and Han, Jizhong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Locating manipulation maps, i.e., pixel-level annotation of forgery cues, is crucial for providing interpretable detection results in face forgery detection. Related learning objects have also been widely adopted as auxiliary tasks to improve the classification performance of detectors whereas they require comparisons between paired real and forged faces to obtain manipulation maps as supervision. This requirement restricts their applicability to unpaired faces and contradicts real-world scenarios. Moreover, the used comparison methods annotate all changed pixels, including noise introduced by compression and upsampling. Using such maps as supervision hinders the learning of exploitable cues and makes models prone to overfitting. To address these issues, we introduce a weakly supervised model in this paper, named Forgery Cue Discovery (FoCus), to locate forgery cues in unpaired faces. Unlike some detectors that claim to locate forged regions in attention maps, FoCus is designed to sidestep their shortcomings of capturing partial and inaccurate forgery cues. Specifically, we propose a classification attentive regions proposal module to locate forgery cues during classification and a complementary learning module to facilitate the learning of richer cues. The produced manipulation maps can serve as better supervision to enhance face forgery detectors. Visualization of the manipulation maps of the proposed FoCus exhibits superior interpretability and robustness compared to existing methods. Experiments on five datasets and four multi-task models demonstrate the effectiveness of FoCus in both in-dataset and cross-dataset evaluations., Comment: TIFS 2024
- Published
- 2024
- Full Text
- View/download PDF
21. SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
- Author
-
Rahmani, Hossein A., Wang, Xi, Yilmaz, Emine, Craswell, Nick, Mitra, Bhaskar, and Thomas, Paul
- Subjects
Computer Science - Information Retrieval - Abstract
Large-scale test collections play a crucial role in Information Retrieval (IR) research. However, according to the Cranfield paradigm and the research into publicly available datasets, the existing information retrieval research studies are commonly developed on small-scale datasets that rely on human assessors for relevance judgments - a time-intensive and expensive process. Recent studies have shown the strong capability of Large Language Models (LLMs) in producing reliable relevance judgments with human accuracy but at a greatly reduced cost. In this paper, to address the missing large-scale ad-hoc document retrieval dataset, we extend the TREC Deep Learning Track (DL) test collection via additional language model synthetic labels to enable researchers to test and evaluate their search systems at a large scale. Specifically, such a test collection includes more than 1,900 test queries from the previous years of tracks. We compare system evaluation with past human labels from past years and find that our synthetically created large-scale test collection can lead to highly correlated system rankings., Comment: 9 pages, resource paper
- Published
- 2024
22. Cloud-Based Federation Framework and Prototype for Open, Scalable, and Shared Access to NextG and IoT Testbeds
- Author
-
McManus, Maxwell, Rinchen, Tenzin, Dey, Annoy, Thota, Sumanth, Zhang, Zhaoxi, Hu, Jiangqi, Wang, Xi, Ji, Mingyue, Mastronarde, Nicholas, Bentley, Elizabeth Serena, Medley, Michael, and Guan, Zhangyu
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Networking and Internet Architecture - Abstract
In this work, we present a new federation framework for UnionLabs, an innovative cloud-based resource-sharing infrastructure designed for next-generation (NextG) and Internet of Things (IoT) over-the-air (OTA) experiments. The framework aims to reduce the federation complexity for testbeds developers by automating tedious backend operations, thereby providing scalable federation and remote access to various wireless testbeds. We first describe the key components of the new federation framework, including the Systems Manager Integration Engine (SMIE), the Automated Script Generator (ASG), and the Database Context Manager (DCM). We then prototype and deploy the new Federation Plane on the Amazon Web Services (AWS) public cloud, demonstrating its effectiveness by federating two wireless testbeds: i) UB NeXT, a 5G-and-beyond (5G+) testbed at the University at Buffalo, and ii) UT IoT, an IoT testbed at the University of Utah. Through this work we aim to initiate a grassroots campaign to democratize access to wireless research testbeds with heterogeneous hardware resources and network environment, and accelerate the establishment of a mature, open experimental ecosystem for the wireless community. The API of the new Federation Plane will be released to the community after internal testing is completed.
- Published
- 2024
23. Decoupling Feature Representations of Ego and Other Modalities for Incomplete Multi-modal Brain Tumor Segmentation
- Author
-
Yang, Kaixiang, Shan, Wenqi, Li, Xudong, Wang, Xuan, Yang, Xikai, Wang, Xi, Heng, Pheng-Ann, Li, Qiang, and Wang, Zhiwei
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Multi-modal brain tumor segmentation typically involves four magnetic resonance imaging (MRI) modalities, while incomplete modalities significantly degrade performance. Existing solutions employ explicit or implicit modality adaptation, aligning features across modalities or learning a fused feature robust to modality incompleteness. They share a common goal of encouraging each modality to express both itself and the others. However, the two expression abilities are entangled as a whole in a seamless feature space, resulting in prohibitive learning burdens. In this paper, we propose DeMoSeg to enhance the modality adaptation by Decoupling the task of representing the ego and other Modalities for robust incomplete multi-modal Segmentation. The decoupling is super lightweight by simply using two convolutions to map each modality onto four feature sub-spaces. The first sub-space expresses itself (Self-feature), while the remaining sub-spaces substitute for other modalities (Mutual-features). The Self- and Mutual-features interactively guide each other through a carefully-designed Channel-wised Sparse Self-Attention (CSSA). After that, a Radiologist-mimic Cross-modality expression Relationships (RCR) is introduced to have available modalities provide Self-feature and also `lend' their Mutual-features to compensate for the absent ones by exploiting the clinical prior knowledge. The benchmark results on BraTS2020, BraTS2018 and BraTS2015 verify the DeMoSeg's superiority thanks to the alleviated modality adaptation difficulty. Concretely, for BraTS2020, DeMoSeg increases Dice by at least 0.92%, 2.95% and 4.95% on whole tumor, tumor core and enhanced tumor regions, respectively, compared to other state-of-the-arts. Codes are at https://github.com/kk42yy/DeMoSeg, Comment: 8 pages, 4 figures
- Published
- 2024
24. Spatio-Temporal Communication Compression for Distributed Prime-Dual Optimization
- Author
-
Ren, Zihao, Wang, Lei, Yi, Xinlei, Wang, Xi, Yuan, Deming, Yang, Tao, Wu, Zhengguang, and Shi, Guodong
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Several data compressors have been proposed in distributed optimization frameworks of network systems to reduce communication overhead in large-scale applications. In this paper, we demonstrate that effective information compression may occur over time or space during sequences of node communications in distributed algorithms, leading to the concept of spatio-temporal compressors. This abstraction classifies existing compressors as spatio-temporal compressors, with their effectiveness described by constructive stability criteria from nonlinear system theory. Subsequently, we apply these spatio-temporal compressors to standard continuous-time consensus flows and distributed prime-dual flows, establishing conditions ensuring convergence. Additionally, we introduce a novel observer-based distributed primal-dual continuous flow integrated with spatio-temporal compressors, which provides broader convergence conditions. These continuous flows achieve exponential convergence to the global optimum when the objective function is strongly convex and can be discretized using Euler approximations. Finally, numerical simulations illustrate the versatility of the proposed spatio-temporal compressors and verify the convergence of algorithms., Comment: arXiv admin note: text overlap with arXiv:2408.02332
- Published
- 2024
25. Context-Driven Index Trimming: A Data Quality Perspective to Enhancing Precision of RALMs
- Author
-
Ma, Kexin, Jin, Ruochun, Wang, Xi, Chen, Huan, Ren, Jing, and Tang, Yuhua
- Subjects
Computer Science - Computation and Language ,Computer Science - Databases - Abstract
Retrieval-Augmented Large Language Models (RALMs) have made significant strides in enhancing the accuracy of generated responses.However, existing research often overlooks the data quality issues within retrieval results, often caused by inaccurate existing vector-distance-based retrieval methods.We propose to boost the precision of RALMs' answers from a data quality perspective through the Context-Driven Index Trimming (CDIT) framework, where Context Matching Dependencies (CMDs) are employed as logical data quality rules to capture and regulate the consistency between retrieved contexts.Based on the semantic comprehension capabilities of Large Language Models (LLMs), CDIT can effectively identify and discard retrieval results that are inconsistent with the query context and further modify indexes in the database, thereby improving answer quality.Experiments demonstrate on challenging question-answering tasks.Also, the flexibility of CDIT is verified through its compatibility with various language models and indexing methods, which offers a promising approach to bolster RALMs' data quality and retrieval precision jointly.
- Published
- 2024
26. Source-Free Domain-Invariant Performance Prediction
- Author
-
Khramtsova, Ekaterina, Baktashmotlagh, Mahsa, Zuccon, Guido, Wang, Xi, and Salzmann, Mathieu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Accurately estimating model performance poses a significant challenge, particularly in scenarios where the source and target domains follow different data distributions. Most existing performance prediction methods heavily rely on the source data in their estimation process, limiting their applicability in a more realistic setting where only the trained model is accessible. The few methods that do not require source data exhibit considerably inferior performance. In this work, we propose a source-free approach centred on uncertainty-based estimation, using a generative model for calibration in the absence of source data. We establish connections between our approach for unsupervised calibration and temperature scaling. We then employ a gradient-based strategy to evaluate the correctness of the calibrated predictions. Our experiments on benchmark object recognition datasets reveal that existing source-based methods fall short with limited source sample availability. Furthermore, our approach significantly outperforms the current state-of-the-art source-free and source-based methods, affirming its effectiveness in domain-invariant performance estimation., Comment: Accepted in ECCV 2024
- Published
- 2024
27. Adaptive Retrieval-Augmented Generation for Conversational Systems
- Author
-
Wang, Xi, Sen, Procheta, Li, Ruizhe, and Yilmaz, Emine
- Subjects
Computer Science - Computation and Language ,Computer Science - Information Retrieval - Abstract
Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses. Hence, many existing studies commonly assume the always need for Retrieval Augmented Generation (RAG) in a conversational system without explicit control. This raises a research question about such a necessity. In this study, we propose to investigate the need for each turn of system response to be augmented with external knowledge. In particular, by leveraging human judgements on the binary choice of adaptive augmentation, we develop RAGate, a gating model, which models conversation context and relevant inputs to predict if a conversational system requires RAG for improved responses. We conduct extensive experiments on devising and applying RAGate to conversational models and well-rounded analyses of different conversational scenarios. Our experimental results and analysis indicate the effective application of RAGate in RAG-based conversational systems in identifying system responses for appropriate RAG with high-quality responses and a high generation confidence. This study also identifies the correlation between the generation's confidence level and the relevance of the augmented knowledge., Comment: 12 pages, under review
- Published
- 2024
28. Three-Photon Polarization Entanglement of Green Light
- Author
-
Lou, Yan-Chao, Ren, Zhi-Cheng, Chen, Chao, Wan, Pei, Zhu, Wen-Zheng, Wang, Jing, Xue, Shu-Tian, Dong, Bo-Wen, Ding, Jianping, Wang, Xi-Lin, and Wang, Hui-Tian
- Subjects
Quantum Physics - Abstract
Recently, great progress has been made in the entanglement of multiple photons at various wavelengths and in different degrees of freedom for optical quantum information applied in diverse scenarios. However, multi-photon entanglement in the transmission window of green light under the water has not been reported yet. Here, by combining femtosecond laser based multi-photon entanglement and entanglement-maintaining frequency upconversion techniques, we successfully generate a green two-photon polarization-entangled Bell state and a green three-photon Greenberger-Horne-Zeilinger (GHZ) state, whose state fidelities are 0.893$\mathbf{\pm}$0.002 and 0.595$\mathbf{\pm}$0.023, respectively. Our result provides a scalable method to prepare green multi-photon entanglement, which may have wide applications in underwater quantum information., Comment: 8 pages, 4 figures
- Published
- 2024
- Full Text
- View/download PDF
29. Heralded High-Dimensional Photon-Photon Quantum Gate
- Author
-
Liu, Zhi-Feng, Ren, Zhi-Cheng, Wan, Pei, Zhu, Wen-Zheng, Cheng, Zi-Mo, Wang, Jing, Shi, Yu-Peng, Xi, Han-Bing, Huber, Marcus, Friis, Nicolai, Gao, Xiaoqin, Wang, Xi-Lin, and Wang, Hui-Tian
- Subjects
Quantum Physics - Abstract
High-dimensional encoding of quantum information holds the potential to greatly increase the computational power of existing devices by enlarging the accessible state space for fixed register size and by reducing the number of required entangling gates. However, qudit-based quantum computation remains far less developed than conventional qubit-based approaches, in particular for photons, which represent natural multi-level information carriers that play a crucial role in the development of quantum networks. A major obstacle for realizing quantum gates between two individual photons is the restriction of direct interaction between photons in linear media. In particular, essential logic components for quantum operations such as native qudit-qudit entangling gates are still missing for optical quantum information processing. Here we address this challenge by presenting a protocol for realizing an entangling gate -- the controlled phase-flip (CPF) gate -- for two photonic qudits in arbitrary dimension. We experimentally demonstrate this protocol by realizing a four-dimensional qudit-qudit CPF gate, whose decomposition would require at least 13 two-qubit entangling gates. Our photonic qudits are encoded in orbital angular momentum (OAM) and we have developed a new active high-precision phase-locking technology to construct a high-dimensional OAM beam splitter that increases the stability of the CPF gate, resulting in a process fidelity within a range of $ [0.64 \pm 0.01, 0.82 \pm 0.01]$. Our experiment represents a significant advance for high-dimensional optical quantum information processing and has the potential for wider applications beyond optical system., Comment: 14 pages, 7 figures
- Published
- 2024
30. Large Language Model for Verilog Generation with Golden Code Feedback
- Author
-
Wang, Ning, Yao, Bingkun, Zhou, Jie, Wang, Xi, Jiang, Zhe, and Guan, Nan
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Artificial Intelligence - Abstract
Recent advancements in large language models (LLMs) have catalyzed significant interest in the automatic generation of Register-Transfer Level (RTL) code, particularly Verilog, from natural language instructions. While commercial LLMs like ChatGPT have dominated this domain, open-source alternatives have lagged considerably in performance, limiting the flexibility and data privacy of this emerging technology. This study introduces a novel approach utilizing reinforcement learning with golden code feedback to enhance the performance of pre-trained models. Leveraging open-source data and base models, we have achieved state-of-the-art (SOTA) results with a substantial margin. Notably, our 6.7B parameter model \ours{} demonstrates superior performance compared to current best-in-class 13B and 16B models. Furthermore, through a comprehensive analysis of the limitations in direct fine-tuning and the training dynamics of reinforcement learning, we posit that the development of comprehensive supervisory signals, which are align with the inherent parallel semantics of Verilog code, is critical to effective generation. The code and data associated with this research are publicly available at \url{https://github.com/CatIIIIIIII/veriseek}. The model weights can be accessed at \url{https://huggingface.co/WANGNingroci/VeriSeek}.
- Published
- 2024
31. Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models
- Author
-
Li, Qiao, Fu, Xiaomeng, Wang, Xi, Liu, Jin, Gao, Xingyu, Dai, Jiao, and Han, Jizhong
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
With the rapid advancements of large-scale text-to-image diffusion models, various practical applications have emerged, bringing significant convenience to society. However, model developers may misuse the unauthorized data to train diffusion models. These data are at risk of being memorized by the models, thus potentially violating citizens' privacy rights. Therefore, in order to judge whether a specific image is utilized as a member of a model's training set, Membership Inference Attack (MIA) is proposed to serve as a tool for privacy protection. Current MIA methods predominantly utilize pixel-wise comparisons as distinguishing clues, considering the pixel-level memorization characteristic of diffusion models. However, it is practically impossible for text-to-image models to memorize all the pixel-level information in massive training sets. Therefore, we move to the more advanced structure-level memorization. Observations on the diffusion process show that the structures of members are better preserved compared to those of nonmembers, indicating that diffusion models possess the capability to remember the structures of member images from training sets. Drawing on these insights, we propose a simple yet effective MIA method tailored for text-to-image diffusion models. Extensive experimental results validate the efficacy of our approach. Compared to current pixel-level baselines, our approach not only achieves state-of-the-art performance but also demonstrates remarkable robustness against various distortions.
- Published
- 2024
32. Transition magnetic moment about neutrinos
- Author
-
Ruan, Long, Zhao, Shu-Min, Liu, Ming-Yue, Han, Xing-Yu, Wang, Xi, and Dong, Xing-Xing
- Subjects
High Energy Physics - Phenomenology - Abstract
This paper investigates the neutrino transition magnetic moment in the $U(1)_X$SSM. $U(1)_X$SSM is the $U(1)$ extension of Minimal Supersymmetric Standard Model (MSSM) and its local gauge group is extended to $SU(3)_C\times SU(2)_L \times U(1)_Y\times U(1)_X$. To obtain this model, three singlet new Higgs superfields and right-handed neutrinos are added to the MSSM, which can explain the results of neutrino oscillation experiments. The neutrino transition magnetic moment is induced by electroweak radiative corrections. By applying effective Lagrangian method and on-shell scheme, we study the associated Feynman diagrams and the transition magnetic moment of neutrinos in the model. We fit experimental data for neutrino mass variances and mixing angle. Based on the range of data selection, the influences of different sensitive parameters on the results are analysed. The numerical analysis shows that many parameters have an effect on the neutrino transition moment, such as $g_X$, $M_2$, $\lambda_H$ and $g_{YX}$. For our numerical results, the order of magnitude of $\mu_{ij}^M/\mu_B$ is around $10^{-20}$ $\sim$ $10^{-19}$.
- Published
- 2024
33. E.T. the Exceptional Trajectories: Text-to-camera-trajectory generation with character awareness
- Author
-
Courant, Robin, Dufour, Nicolas, Wang, Xi, Christie, Marc, and Kalogeiton, Vicky
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Stories and emotions in movies emerge through the effect of well-thought-out directing decisions, in particular camera placement and movement over time. Crafting compelling camera trajectories remains a complex iterative process, even for skilful artists. To tackle this, in this paper, we propose a dataset called the Exceptional Trajectories (E.T.) with camera trajectories along with character information and textual captions encompassing descriptions of both camera and character. To our knowledge, this is the first dataset of its kind. To show the potential applications of the E.T. dataset, we propose a diffusion-based approach, named DIRECTOR, which generates complex camera trajectories from textual captions that describe the relation and synchronisation between the camera and characters. To ensure robust and accurate evaluations, we train on the E.T. dataset CLaTr, a Contrastive Language-Trajectory embedding for evaluation metrics. We posit that our proposed dataset and method significantly advance the democratization of cinematography, making it more accessible to common users., Comment: ECCV 2024. Project page: https://www.lix.polytechnique.fr/vista/projects/2024_et_courant/
- Published
- 2024
34. EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting
- Author
-
Zhang, Daiwei, Li, Gengyan, Li, Jiajie, Bressieux, Mickaël, Hilliges, Otmar, Pollefeys, Marc, Van Gool, Luc, and Wang, Xi
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Human activities are inherently complex, often involving numerous object interactions. To better understand these activities, it is crucial to model their interactions with the environment captured through dynamic changes. The recent availability of affordable head-mounted cameras and egocentric data offers a more accessible and efficient means to understand human-object interactions in 3D environments. However, most existing methods for human activity modeling neglect the dynamic interactions with objects, resulting in only static representations. The few existing solutions often require inputs from multiple sources, including multi-camera setups, depth-sensing cameras, or kinesthetic sensors. To this end, we introduce EgoGaussian, the first method capable of simultaneously reconstructing 3D scenes and dynamically tracking 3D object motion from RGB egocentric input alone. We leverage the uniquely discrete nature of Gaussian Splatting and segment dynamic interactions from the background, with both having explicit representations. Our approach employs a clip-level online learning pipeline that leverages the dynamic nature of human activities, allowing us to reconstruct the temporal evolution of the scene in chronological order and track rigid object motion. EgoGaussian shows significant improvements in terms of both dynamic object and background reconstruction quality compared to the state-of-the-art. We also qualitatively demonstrate the high quality of the reconstructed models.
- Published
- 2024
35. Towards Synchronous Memorizability and Generalizability with Site-Modulated Diffusion Replay for Cross-Site Continual Segmentation
- Author
-
Xu, Dunyuan, Wang, Xi, Zhang, Jingyang, and Heng, Pheng-Ann
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The ability to learn sequentially from different data sites is crucial for a deep network in solving practical medical image diagnosis problems due to privacy restrictions and storage limitations. However, adapting on incoming site leads to catastrophic forgetting on past sites and decreases generalizablity on unseen sites. Existing Continual Learning (CL) and Domain Generalization (DG) methods have been proposed to solve these two challenges respectively, but none of them can address both simultaneously. Recognizing this limitation, this paper proposes a novel training paradigm, learning towards Synchronous Memorizability and Generalizability (SMG-Learning). To achieve this, we create the orientational gradient alignment to ensure memorizability on previous sites, and arbitrary gradient alignment to enhance generalizability on unseen sites. This approach is named as Parallel Gradient Alignment (PGA). Furthermore, we approximate the PGA as dual meta-objectives using the first-order Taylor expansion to reduce computational cost of aligning gradients. Considering that performing gradient alignments, especially for previous sites, is not feasible due to the privacy constraints, we design a Site-Modulated Diffusion (SMD) model to generate images with site-specific learnable prompts, replaying images have similar data distributions as previous sites. We evaluate our method on two medical image segmentation tasks, where data from different sites arrive sequentially. Experimental results show that our method efficiently enhances both memorizability and generalizablity better than other state-of-the-art methods, delivering satisfactory performance across all sites. Our code will be available at: https://github.com/dyxu-cuhkcse/SMG-Learning., Comment: This work has been submitted to the IEEE for possible publication
- Published
- 2024
36. SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
- Author
-
Ma, Zeyao, Zhang, Bohan, Zhang, Jing, Yu, Jifan, Zhang, Xiaokang, Zhang, Xiaohan, Luo, Sijia, Wang, Xi, and Tang, Jie
- Subjects
Computer Science - Computation and Language ,Computer Science - Software Engineering - Abstract
We introduce SpreadsheetBench, a challenging spreadsheet manipulation benchmark exclusively derived from real-world scenarios, designed to immerse current large language models (LLMs) in the actual workflow of spreadsheet users. Unlike existing benchmarks that rely on synthesized queries and simplified spreadsheet files, SpreadsheetBench is built from 912 real questions gathered from online Excel forums, which reflect the intricate needs of users. The associated spreadsheets from the forums contain a variety of tabular data such as multiple tables, non-standard relational tables, and abundant non-textual elements. Furthermore, we propose a more reliable evaluation metric akin to online judge platforms, where multiple spreadsheet files are created as test cases for each instruction, ensuring the evaluation of robust solutions capable of handling spreadsheets with varying values. Our comprehensive evaluation of various LLMs under both single-round and multi-round inference settings reveals a substantial gap between the state-of-the-art (SOTA) models and human performance, highlighting the benchmark's difficulty., Comment: Neurips 2024 (Spotlight); Homepage: https://spreadsheetbench.github.io/
- Published
- 2024
37. Using the shadow of a black hole to examine the energy exchange between axion matter and a rotating black hole
- Author
-
Kuang, Xiao-Mei, Meng, Yuan, Papantonopoulos, Eleftherios, and Wang, Xi-Jing
- Subjects
General Relativity and Quantum Cosmology - Abstract
We find that a \textit{slowly} rotating axion-modified black hole resulting from the backreaction of an axion field on a rotating Kerr black hole can have a \textit{D-shaped} shadow as that for a \textit{highly} counter-rotating Kerr black hole. This attributes to the fact that the energy exchange between the axion matter and the black hole influences the rotation of the black hole, so the black hole angular momentum first decreases to zero and then the black hole starts to rotate to the opposite direction. Further increasing the coupling leads to \textit{``human-face-like" shaped} shadows and new lensing due to the chaotic scattering, which are novel and drastically different from Kerr black hole. Our analysis provides the first counterexample to that slowly rotating black hole has nearly circular shadow., Comment: Version published as a Letter in Physical Review D
- Published
- 2024
- Full Text
- View/download PDF
38. Fudan Multi-purpose Active TArget Time Projection Chamber (fMeta-TPC) for Photonnuclear Reaction Experiments
- Author
-
Wu, Huang-Kai, Wang, Xi-Yang, Wang, Yu-Miao, Wang, You-Jing, Fang, De-Qing, He, Wan-Bing, Ma, Wei-Hu, Cao, Xi-Guang, Fu, Chang-Bo, Deng, Xian-Gai, and Ma, Yu-Gang
- Subjects
Physics - Instrumentation and Detectors ,Nuclear Experiment ,Nuclear Theory - Abstract
Active Target Time Projection Chambers (AT-TPCs) are state-of-the-art tools in the field of low-energy nuclear physics, particularly suitable for experiments using low-intensity radioactive ion beams or gamma rays. The Fudan Multi-purpose Active Target Time Projection Chamber (fMeta-TPC) with 2048 channels has been developed to study $\alpha$-clustering nuclei. {\fcb In this work, the focus is on the study of the photonuclear reaction with the Laser Compton Scattering (LCS) gamma source, especially for the decay of the highly excited $\alpha$-cluster state.} The design of fMeta-TPC is described and a comprehensive evaluation of its offline performance is performed by ultraviolet (UV) laser and $^{241}$Am $\alpha$ source. The result shows that the intrinsic angular resolution of the detector is within 0.30$^{\circ}$ and has an energy resolution of 6.85\% for 3.0 MeV $\alpha$ particles. The gain uniformity of the detector is about 10\% (RMS/Mean), tested by the $^{55}$Fe X-ray source., Comment: 10 pages, 12 figures
- Published
- 2024
39. Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding
- Author
-
Ghermi, Ridouane, Wang, Xi, Kalogeiton, Vicky, and Laptev, Ivan
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Recent advances in vision-language models have significantly propelled video understanding. Existing datasets and tasks, however, have notable limitations. Most datasets are confined to short videos with limited events and narrow narratives. For example, datasets with instructional and egocentric videos often document the activities of one person in a single scene. Although some movie datasets offer richer content, they are often limited to short-term tasks, lack publicly available videos and frequently encounter data leakage given the use of movie forums and other resources in LLM training. To address the above limitations, we propose the Short Film Dataset (SFD) with 1,078 publicly available amateur movies, a wide variety of genres and minimal data leakage issues. SFD offers long-term story-oriented video tasks in the form of multiple-choice and open-ended question answering. Our extensive experiments emphasize the need for long-term reasoning to solve SFD tasks. Notably, we find strong signals in movie transcripts leading to the on-par performance of people and LLMs. We also show significantly lower performance of current models compared to people when using vision data alone.
- Published
- 2024
40. Hybrid terahertz emitter for pulse shaping and chirality control
- Author
-
Wu, Weipeng, Acuna, Wilder, Huang, Zhixiang, Wang, Xi, Gundlach, Lars, Doty, Matthew F., Zide, Joshua M. O., and Jungfleisch, M. Benjamin
- Subjects
Physics - Optics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Materials Science ,Physics - Applied Physics - Abstract
Terahertz (THz) radiation, spanning from 0.3 to 3x10^12 Hz, fills the crucial gap between the microwave and infrared spectral range. THz technology has found applications in various fields, from imaging and sensing to telecommunication and biosensing. However, the full potential of these applications is often hindered by the need for precise control and manipulation of the frequency and polarization state, which typically requires external THz modulators. Here, we demonstrate a hybrid THz source that overcomes this limitation. Our device consists of two THz emitters integrated into one single device, enabling pulse shaping and chirality control of the emitted radiation without additional external components. The two sources are a spintronic emitter and a semiconductor photoconductive antenna (PCA). Using a combination of dual-wavelength excitation, allowing for control of the relative time delay between the two laser excitation pulses, and tuning external parameters for each emitter (i.e., biasing voltage for the PCA and magnetic field for the spintronic THz emitter) enables precise control of the mixing of the two signals and results in frequency, polarization, and chirality control of the overall THz radiation. This on-chip hybrid emitter provides an essential platform for engineered THz radiation with wide-ranging potential applications.
- Published
- 2024
41. Time-resolved optical assessment of exciton formation in mixed two-dimensional perovskite films
- Author
-
Zhang, Zheng, Wang, Jianan, Shi, Yijie, Wang, Xi, Wang, Zhong, Zhu, Xiangyu, Hu, Chunlong, Liu, Zonghao, Chen, Wei, and Liang, Wenxi
- Subjects
Physics - Optics ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
We report the observation of exciton formation from the cooled band-edge carriers in mixed two-dimensional hybrid organic-inorganic perovskites using femtosecond transient absorption spectroscopy. By monitoring the changes of bleach signal upon excitations with various photon energy, we are able to extract the values of exciton binding energy and the occupancies of carriers of free and bound states for each two-dimensional phase. We also confirm the existence of Mahan exciton when injected carrier density is above the Mott criterion., Comment: Main text: 15 pages, 4 figures. Supplementary Information: 16 pages, 16 figures, 10 tables
- Published
- 2024
42. Distributed Riemannian Stochastic Gradient Tracking Algorithm on the Stiefel Manifold
- Author
-
Zhao, Jishu, Wang, Xi, and Lei, Jinlong
- Subjects
Mathematics - Optimization and Control - Abstract
This paper focus on investigating the distributed Riemannian stochastic optimization problem on the Stiefel manifold for multi-agent systems, where all the agents work collaboratively to optimize a function modeled by the average of their expectation-valued local costs. Each agent only processes its own local cost function and communicate with neighboring agents to achieve optimal results while ensuring consensus. Since the local Riemannian gradient in stochastic regimes cannot be directly calculated, we will estimate the gradient by the average of a variable number of sampled gradient, which however brings about noise to the system. We then propose a distributed Riemannian stochastic optimization algorithm on the Stiefel manifold by combining the variable sample size gradient approximation method with the gradient tracking dynamic. It is worth noticing that the suitably chosen increasing sample size plays an important role in improving the algorithm efficiency, as it reduces the noise variance. In an expectation-valued sense, the iterates of all agents are proved to converge to a stationary point (or neighborhood) with fixed step sizes. We further establish the convergence rate of the iterates for the cases when the sample size is exponentially increasing, polynomial increasing, or a constant, respectively. Finally, numerical experiments are implemented to demonstrate the theoretical results.
- Published
- 2024
43. JUNO Sensitivity to Invisible Decay Modes of Neutrons
- Author
-
JUNO Collaboration, Abusleme, Angel, Adam, Thomas, Adamowicz, Kai, Ahmad, Shakeel, Ahmed, Rizwan, Aiello, Sebastiano, An, Fengpeng, An, Qi, Andronico, Giuseppe, Anfimov, Nikolay, Antonelli, Vito, Antoshkina, Tatiana, de André, João Pedro Athayde Marcondes, Auguste, Didier, Bai, Weidong, Balashov, Nikita, Baldini, Wander, Barresi, Andrea, Basilico, Davide, Baussan, Eric, Bellato, Marco, Beretta, Marco, Bergnoli, Antonio, Bick, Daniel, Bieger, Lukas, Biktemerova, Svetlana, Birkenfeld, Thilo, Blake, Iwan, Blyth, Simon, Bolshakova, Anastasia, Bongrand, Mathieu, Breton, Dominique, Brigatti, Augusto, Brugnera, Riccardo, Bruno, Riccardo, Budano, Antonio, Busto, Jose, Cabrera, Anatael, Caccianiga, Barbara, Cai, Hao, Cai, Xiao, Cai, Yanke, Cai, Zhiyan, Callier, Stéphane, Calvez, Steven, Cammi, Antonio, Campeny, Agustin, Cao, Chuanya, Cao, Guofu, Cao, Jun, Caruso, Rossella, Cerna, Cédric, Cerrone, Vanessa, Chang, Jinfan, Chang, Yun, Chatrabhuti, Auttakit, Chen, Chao, Chen, Guoming, Chen, Pingping, Chen, Shaomin, Chen, Xin, Chen, Yiming, Chen, Yixue, Chen, Yu, Chen, Zelin, Chen, Zhangming, Chen, Zhiyuan, Chen, Zikang, Cheng, Jie, Cheng, Yaping, Cheng, Yu Chin, Chepurnov, Alexander, Chetverikov, Alexey, Chiesa, Davide, Chimenti, Pietro, Chin, Yen-Ting, Chou, Po-Lin, Chu, Ziliang, Chukanov, Artem, Claverie, Gérard, Clementi, Catia, Clerbaux, Barbara, Molla, Marta Colomer, Di Lorenzo, Selma Conforti, Coppi, Alberto, Corti, Daniele, Csakli, Simon, Cui, Chenyang, Corso, Flavio Dal, Dalager, Olivia, Datta, Jaydeep, De La Taille, Christophe, Deng, Zhi, Deng, Ziyan, Ding, Xiaoyu, Ding, Xuefeng, Ding, Yayun, Dirgantara, Bayu, Dittrich, Carsten, Dmitrievsky, Sergey, Dohnal, Tadeas, Dolzhikov, Dmitry, Donchenko, Georgy, Dong, Jianmeng, Doroshkevich, Evgeny, Dou, Wei, Dracos, Marcos, Druillole, Frédéric, Du, Ran, Du, Shuxian, Duan, Yujie, Dugas, Katherine, Dusini, Stefano, Duyang, Hongyue, Eck, Jessica, Enqvist, Timo, Fabbri, Andrea, Fahrendholz, Ulrike, Fan, Lei, Fang, Jian, Fang, Wenxing, Fedoseev, Dmitry, Feng, Li-Cheng, Feng, Qichun, Ferraro, Federico, Fournier, Amélie, Fritsch, Fritsch, Gan, Haonan, Gao, Feng, Garfagnini, Alberto, Gavrikov, Arsenii, Giammarchi, Marco, Giudice, Nunzio, Gonchar, Maxim, Gong, Guanghua, Gong, Hui, Gornushkin, Yuri, Grassi, Marco, Gromov, Maxim, Gromov, Vasily, Gu, Minghao, Gu, Xiaofei, Gu, Yu, Guan, Mengyun, Guan, Yuduo, Guardone, Nunzio, Guizzetti, Rosa Maria, Guo, Cong, Guo, Wanlei, Hagner, Caren, Han, Hechong, Han, Ran, Han, Yang, He, Jinhong, He, Miao, He, Wei, He, Xinhai, Heinz, Tobias, Hellmuth, Patrick, Heng, Yuekun, Herrera, Rafael, Hor, YuenKeung, Hou, Shaojing, Hsiung, Yee, Hu, Bei-Zhen, Hu, Hang, Hu, Jun, Hu, Peng, Hu, Shouyang, Hu, Tao, Hu, Yuxiang, Hu, Zhuojun, Huang, Guihong, Huang, Hanxiong, Huang, Jinhao, Huang, Junting, Huang, Kaixuan, Huang, Shengheng, Huang, Wenhao, Huang, Xin, Huang, Xingtao, Huang, Yongbo, Hui, Jiaqi, Huo, Lei, Huo, Wenju, Huss, Cédric, Hussain, Safeer, Imbert, Leonard, Ioannisian, Ara, Isocrate, Roberto, Jafar, Arshak, Jelmini, Beatrice, Jeria, Ignacio, Ji, Xiaolu, Jia, Huihui, Jia, Junji, Jian, Siyu, Jiang, Cailian, Jiang, Di, Jiang, Guangzheng, Jiang, Wei, Jiang, Xiaoshan, Jiang, Xiaozhao, Jiang, Yixuan, Jing, Xiaoping, Jollet, Cécile, Kang, Li, Karaparabil, Rebin, Kazarian, Narine, Khan, Ali, Khatun, Amina, Khosonthongkee, Khanchai, Korablev, Denis, Kouzakov, Konstantin, Krasnoperov, Alexey, Kuleshov, Sergey, Kumaran, Sindhujha, Kutovskiy, Nikolay, Labit, Loïc, Lachenmaier, Tobias, Lai, Haojing, Landini, Cecilia, Leblanc, Sébastien, Lefevre, Frederic, Lei, Ruiting, Leitner, Rupert, Leung, Jason, Li, Demin, Li, Fei, Li, Fule, Li, Gaosong, Li, Hongjian, Li, Huang, Li, Jiajun, Li, Min, Li, Nan, Li, Qingjiang, Li, Ruhui, Li, Rui, Li, Shanfeng, Li, Shuo, Li, Tao, Li, Teng, Li, Weidong, Li, Weiguo, Li, Xiaomei, Li, Xiaonan, Li, Xinglong, Li, Yi, Li, Yichen, Li, Yufeng, Li, Zhaohan, Li, Zhibing, Li, Ziyuan, Li, Zonghai, Liang, An-An, Liang, Hao, Liao, Jiajun, Liao, Yilin, Liao, Yuzhong, Limphirat, Ayut, Lin, Guey-Lin, Lin, Shengxin, Lin, Tao, Ling, Jiajie, Ling, Xin, Lippi, Ivano, Liu, Caimei, Liu, Fang, Liu, Fengcheng, Liu, Haidong, Liu, Haotian, Liu, Hongbang, Liu, Hongjuan, Liu, Hongtao, Liu, Hongyang, Liu, Jianglai, Liu, Jiaxi, Liu, Jinchang, Liu, Min, Liu, Qian, Liu, Qin, Liu, Runxuan, Liu, Shenghui, Liu, Shubin, Liu, Shulin, Liu, Xiaowei, Liu, Xiwen, Liu, Xuewei, Liu, Yankai, Liu, Zhen, Loi, Lorenzo, Lokhov, Alexey, Lombardi, Paolo, Lombardo, Claudio, Loo, Kai, Lu, Chuan, Lu, Haoqi, Lu, Jingbin, Lu, Junguang, Lu, Meishu, Lu, Peizhi, Lu, Shuxiang, Lu, Xianguo, Lubsandorzhiev, Bayarto, Lubsandorzhiev, Sultim, Ludhova, Livia, Lukanov, Arslan, Luo, Fengjiao, Luo, Guang, Luo, Jianyi, Luo, Shu, Luo, Wuming, Luo, Xiaojie, Lyashuk, Vladimir, Ma, Bangzheng, Ma, Bing, Ma, Qiumei, Ma, Si, Ma, Xiaoyan, Ma, Xubo, Maalmi, Jihane, Mai, Jingyu, Malabarba, Marco, Malyshkin, Yury, Mandujano, Roberto Carlos, Mantovani, Fabio, Mao, Xin, Mao, Yajun, Mari, Stefano M., Marini, Filippo, Martini, Agnese, Mayer, Matthias, Mayilyan, Davit, Mednieks, Ints, Meng, Yue, Meraviglia, Anita, Meregaglia, Anselmo, Meroni, Emanuela, Miramonti, Lino, Mohan, Nikhil, Montuschi, Michele, Reveco, Cristobal Morales, Nastasi, Massimiliano, Naumov, Dmitry V., Naumova, Elena, Navas-Nicolas, Diana, Nemchenok, Igor, Thi, Minh Thuan Nguyen, Nikolaev, Alexey, Ning, Feipeng, Ning, Zhe, Nunokawa, Hiroshi, Oberauer, Lothar, Ochoa-Ricoux, Juan Pedro, Olshevskiy, Alexander, Orestano, Domizia, Ortica, Fausto, Othegraven, Rainer, Paoloni, Alessandro, Parker, George, Parmeggiano, Sergio, Patsias, Achilleas, Pei, Yatian, Pelicci, Luca, Peng, Anguo, Peng, Haiping, Peng, Yu, Peng, Zhaoyuan, Percalli, Elisa, Perrin, Willy, Perrot, Frédéric, Petitjean, Pierre-Alexandre, Petrucci, Fabrizio, Pilarczyk, Oliver, Rico, Luis Felipe Piñeres, Popov, Artyom, Poussot, Pascal, Previtali, Ezio, Qi, Fazhi, Qi, Ming, Qi, Xiaohui, Qian, Sen, Qian, Xiaohui, Qian, Zhen, Qiao, Hao, Qin, Zhonghua, Qiu, Shoukang, Qu, Manhao, Qu, Zhenning, Ranucci, Gioacchino, Re, Alessandra, Rebii, Abdel, Redchuk, Mariia, Reina, Gioele, Ren, Bin, Ren, Jie, Ren, Yuhan, Ricci, Barbara, Rientong, Komkrit, Rifai, Mariam, Roche, Mathieu, Rodphai, Narongkiat, Romani, Aldo, Roskovec, Bedřich, Ruan, Xichao, Rybnikov, Arseniy, Sadovsky, Andrey, Saggese, Paolo, Sandanayake, Deshan, Sangka, Anut, Sava, Giuseppe, Sawangwit, Utane, Schever, Michaela, Schwab, Cédric, Schweizer, Konstantin, Selyunin, Alexandr, Serafini, Andrea, Settimo, Mariangela, Shao, Junyu, Sharov, Vladislav, Shi, Hexi, Shi, Jingyan, Shi, Yanan, Shutov, Vitaly, Sidorenkov, Andrey, Šimkovic, Fedor, Singhal, Apeksha, Sirignano, Chiara, Siripak, Jaruchit, Sisti, Monica, Smirnov, Mikhail, Smirnov, Oleg, Sokolov, Sergey, Songwadhana, Julanan, Soonthornthum, Boonrucksar, Sotnikov, Albert, Sreethawong, Warintorn, Stahl, Achim, Stanco, Luca, Stankevich, Konstantin, Steiger, Hans, Steinmann, Jochen, Sterr, Tobias, Stock, Matthias Raphael, Strati, Virginia, Strizh, Michail, Studenikin, Alexander, Su, Aoqi, Su, Jun, Sun, Guangbao, Sun, Shifeng, Sun, Xilei, Sun, Yongjie, Sun, Yongzhao, Sun, Zhengyang, Suwonjandee, Narumon, Takenaka, Akira, Tan, Xiaohan, Tang, Jian, Tang, Jingzhe, Tang, Qiang, Tang, Quan, Tang, Xiao, Hariharan, Vidhya Thara, Tkachev, Igor, Tmej, Tomas, Torri, Marco Danilo Claudio, Triossi, Andrea, Trzaska, Wladyslaw, Tung, Yu-Chen, Tuve, Cristina, Ushakov, Nikita, Vedin, Vadim, Venettacci, Carlo, Verde, Giuseppe, Vialkov, Maxim, Viaud, Benoit, Vollbrecht, Cornelius Moritz, von Sturm, Katharina, Vorobel, Vit, Voronin, Dmitriy, Votano, Lucia, Walker, Pablo, Wang, Caishen, Wang, Chung-Hsiang, Wang, En, Wang, Guoli, Wang, Hanwen, Wang, Jian, Wang, Jun, Wang, Li, Wang, Lu, Wang, Meng, Wang, Mingyuan, Wang, Qianchuan, Wang, Ruiguang, Wang, Sibo, Wang, Siguang, Wang, Wei, Wang, Wenshuai, Wang, Xi, Wang, Xiangyue, Wang, Yangfu, Wang, Yaoguang, Wang, Yi, Wang, Yifang, Wang, Yuanqing, Wang, Yuyi, Wang, Zhe, Wang, Zheng, Wang, Zhimin, Watcharangkool, Apimook, Wei, Wei, Wei, Wenlu, Wei, Yadong, Wei, Yuehuan, Wen, Liangjian, Weng, Jun, Wiebusch, Christopher, Wirth, Rosmarie, Wu, Chengxin, Wu, Diru, Wu, Qun, Wu, Yinhui, Wu, Yiyang, Wu, Zhi, Wurm, Michael, Wurtz, Jacques, Wysotzki, Christian, Xi, Yufei, Xia, Dongmei, Xian, Shishen, Xiang, Ziqian, Xiao, Fei, Xiao, Xiang, Xie, Xiaochuan, Xie, Yijun, Xie, Yuguang, Xin, Zhao, Xing, Zhizhong, Xu, Benda, Xu, Cheng, Xu, Donglian, Xu, Fanrong, Xu, Hangkun, Xu, Jiayang, Xu, Jilei, Xu, Jing, Xu, Jinghuan, Xu, Meihang, Xu, Xunjie, Xu, Yin, Xu, Yu, Yan, Baojun, Yan, Qiyu, Yan, Taylor, Yan, Xiongbo, Yan, Yupeng, Yang, Changgen, Yang, Chengfeng, Yang, Fengfan, Yang, Jie, Yang, Lei, Yang, Pengfei, Yang, Xiaoyu, Yang, Yifan, Yang, Yixiang, Yang, Zekun, Yao, Haifeng, Ye, Jiaxuan, Ye, Mei, Ye, Ziping, Yermia, Frédéric, You, Zhengyun, Yu, Boxiang, Yu, Chiye, Yu, Chunxu, Yu, Guojun, Yu, Hongzhao, Yu, Miao, Yu, Xianghui, Yu, Zeyuan, Yu, Zezhong, Yuan, Cenxi, Yuan, Chengzhuo, Yuan, Ying, Yuan, Zhenxiong, Yue, Baobiao, Zafar, Noman, Zamogilnyi, Kirill, Zavadskyi, Vitalii, Zeng, Fanrui, Zeng, Shan, Zeng, Tingxuan, Zeng, Yuda, Zhan, Liang, Zhang, Aiqiang, Zhang, Bin, Zhang, Binting, Zhang, Feiyang, Zhang, Hangchang, Zhang, Haosen, Zhang, Honghao, Zhang, Jialiang, Zhang, Jiawen, Zhang, Jie, Zhang, Jingbo, Zhang, Jinnan, Zhang, Junwei, Zhang, Lei, Zhang, Peng, Zhang, Ping, Zhang, Qingmin, Zhang, Shiqi, Zhang, Shu, Zhang, Shuihan, Zhang, Siyuan, Zhang, Tao, Zhang, Xiaomei, Zhang, Xin, Zhang, Xuantong, Zhang, Yibing, Zhang, Yinhong, Zhang, Yiyu, Zhang, Yongpeng, Zhang, Yu, Zhang, Yuanyuan, Zhang, Yumei, Zhang, Zhenyu, Zhang, Zhijian, Zhao, Jie, Zhao, Rong, Zhao, Runze, Zhao, Shujun, Zhao, Tianhao, Zheng, Hua, Zheng, Yangheng, Zhou, Jing, Zhou, Li, Zhou, Nan, Zhou, Shun, Zhou, Tong, Zhou, Xiang, Zhou, Xing, Zhu, Jingsen, Zhu, Kangfu, Zhu, Kejun, Zhu, Zhihang, Zhuang, Bo, Zhuang, Honglin, Zong, Liang, and Zou, Jiaheng
- Subjects
High Energy Physics - Experiment ,High Energy Physics - Phenomenology - Abstract
We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 \nu$ or $nn \rightarrow 2 \nu$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation modes of the excited residual nuclei can produce a time- and space-correlated triple coincidence signal in the JUNO detector. Based on a full Monte Carlo simulation informed with the latest available data, we estimate all backgrounds, including inverse beta decay events of the reactor antineutrino $\bar{\nu}_e$, natural radioactivity, cosmogenic isotopes and neutral current interactions of atmospheric neutrinos. Pulse shape discrimination and multivariate analysis techniques are employed to further suppress backgrounds. With two years of exposure, JUNO is expected to give an order of magnitude improvement compared to the current best limits. After 10 years of data taking, the JUNO expected sensitivities at a 90% confidence level are $\tau/B( n \rightarrow { inv} ) > 5.0 \times 10^{31} \, {\rm yr}$ and $\tau/B( nn \rightarrow { inv} ) > 1.4 \times 10^{32} \, {\rm yr}$., Comment: 28 pages, 7 figures, 4 tables
- Published
- 2024
44. The Higgs boson decay $h \rightarrow bs$ in the $U(1)_X$SSM
- Author
-
Gao, Song, Zhao, Shu-Min, Liu, Ming-Yue, Han, Xing-Yu, Wang, Xi, and Feng, Tai-Fu
- Subjects
High Energy Physics - Phenomenology - Abstract
In the $U(1)_X$SSM, we delve into the flavor violation of $h \rightarrow bs$, where $h$ is identified with the SM-like Higgs boson discovered at the LHC. As the U(1) extension of the minimal supersymmetric standard model (MSSM), the U(1)XSSM has new super fields such as right-handed neutrinos and three Higgs singlets. We conduct a thorough analysis of the underlying mechanisms and parameter dependencies of $h \rightarrow bs$ in the $U(1)_X$SSM. In the $U(1)_X$SSM, we discover that the Br$(h \rightarrow bs)$ for the Higgs decay to $bs$ could significantly differ from the expectation in the standard model (SM), depending on the values of the new parameters introduced in the model. Our research not only contributes to a deeper understanding of Higgs physics within the $U(1)_X$SSM, but also provides valuable guidance for new physics (NP)., Comment: 24 pages,8 figures
- Published
- 2024
- Full Text
- View/download PDF
45. Study some two loop contribution to muon MDM in the N-B-LSSM
- Author
-
Han, Xing-Yu, Ma, Jiao, Ruan, Long, Wang, Xi, Dong, Xing-Xing, and Zhao, Shu-Min
- Subjects
High Energy Physics - Phenomenology - Abstract
It is well known that the muon magnetic dipole moment (MDM) has close relation with the new physics (NP) in the development of the Standard Model (SM). Combined with the Fermilab National Accelerator Laboratory (FNAL) and the Brookhaven National Laboratory (BNL) E821 result, the departure from the SM prediction is about 5.0 $\sigma$. We study the electroweak corrections from several type two-loop SUSY diagrams and the virtual SUSY particles include chargino, neutralino, scalar lepton and scalar neutrino. Based on the latest experimental constraints, we study the muon MDM under the next to the minimal supersymmetric extension of the SM with local B-L gauge symmetry (N-B-LSSM). The abundant numerical results verify that $\tan{\beta},~T_e,~M^2_L,~M^2_e,~M_{BB'}$ play an important role in muon MDM. $M^2_e,~\tan{\beta}$ and $T_e$ are sensitive parameters to muon MDM. From the data obtained in all the figures of the numerical results, most of the values of $a_{\mu}^{NBL}$ are in 2$\sigma$ interval, which can compensate the departure between the experiment data and the SM prediction.
- Published
- 2024
46. Exploring and Evaluating Real-world CXL: Use Cases and System Adoption
- Author
-
Liu, Jie, Wang, Xi, Wu, Jianbo, Yang, Shuangyan, Ren, Jie, Shankar, Bhanu, and Li, Dong
- Subjects
Computer Science - Performance ,Computer Science - Hardware Architecture - Abstract
Compute eXpress Link (CXL) is emerging as a promising memory interface technology. Because of the common unavailiability of CXL devices, the performance of the CXL memory is largely unknown. What are the use cases for the CXL memory? What are the impacts of the CXL memory on application performance? How to use the CXL memory in combination with existing memory components? In this work, we study the performance of three genuine CXL memory-expansion cards from different vendors. We characterize the basic performance of the CXL memory, study how HPC applications and large language models can benefit from the CXL memory, and study the interplay between memory tiering and page interleaving. We also propose a novel data object-level interleaving policy to match the interleaving policy with memory access patterns. We reveal the challenges and opportunities of using the CXL memory.
- Published
- 2024
47. How to set AdamW's weight decay as you scale model and dataset size
- Author
-
Wang, Xi and Aitchison, Laurence
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
We show that weights learned by AdamW can be understood as an exponential moving average (EMA) of recent updates. This gives critical insights for how to set the weight decay in AdamW, and how the weight decay should scale with model and dataset size. In particular, the key hyperparameter for an exponential moving average is the EMA timescale. Intuitively, the EMA timescale can be understood as the number of recent iterations the EMA averages over. Given a fixed learning rate, there is a one-to-one mapping from the EMA timescale to the usual weight decay hyperparameter. Thus, choosing an EMA timescale implicitly sets the weight decay. Importantly, there are natural guidelines for sensible values for the EMA timescale: we need to average over all datapoints, so the EMA timescale should not be (much) smaller than 1 epoch, and we need to forget early updates, so the EMA timescale should not be (much) bigger than the total number of training epochs. In our experiments, we find that optimal EMA timescales are consistent with these guidelines, as are the hyperparameters chosen in recent large-scale LLM pretraining runs (e.g.\ Llama 1+2 and Stable LM). Critically, these guidelines suggest that the optimal EMA timescale should not change (much) as we scale the model and dataset. That implies that as the dataset size increases, the optimal weight decay should fall. Moreover, as the model size increases, the optimal weight decay should also increase (if we follow the muP recommendation for scaling the learning rate).
- Published
- 2024
48. MEIC: Re-thinking RTL Debug Automation using LLMs
- Author
-
Xu, Ke, Sun, Jialin, Hu, Yuchen, Fang, Xinwei, Shan, Weiwei, Wang, Xi, and Jiang, Zhe
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Software Engineering - Abstract
The deployment of Large Language Models (LLMs) for code debugging (e.g., C and Python) is widespread, benefiting from their ability to understand and interpret intricate concepts. However, in the semiconductor industry, utilising LLMs to debug Register Transfer Level (RTL) code is still insufficient, largely due to the underrepresentation of RTL-specific data in training sets. This work introduces a novel framework, Make Each Iteration Count (MEIC), which contrasts with traditional one-shot LLM-based debugging methods that heavily rely on prompt engineering, model tuning, and model training. MEIC utilises LLMs in an iterative process to overcome the limitation of LLMs in RTL code debugging, which is suitable for identifying and correcting both syntax and function errors, while effectively managing the uncertainties inherent in LLM operations. To evaluate our framework, we provide an open-source dataset comprising 178 common RTL programming errors. The experimental results demonstrate that the proposed debugging framework achieves fix rate of 93% for syntax errors and 78% for function errors, with up to 48x speedup in debugging processes when compared with experienced engineers. The Repo. of dataset and code: https://anonymous.4open.science/r/Verilog-Auto-Debug-6E7F/.
- Published
- 2024
49. Room temperature Si:S barrier infrared detector with broadband response up to 4.4{\mu}m
- Author
-
Zhu, He, Xiao, Yunlong, Yu, Zhongyang, Zhu, Jiaqi, Li, Qing, Ye, Zhenyu, Wang, Xi, Liu, Changlong, Pan, Changyu, Shan, Yufeng, Duan, Junli, Wu, Huizhen, Hu, Weida, and Dai, Ning
- Subjects
Physics - Applied Physics - Abstract
Mid-infrared spectrum is a critical tool for chemical analysis, industrial inspection, environment, and other fields due to its rich chemical bond information. However, the complicated growth or fabrication procedures of existing mid-infrared sensitive materials hinder the large-scale production and utilization of mid-infrared detectors. To address this issue, we developed Si:S barrier detectors employing sulfur doped silicon and a sophisticated band barrier design. Since the transport of dark current and photo current is separated, the barrier design effectively suppresses the dark current while allowing the photo current to leverage gain mechanisms, thereby substantially improving signal-to-noise ratio. As a result, the detector exhibits an infrared response range covering from 1.12 to 4.4{\mu}m with a peak at 3.3{\mu}m, excluding its intrinsic response in visible range. Its peak quantum efficiency surpasses that of the best mid-infrared silicon-based detector reported to date by an order of magnitude, reaching 2% at room temperature. The peak detectivity at 90K is 1.4E11 Jones @1.4V and decreases to 4.4E9 Jones @1.4V, 210K, comparable to the typical III-V and IV-VI photodetectors at one thousandth fabrication cost. Leveraging the well-established silicon-based manufacturing process, this device holds promise for large-scale production at a reduced price, offering a cost-effective solution for future mid-infrared detection.
- Published
- 2024
50. Explicit Correlation Learning for Generalizable Cross-Modal Deepfake Detection
- Author
-
Yu, Cai, Jia, Shan, Fu, Xiaomeng, Liu, Jin, Tian, Jiahe, Dai, Jiao, Wang, Xi, Lyu, Siwei, and Han, Jizhong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
With the rising prevalence of deepfakes, there is a growing interest in developing generalizable detection methods for various types of deepfakes. While effective in their specific modalities, traditional detection methods fall short in addressing the generalizability of detection across diverse cross-modal deepfakes. This paper aims to explicitly learn potential cross-modal correlation to enhance deepfake detection towards various generation scenarios. Our approach introduces a correlation distillation task, which models the inherent cross-modal correlation based on content information. This strategy helps to prevent the model from overfitting merely to audio-visual synchronization. Additionally, we present the Cross-Modal Deepfake Dataset (CMDFD), a comprehensive dataset with four generation methods to evaluate the detection of diverse cross-modal deepfakes. The experimental results on CMDFD and FakeAVCeleb datasets demonstrate the superior generalizability of our method over existing state-of-the-art methods. Our code and data can be found at \url{https://github.com/ljj898/CMDFD-Dataset-and-Deepfake-Detection}., Comment: accepted by ICME 2024
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.