1,572 results on '"Li Zixuan"'
Search Results
2. Establishment and stability assessment of mouse cervical heterotopic heart transplantation model with 'Anchoring Node'
- Author
-
LI Zixuan, FANG Yibing, and CHENG Wei
- Subjects
heart transplantation ,cuff technique ,mice ,model ,Medicine (General) ,R5-920 - Abstract
Objective To optimize the operational steps and processes in mouse cervical heterotopic heart transplantation by modifying the conventional cuff technique for vascular anastomosis and consequently establish a more stable cervical heterotopic heart transplantation model in mice. Methods C57BL/6 male mice (6~8 weeks old, weighing 20~24 g) were categorized into control (conventional cuff technique) and experimental groups (our "Anchoring Node" technique). Time for each surgical step, frequencies of vascular everting and vascular trimming, and the reasons for failure were recorded and compared between the 2 groups. Postoperative survival of heart allograft was determined by daily observation and touching, and the mice with the survival time >48 h were considered as successful model establishment. On the 7th and 14th days after surgery, HE staining was used to observe the pathologic changes in the vascular tissues at anastomosis. The expression of troponin T (cTnT) in the heart on the 7th day was detected with immunofluorescence assay. Results ① In the 25 hearts from each group, 2 hearts from the experimental group and 8 from the control group failed, and the survival rate of heart allografts was 92% and 68%, respectively. In the experimental group, arterial and venous everting occurred at an average of 1.16 times, with an average frequency of trimming of 0.04 times, while in the control group, arterial and venous everting was 2.00 and 2.28 times, respectively, with an average frequency of trimming of 0.21 and 0.46 times, respectively. ② Significant differences were observed in the overall duration for cervical heterotopic heart transplantation (77.22±3.82 vs 87.49±8.01 min), vascular separation plus cannulation (30.06±2.68 vs 36.50±6.67 min), and cervical anastomosis (7.31±1.08 vs 12.34±2.58 min) between the experimental and control groups (all P < 0.05). ③HE staining displayed that vascular patency was observed in the experimental group on the 7th and 14th days after surgery. ④cTnT staining indicated no massive myocardial necrosis was seen in both groups. Conclusion Based on conventional cuff technique for mouse cervical heterotopic heart transplantation, our modified "Anchoring Node" technique ensures the stability and efficiency of one-man microscopic operation with controllable quality, with the advantages of longer postoperative survival of heart allograft, high patency rate of anastomotic vessels, good cardiac function, and fewer postoperative complications.
- Published
- 2024
- Full Text
- View/download PDF
3. Soil Erosion and Its Spatial Distribution Characteristics in Three-River-Source National Park
- Author
-
Huang Tingting, Zhao Hui, Zhao Yuan, Ren Jingyu, Li Zixuan, and Li Binbin
- Subjects
soil erosion ,elevation ,slope ,vegetation coverage ,spatial distribution ,three-river-source national park ,Environmental sciences ,GE1-350 ,General. Including nature conservation, geographical distribution ,QH1-199.5 - Abstract
[Objective] The Three-River-Source National Park (TRSNP) is considered to be the “water tower of China”, and is an important ecological security barrier in China. The soil erosion distribution law of TRSNP was studied to provide a basis for implementing ecological protection policy, soil and water conservation, and ecological civilization construction in TRSNP. [Methods] Based on the Chinese soil loss equation (CSLE), wind erosion model and freeze-thaw erosion intensity model, the soil erosion status and its distribution characteristics at different space and surface of TRSNP were analyzed by superposition analysis. [Results] In 2020, an area of 2.64×104 km2 suffered from soil erosion in TRSNP. Among the three sub-parks, the Yellow-River-Source Park exhibited the most extensive soil erosion, whereas the Yangtze-River-Source Park was subject to severe erosion comparatively. Soil erosion and its spatial distribution varied significantly at different elevations. Water erosion occurred mainly in the area with elevations above 4 900 m, which occupied 70% of the land area. However, 85% of the wind erosion occurred in zones with elevations less than 4 900 m. The wind erosion area with slopes between 0° and 5° accounted for 60%, which is the relatively concentrated distribution area of wind erosion. And three-quarters of water erosion areas were concentrated in regions where the slope ranged from 8° to 25°, all of which require urgent conservation measures. Grassland was the most important land cover in TRSNP, occupying about 80% of the area, with low and medium-low vegetation cover being responsible for significant soil losses. Additionally, sandy land and bare land were prone to high intensity soil erosion, which deserved special attention. [Conclusion] Two-thirds of water erosion areas were primarily located in zones where the elevation was above 4 900 m, slope gradients were between 8° and 35°, and grassland cover was below medium-low cover. Wind erosion was primarily located at elevations ranging from 4 200 m to 4 900 m, slopes were less than 5°, and grassland coverage was below medium-low cover.
- Published
- 2023
- Full Text
- View/download PDF
4. Analysis of late detection of HIV/AIDS cases aged 50 years and above in Jilin Province
- Author
-
HU Yan, FAN Jixiang, QIU Baihong, HUANG Lining, LI Zixuan, LI Na, and WU Dan
- Subjects
aids ,cd4+ t cell count ,late detection ,influencing factor ,Medicine - Abstract
ObjectiveTo investigate the late detection of HIV/AIDS cases in the elderly in Jilin Province and analyze its influencing factors, to provide theoretical basis for improving their life quality.MethodsThe first CD4 values of HIV/AIDS patients aged 50 years and above living in Jilin Province were used to estimate late detection, and the influencing factors of late detection in elderly cases were analyzed.ResultsThe average CD4 cell count of newly reported HIV/AIDS cases aged 50 and above in Jilin Province from 1996 to 2021 was (230.55±191.97), the low value group accounted for the largest proportion (50.8%), and the late detection rate was 59.3% (1397/2325). The late detection cases were mainly from sexual transmission (46.8% for same-sex and 48.2% for heterosexual contact). From the perspective of sample sources, most of the late detection patients were diagnosed while testing for other illnesses, followed by testing and consulting. In terms of contact history, the late detection of cases of men who have sex with men was higher. The binary logistic regression analysis showed that gender, marriage, sample source and report year were the factors affecting the late detection of AIDS. The late detection rate of males was higher, and cases of married couples were more likely to be late detection. With the increase of report year, the late detection rate decreased, and testing and counseling could effectively reduce the late detection rate of AIDS.ConclusionThe CD4 cell count in the first detection of HIV/AIDS in the elderly in Jilin Province is low, and the late detection rate of male cases is high. In recent years, the expansion of voluntary counseling and testing in Jilin Province has effectively reduced the late detection rate of HIV/AIDS. At the same time, sex education should be strengthened for the elderly, healthy marital relationships should be advocated and more attention should be paid to the mental health of the elderly.
- Published
- 2022
- Full Text
- View/download PDF
5. Epidemiological characteristics of COVID-19 patients in Shaanxi Province
- Author
-
LI Zixuan, FU Guotao, and LI Xiaowei
- Subjects
covid-19 ,delta variant ,epidemiology ,distribution of three ,the clustered ,community spread ,Medicine - Abstract
ObjectiveTo describe the epidemiological characteristics of COVID-19 patients in Shaanxi Province from December 2021 to January 2022.MethodsAll COVID-19 patients’ information was obtained from the Health Committee of Shaanxi Province. SPSS 26.0 and Stata MP 16.0 were used to analyze the distribution of Time, Population and Region. Descriptive statistical method was used to investigate the correlation between age, gender and clinical syndrome types of patients.ResultsThe duration of this epidemic was 43 days, and 2 080 confirmed cases in total, which distributed in cities of Xi’an (2 053 cases), Xianyang (13 cases) , Yan’an (13 cases) and Weinan (1 cases). The mean age of the patients was 35.91±17.72 years old, the number of male patients was higher than that of female, and 93.7% of the patients had mild symptoms. The age and gender of the patients were statistically correlated with the symptom type (P
- Published
- 2022
- Full Text
- View/download PDF
6. Distributed control strategy of temperature control loads considering switch life loss
- Author
-
LI Zixuan, BAO Yuqing, SONG Meng, WANG Wei, CHENG Limin, and CHEN Chen
- Subjects
thermostatically controlled loads (tcls) ,demand response ,multi-agent ,distributed control ,thermodynamic modeling ,switch life loss ,Applications of electric power ,TK4001-4102 - Abstract
As important demand-side resources, thermostatically controlled loads (TCLs) have great potential in suppressing fluctuations in renewable energy. However, traditional TCLs control strategy usually does not consider the control cost and the life loss caused by the high switching frequency of TCLs. In order to reduce the switch life loss of TCLs, a distributed cooperative control strategy for TCLs based on multi-agent consistency is proposed in this paper. Firstly, the concept of correcting switching time is introduced, and a control cost model that takes into account the thermodynamic characteristics of TCLs and switch life loss is established. Then, multi-agent consistency control is introduced, and the distributed cooperative control method is adopted to achieve the optimal control cost of TCLs. The results of calculation examples show that the proposed control strategy can reflect the thermodynamic characteristics of TCLs, thus greatly reducing the switch life loss under the premise of achieving the control goal.
- Published
- 2022
- Full Text
- View/download PDF
7. Audio-FLAN: A Preliminary Release
- Author
-
Xue, Liumeng, Zhou, Ziya, Pan, Jiahao, Li, Zixuan, Fan, Shuai, Ma, Yinghao, Cheng, Sitong, Yang, Dongchao, Guo, Haohan, Xiao, Yujia, Wang, Xinsheng, Shen, Zixuan, Zhu, Chuanbo, Zhang, Xinshen, Liu, Tianchi, Yuan, Ruibin, Tian, Zeyue, Liu, Haohe, Benetos, Emmanouil, Zhang, Ge, Guo, Yike, and Xue, Wei
- Subjects
Computer Science - Sound ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Multimedia ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Recent advancements in audio tokenization have significantly enhanced the integration of audio capabilities into large language models (LLMs). However, audio understanding and generation are often treated as distinct tasks, hindering the development of truly unified audio-language models. While instruction tuning has demonstrated remarkable success in improving generalization and zero-shot learning across text and vision, its application to audio remains largely unexplored. A major obstacle is the lack of comprehensive datasets that unify audio understanding and generation. To address this, we introduce Audio-FLAN, a large-scale instruction-tuning dataset covering 80 diverse tasks across speech, music, and sound domains, with over 100 million instances. Audio-FLAN lays the foundation for unified audio-language models that can seamlessly handle both understanding (e.g., transcription, comprehension) and generation (e.g., speech, music, sound) tasks across a wide range of audio domains in a zero-shot manner. The Audio-FLAN dataset is available on HuggingFace and GitHub and will be continuously updated.
- Published
- 2025
8. Robust Target Speaker Direction of Arrival Estimation
- Author
-
Li, Zixuan, He, Shulin, and Zhang, Xueliang
- Subjects
Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
In multi-speaker environments the direction of arrival (DOA) of a target speaker is key for improving speech clarity and extracting target speaker's voice. However, traditional DOA estimation methods often struggle in the presence of noise, reverberation, and particularly when competing speakers are present. To address these challenges, we propose RTS-DOA, a robust real-time DOA estimation system. This system innovatively uses the registered speech of the target speaker as a reference and leverages full-band and sub-band spectral information from a microphone array to estimate the DOA of the target speaker's voice. Specifically, the system comprises a speech enhancement module for initially improving speech quality, a spatial module for learning spatial information, and a speaker module for extracting voiceprint features. Experimental results on the LibriSpeech dataset demonstrate that our RTS-DOA system effectively tackles multi-speaker scenarios and established new optimal benchmarks.
- Published
- 2024
9. AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual Alignment
- Author
-
Zuo, Yuxin, Jiang, Wenxuan, Liu, Wenxuan, Li, Zixuan, Bai, Long, Wang, Hanbin, Zeng, Yutao, Jin, Xiaolong, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Empirical evidence suggests that LLMs exhibit spontaneous cross-lingual alignment. Our findings suggest that although LLMs also demonstrate promising cross-lingual alignment in Information Extraction, there remains significant imbalance across languages, revealing an underlying deficiency in the IE alignment. To address this issue, we propose AlignXIE, a powerful code-based LLM that significantly enhances cross-lingual IE alignment through two strategies. Firstly, AlignXIE formulates IE across different languages, especially non-English ones, as code generation tasks, standardizing the representation of various schemas using Python classes to ensure consistency of the same ontology in different languages and align the schema. Secondly, it incorporates an IE cross-lingual alignment phase through a translated instance prediction task proposed in this paper to align the extraction process, utilizing ParallelNER, an IE bilingual parallel dataset with 257,190 samples, generated by our proposed LLM-based automatic pipeline for IE parallel data construction, with manual annotation to ensure quality. Ultimately, we obtain AlignXIE through multilingual IE instruction tuning. Although without training in 9 unseen languages, AlignXIE surpasses ChatGPT by $30.17\%$ and SoTA by $20.03\%$, thereby demonstrating superior cross-lingual IE capabilities. Comprehensive evaluations on 63 IE benchmarks in Chinese and English under various settings, demonstrate that AlignXIE significantly enhances cross-lingual and multilingual IE through boosting the IE alignment., Comment: Work in progress
- Published
- 2024
10. Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models
- Author
-
Lu, Hongyuan, Li, Zixuan, and Lam, Wai
- Subjects
Computer Science - Computation and Language - Abstract
As current training data for Large Language Models (LLMs) are dominated by English corpus, they are English-centric and they present impressive performance on English reasoning tasks.\footnote{This paper primarily studies English-centric models, but our method could be universal by using the centric language in the dictionary for non-English-centric LLMs.} Yet, they usually suffer from lower performance in other languages. There are about 7,000 languages over the world, and many are low-resourced on English-centric LLMs. For the sake of people who primarily speak these languages, it is especially urgent to enable our LLMs in those languages. Model training is usually effective, but computationally expensive and requires experienced NLP practitioners. This paper presents a novel and simple yet effective method called \textbf{D}ictionary \textbf{I}nsertion \textbf{P}rompting (\textbf{DIP}). When providing a non-English prompt, DIP looks up a word dictionary and inserts words' English counterparts into the prompt for LLMs. It then enables better translation into English and better English model thinking steps which leads to obviously better results. We experiment with about 200 languages from FLORES-200. Since there are no adequate datasets, we use the NLLB translator to create synthetic multilingual benchmarks from the existing 4 English reasoning benchmarks such as GSM8K and AQuA. Despite the simplicity and computationally lightweight, we surprisingly found the effectiveness of DIP on math and commonsense reasoning tasks on multiple open-source and close-source LLMs.\footnote{Our dictionaries, code, and synthetic benchmarks will be open-sourced to facilitate future research.}
- Published
- 2024
11. UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation
- Author
-
Li, Zixuan, Xiong, Jing, Ye, Fanghua, Zheng, Chuanyang, Wu, Xun, Lu, Jianqiao, Wan, Zhongwei, Liang, Xiaodan, Li, Chengming, Sun, Zhenan, Kong, Lingpeng, and Wong, Ngai
- Subjects
Computer Science - Computation and Language - Abstract
We present UncertaintyRAG, a novel approach for long-context Retrieval-Augmented Generation (RAG) that utilizes Signal-to-Noise Ratio (SNR)-based span uncertainty to estimate similarity between text chunks. This span uncertainty enhances model calibration, improving robustness and mitigating semantic inconsistencies introduced by random chunking. Leveraging this insight, we propose an efficient unsupervised learning technique to train the retrieval model, alongside an effective data sampling and scaling strategy. UncertaintyRAG outperforms baselines by 2.03% on LLaMA-2-7B, achieving state-of-the-art results while using only 4% of the training data compared to other advanced open-source retrieval models under distribution shift settings. Our method demonstrates strong calibration through span uncertainty, leading to improved generalization and robustness in long-context RAG tasks. Additionally, UncertaintyRAG provides a lightweight retrieval model that can be integrated into any large language model with varying context window lengths, without the need for fine-tuning, showcasing the flexibility of our approach.
- Published
- 2024
12. SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal
- Author
-
Long, Fang, Su, Wenkang, Li, Zixuan, Cai, Lei, Li, Mingjie, Wang, Yuan-Gen, and Cao, Xiaochun
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Adverse weather removal aims to restore clear vision under adverse weather conditions. Existing methods are mostly tailored for specific weather types and rely heavily on extensive labeled data. In dealing with these two limitations, this paper presents a pioneering semi-supervised all-in-one adverse weather removal framework built on the teacher-student network with a Denoising Diffusion Model (DDM) as the backbone, termed SemiDDM-Weather. As for the design of DDM backbone in our SemiDDM-Weather, we adopt the SOTA Wavelet Diffusion Model-Wavediff with customized inputs and loss functions, devoted to facilitating the learning of many-to-one mapping distributions for efficient all-in-one adverse weather removal with limited label data. To mitigate the risk of misleading model training due to potentially inaccurate pseudo-labels generated by the teacher network in semi-supervised learning, we introduce quality assessment and content consistency constraints to screen the "optimal" outputs from the teacher network as the pseudo-labels, thus more effectively guiding the student network training with unlabeled data. Experimental results show that on both synthetic and real-world datasets, our SemiDDM-Weather consistently delivers high visual quality and superior adverse weather removal, even when compared to fully supervised competitors. Our code and pre-trained model are available at this repository.
- Published
- 2024
13. Fiber-level Woven Fabric Capture from a Single Photo
- Author
-
Li, Zixuan, Shen, Pengfei, Sun, Hanxiao, Zhang, Zibo, Guo, Yu, Liu, Ligang, Yan, Ling-Qi, Marschner, Steve, Hasan, Milos, and Wang, Beibei
- Subjects
Computer Science - Graphics - Abstract
Accurately rendering the appearance of fabrics is challenging, due to their complex 3D microstructures and specialized optical properties. If we model the geometry and optics of fabrics down to the fiber level, we can achieve unprecedented rendering realism, but this raises the difficulty of authoring or capturing the fiber-level assets. Existing approaches can obtain fiber-level geometry with special devices (e.g., CT) or complex hand-designed procedural pipelines (manually tweaking a set of parameters). In this paper, we propose a unified framework to capture fiber-level geometry and appearance of woven fabrics using a single low-cost microscope image. We first use a simple neural network to predict initial parameters of our geometric and appearance models. From this starting point, we further optimize the parameters of procedural fiber geometry and an approximated shading model via differentiable rasterization to match the microscope photo more accurately. Finally, we refine the fiber appearance parameters via differentiable path tracing, converging to accurate fiber optical parameters, which are suitable for physically-based light simulations to produce high-quality rendered results. We believe that our method is the first to utilize differentiable rendering at the microscopic level, supporting physically-based scattering from explicit fiber assemblies. Our fabric parameter estimation achieves high-quality re-rendering of measured woven fabric samples in both distant and close-up views. These results can further be used for efficient rendering or converted to downstream representations. We also propose a patch-space fiber geometry procedural generation and a two-scale path tracing framework for efficient rendering of fabric scenes., Comment: due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file
- Published
- 2024
14. AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents
- Author
-
Chen, Guhong, Fan, Liyang, Gong, Zihan, Xie, Nan, Li, Zixuan, Liu, Ziqiang, Li, Chengming, Qu, Qiang, Ni, Shiwen, and Yang, Min
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
In this paper, we present a simulation system called AgentCourt that simulates the entire courtroom process. The judge, plaintiff's lawyer, defense lawyer, and other participants are autonomous agents driven by large language models (LLMs). Our core goal is to enable lawyer agents to learn how to argue a case, as well as improving their overall legal skills, through courtroom process simulation. To achieve this goal, we propose an adversarial evolutionary approach for the lawyer-agent. Since AgentCourt can simulate the occurrence and development of court hearings based on a knowledge base and LLM, the lawyer agents can continuously learn and accumulate experience from real court cases. The simulation experiments show that after two lawyer-agents have engaged in a thousand adversarial legal cases in AgentCourt (which can take a decade for real-world lawyers), compared to their pre-evolutionary state, the evolved lawyer agents exhibit consistent improvement in their ability to handle legal tasks. To enhance the credibility of our experimental results, we enlisted a panel of professional lawyers to evaluate our simulations. The evaluation indicates that the evolved lawyer agents exhibit notable advancements in responsiveness, as well as expertise and logical rigor. This work paves the way for advancing LLM-driven agent technology in legal scenarios. Code is available at https://github.com/relic-yuexi/AgentCourt.
- Published
- 2024
15. Unifying Visual and Semantic Feature Spaces with Diffusion Models for Enhanced Cross-Modal Alignment
- Author
-
Zheng, Yuze, Li, Zixuan, Li, Xiangxian, Liu, Jinxing, Wang, Yuqing, Meng, Xiangxu, and Meng, Lei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Image classification models often demonstrate unstable performance in real-world applications due to variations in image information, driven by differing visual perspectives of subject objects and lighting discrepancies. To mitigate these challenges, existing studies commonly incorporate additional modal information matching the visual data to regularize the model's learning process, enabling the extraction of high-quality visual features from complex image regions. Specifically, in the realm of multimodal learning, cross-modal alignment is recognized as an effective strategy, harmonizing different modal information by learning a domain-consistent latent feature space for visual and semantic features. However, this approach may face limitations due to the heterogeneity between multimodal information, such as differences in feature distribution and structure. To address this issue, we introduce a Multimodal Alignment and Reconstruction Network (MARNet), designed to enhance the model's resistance to visual noise. Importantly, MARNet includes a cross-modal diffusion reconstruction module for smoothly and stably blending information across different domains. Experiments conducted on two benchmark datasets, Vireo-Food172 and Ingredient-101, demonstrate that MARNet effectively improves the quality of image information extracted by the model. It is a plug-and-play framework that can be rapidly integrated into various image classification frameworks, boosting model performance.
- Published
- 2024
16. APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation
- Author
-
Hu, Yuxuan, Tan, Minghuan, Zhang, Chenwei, Li, Zixuan, Liang, Xiaodan, Yang, Min, Li, Chengming, and Hu, Xiping
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Empathetic response generation is designed to comprehend the emotions of others and select the most appropriate strategies to assist them in resolving emotional challenges. Empathy can be categorized into cognitive empathy and affective empathy. The former pertains to the ability to understand and discern the emotional issues and situations of others, while the latter involves the capacity to provide comfort. To enhance one's empathetic abilities, it is essential to develop both these aspects. Therefore, we develop an innovative framework that combines retrieval augmentation and emotional support strategy integration. Our framework starts with the introduction of a comprehensive emotional palette for empathy. We then apply appraisal theory to decompose this palette and create a database of empathetic responses. This database serves as an external resource and enhances the LLM's empathy by integrating semantic retrieval mechanisms. Moreover, our framework places a strong emphasis on the proper articulation of response strategies. By incorporating emotional support strategies, we aim to enrich the model's capabilities in both cognitive and affective empathy, leading to a more nuanced and comprehensive empathetic response. Finally, we extract datasets ED and ET from the empathetic dialogue dataset \textsc{EmpatheticDialogues} and ExTES based on dialogue length. Experiments demonstrate that our framework can enhance the empathy ability of LLMs from both cognitive and affective empathy perspectives. Our code is released at https://github.com/CAS-SIAT-XinHai/APTNESS., Comment: Appectped to CIKM2024
- Published
- 2024
17. Assessment of acute radial artery injury after distal transradial access for coronary intervention: an optical coherence tomography study
- Author
-
Niu, Dan, Wang, Yuntao, Wu, Yongxia, Li, Zixuan, Liu, Hao, and Guo, Jincheng
- Published
- 2025
- Full Text
- View/download PDF
18. Andrographolide Suppresses the Growth and Metastasis of Luminal-Like Breast Cancer by Inhibiting the NF-κB/miR-21-5p/PDCD4 Signaling Pathway
- Author
-
Junchen Li, Lixun Huang, Zinan He, Minggui Chen, Yi Ding, Yuying Yao, Youfa Duan, Li Zixuan, Cuiling Qi, Lingyun Zheng, Jiangchao Li, Rongxin Zhang, Xiaoming Li, Jianwei Dai, Lijing Wang, and Qian-Qian Zhang
- Subjects
luminal-like breast cancer ,andrographolide ,growth ,metastasis ,NF-κB/miR-21-5p/PDCD4 signaling pathway ,Biology (General) ,QH301-705.5 - Abstract
Tumor growth and metastasis are responsible for breast cancer-related mortality. Andrographolide (Andro) is a traditional anti-inflammatory drug used in the clinic that inhibits NF-κB activation. Recently, Andro has been found in the treatment of various cancers. Andro inhibits breast cell proliferation and invasion and induces apoptosis via activating various signaling pathways. Therefore, the underlying mechanisms with regard to the antitumor effects of Andro still need to be further confirmed. Herein, a MMTV-PyMT spontaneous luminal-like breast cancer lung metastatic transgenic tumor model was employed to estimate the antitumor effects of Andro on breast cancer in vivo. Andro significantly inhibited tumor growth and metastasis in MMTV-PyMT mice and suppressed the cell proliferation, migration, and invasion of MCF-7 breast cancer cells in vitro. Meanwhile, Andro significantly inhibited the expression of NF-κB, and the downregulated NF-κB reduced miR-21-5p expression. In addition, miR-21-5p dramatically inhibited the target gene expression of programmed cell death protein 4 (PDCD4). In the current study, we demonstrated the potential anticancer effects of Andro on luminal-like breast cancer and indicated that Andro inhibits the expression of miR-21-5p and further promotes PDCD4 via NF-κB suppression. Therefore, Andro could be an antitumor agent for the treatment of luminal-like breast cancer in the clinic.
- Published
- 2021
- Full Text
- View/download PDF
19. Leaching Behavior of Hydrogen Peroxide in the Process of Recovering Silver from Spent Silver Catalysts
- Author
-
Yang, Quan, Han, Jibiao, Ma, Wangrui, Yuan, Ruiling, Li, Yong, Zhao, Yu, Li, Zixuan, Forsberg, Kerstin, editor, Karamalidis, Athanasios, editor, Ouchi, Takanari, editor, Azimi, Gisele, editor, Alam, Shafiq, editor, Neelameggham, Neale R., editor, Baba, Alafara Abdullahi, editor, Peng, Hong, editor, and Kim, Hojong, editor
- Published
- 2025
- Full Text
- View/download PDF
20. Multi-task Instruction Tuning for Temporal Question Answering over Knowledge Graphs
- Author
-
Su, Miao, Li, Zixuan, Jin, Xiaolong, Guo, Jiafeng, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, He, Xiangnan, editor, Ren, Zhaochun, editor, and Tang, Ruiming, editor
- Published
- 2025
- Full Text
- View/download PDF
21. GPT-4V Explorations: Mining Autonomous Driving
- Author
-
Li, Zixuan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper explores the application of the GPT-4V(ision) large visual language model to autonomous driving in mining environments, where traditional systems often falter in understanding intentions and making accurate decisions during emergencies. GPT-4V introduces capabilities for visual question answering and complex scene comprehension, addressing challenges in these specialized settings.Our evaluation focuses on its proficiency in scene understanding, reasoning, and driving functions, with specific tests on its ability to recognize and interpret elements such as pedestrians, various vehicles, and traffic devices. While GPT-4V showed robust comprehension and decision-making skills, it faced difficulties in accurately identifying specific vehicle types and managing dynamic interactions. Despite these challenges, its effective navigation and strategic decision-making demonstrate its potential as a reliable agent for autonomous driving in the complex conditions of mining environments, highlighting its adaptability and operational viability in industrial settings.
- Published
- 2024
22. Temporal Knowledge Graph Question Answering: A Survey
- Author
-
Su, Miao, Li, Zixuan, Chen, Zhuo, Bai, Long, Jin, Xiaolong, and Guo, Jiafeng
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Knowledge Base Question Answering (KBQA) has been a long-standing field to answer questions based on knowledge bases. Recently, the evolving dynamics of knowledge have attracted a growing interest in Temporal Knowledge Graph Question Answering (TKGQA), an emerging task to answer temporal questions. However, this field grapples with ambiguities in defining temporal questions and lacks a systematic categorization of existing methods for TKGQA. In response, this paper provides a thorough survey from two perspectives: the taxonomy of temporal questions and the methodological categorization for TKGQA. Specifically, we first establish a detailed taxonomy of temporal questions engaged in prior studies. Subsequently, we provide a comprehensive review of TKGQA techniques of two categories: semantic parsing-based and TKG embedding-based. Building on this review, the paper outlines potential research directions aimed at advancing the field of TKGQA. This work aims to serve as a comprehensive reference for TKGQA and to stimulate further research., Comment: 8 pages, 3 figures
- Published
- 2024
23. Design Editing for Offline Model-based Optimization
- Author
-
Yuan, Ye, Zhang, Youyuan, Chen, Can, Wu, Haolun, Li, Zixuan, Li, Jianmo, Clark, James J., and Liu, Xue
- Subjects
Computer Science - Machine Learning ,Computer Science - Computational Engineering, Finance, and Science - Abstract
Offline model-based optimization (MBO) aims to maximize a black-box objective function using only an offline dataset of designs and scores. These tasks span various domains, such as robotics, material design, and protein and molecular engineering. A common approach involves training a surrogate model using existing designs and their corresponding scores, and then generating new designs through gradient-based updates with respect to the surrogate model. This method suffers from the out-of-distribution issue, where the surrogate model may erroneously predict high scores for unseen designs. To address this challenge, we introduce a novel method, Design Editing for Offline Model-based Optimization} (DEMO), which leverages a diffusion prior to calibrate overly optimized designs. DEMO first generates pseudo design candidates by performing gradient ascent with respect to a surrogate model. Then, an editing process refines these pseudo design candidates by introducing noise and subsequently denoising them with a diffusion prior trained on the offline dataset, ensuring they align with the distribution of valid designs. We provide a theoretical proof that the difference between the final optimized designs generated by DEMO and the prior distribution of the offline dataset is controlled by the noise injected during the editing process. Empirical evaluations on seven offline MBO tasks show that DEMO outperforms various baseline methods, achieving the highest mean rank of 2.1 and a median rank of 1.
- Published
- 2024
24. Woven Fabric Capture with a Reflection-Transmission Photo Pair
- Author
-
Tang, Yingjie, Li, Zixuan, Hašan, Miloš, Yang, Jian, and Wang, Beibei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics - Abstract
Digitizing woven fabrics would be valuable for many applications, from digital humans to interior design. Previous work introduces a lightweight woven fabric acquisition approach by capturing a single reflection image and estimating the fabric parameters with a differentiable geometric and shading model. The renderings of the estimated fabric parameters can closely match the photo; however, the captured reflection image is insufficient to fully characterize the fabric sample reflectance. For instance, fabrics with different thicknesses might have similar reflection images but lead to significantly different transmission. We propose to recover the woven fabric parameters from two captured images: reflection and transmission. At the core of our method is a differentiable bidirectional scattering distribution function (BSDF) model, handling reflection and transmission, including single and multiple scattering. We propose a two-layer model, where the single scattering uses an SGGX phase function as in previous work, and multiple scattering uses a new azimuthally-invariant microflake definition, which we term ASGGX. This new fabric BSDF model closely matches real woven fabrics in both reflection and transmission. We use a simple setup for capturing reflection and transmission photos with a cell phone camera and two point lights, and estimate the fabric parameters via a lightweight network, together with a differentiable optimization. We also model the out-of-focus effects explicitly with a simple solution to match the thin-lens camera better. As a result, the renderings of the estimated parameters can agree with the input images on both reflection and transmission for the first time. The code for this paper is at https://github.com/lxtyin/FabricBTDF-Recovery., Comment: 10 pages, 16 figures (in the main paper). Accepted by SIGGRAPH 2024 conference
- Published
- 2024
25. cuFastTuckerPlus: A Stochastic Parallel Sparse FastTucker Decomposition Using GPU Tensor Cores
- Author
-
Li, Zixuan, Duan, Mingxing, Luo, Huizhang, Yang, Wangdong, Li, Kenli, and Li, Keqin
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Sparse tensors are prevalent in real-world applications, often characterized by their large-scale, high-order, and high-dimensional nature. Directly handling raw tensors is impractical due to the significant memory and computational overhead involved. The current mainstream approach involves compressing or decomposing the original tensor. One popular tensor decomposition algorithm is the Tucker decomposition. However, existing state-of-the-art algorithms for large-scale Tucker decomposition typically relax the original optimization problem into multiple convex optimization problems to ensure polynomial convergence. Unfortunately, these algorithms tend to converge slowly. In contrast, tensor decomposition exhibits a simple optimization landscape, making local search algorithms capable of converging to a global (approximate) optimum much faster. In this paper, we propose the FastTuckerPlus algorithm, which decomposes the original optimization problem into two non-convex optimization problems and solves them alternately using the Stochastic Gradient Descent method. Furthermore, we introduce cuFastTuckerPlus, a fine-grained parallel algorithm designed for GPU platforms, leveraging the performance of tensor cores. This algorithm minimizes memory access overhead and computational costs, surpassing the state-of-the-art algorithms. Our experimental results demonstrate that our method achieves a speedup of $3X$ to $5X$ compared to state-of-the-art algorithms.
- Published
- 2024
26. Self-Improvement Programming for Temporal Knowledge Graph Question Answering
- Author
-
Chen, Zhuo, Zhang, Zhao, Li, Zixuan, Wang, Fei, Zeng, Yutao, Jin, Xiaolong, and Xu, Yongjun
- Subjects
Computer Science - Computation and Language - Abstract
Temporal Knowledge Graph Question Answering (TKGQA) aims to answer questions with temporal intent over Temporal Knowledge Graphs (TKGs). The core challenge of this task lies in understanding the complex semantic information regarding multiple types of time constraints (e.g., before, first) in questions. Existing end-to-end methods implicitly model the time constraints by learning time-aware embeddings of questions and candidate answers, which is far from understanding the question comprehensively. Motivated by semantic-parsing-based approaches that explicitly model constraints in questions by generating logical forms with symbolic operators, we design fundamental temporal operators for time constraints and introduce a novel self-improvement Programming method for TKGQA (Prog-TQA). Specifically, Prog-TQA leverages the in-context learning ability of Large Language Models (LLMs) to understand the combinatory time constraints in the questions and generate corresponding program drafts with a few examples given. Then, it aligns these drafts to TKGs with the linking module and subsequently executes them to generate the answers. To enhance the ability to understand questions, Prog-TQA is further equipped with a self-improvement strategy to effectively bootstrap LLMs using high-quality self-generated drafts. Extensive experiments demonstrate the superiority of the proposed Prog-TQA on MultiTQ and CronQuestions datasets, especially in the Hits@1 metric., Comment: Accepted by LREC-COLING 2024 (long paper)
- Published
- 2024
27. Selective Temporal Knowledge Graph Reasoning
- Author
-
Hou, Zhongni, Jin, Xiaolong, Li, Zixuan, Bai, Long, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Machine Learning - Abstract
Temporal Knowledge Graph (TKG), which characterizes temporally evolving facts in the form of (subject, relation, object, timestamp), has attracted much attention recently. TKG reasoning aims to predict future facts based on given historical ones. However, existing TKG reasoning models are unable to abstain from predictions they are uncertain, which will inevitably bring risks in real-world applications. Thus, in this paper, we propose an abstention mechanism for TKG reasoning, which helps the existing models make selective, instead of indiscriminate, predictions. Specifically, we develop a confidence estimator, called Confidence Estimator with History (CEHis), to enable the existing TKG reasoning models to first estimate their confidence in making predictions, and then abstain from those with low confidence. To do so, CEHis takes two kinds of information into consideration, namely, the certainty of the current prediction and the accuracy of historical predictions. Experiments with representative TKG reasoning models on two benchmark datasets demonstrate the effectiveness of the proposed CEHis.
- Published
- 2024
28. Evaluation Methods for Breast Cancer Prediction in Machine Learning Field
- Author
-
Zhang Zirui and Li Zixuan
- Subjects
Social Sciences - Abstract
Breast cancer is the most common malignant tumor found in women, and there is no cure for advanced breast cancer. Early detection and treatment can effectively improve patient survival. This paper uses five machine learning classification models, namely Support Vector Machine (SVM), Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), and K-Nearest Neighbors Algorithm (KNN). The training data for the five models are provided by the Wisconsin Breast Cancer Dataset (WBCD). By evaluating and comparing the performance of the five models in accuracy, F1Score, ROC curve, and PR curve, the study finds that LR has the best performance.
- Published
- 2022
- Full Text
- View/download PDF
29. Visible Light-Responsive AlFeO3@g-C3N4 Heterojunction for Efficient Degradation of Organic Wastewater
- Author
-
Li, Zixuan, Mu, Rui, Zhang, Wei, Lin, Xue, Cui, Qi, and Gu, Di
- Published
- 2024
- Full Text
- View/download PDF
30. KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction
- Author
-
Li, Zixuan, Zeng, Yutao, Zuo, Yuxin, Ren, Weicheng, Liu, Wenxuan, Su, Miao, Guo, Yucan, Liu, Yantao, Li, Xiang, Hu, Zhilei, Bai, Long, Li, Wei, Liu, Yidan, Yang, Pan, Jin, Xiaolong, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In this paper, we propose KnowCoder, a Large Language Model (LLM) to conduct Universal Information Extraction (UIE) via code generation. KnowCoder aims to develop a kind of unified schema representation that LLMs can easily understand and an effective learning framework that encourages LLMs to follow schemas and extract structured knowledge accurately. To achieve these, KnowCoder introduces a code-style schema representation method to uniformly transform different schemas into Python classes, with which complex schema information, such as constraints among tasks in UIE, can be captured in an LLM-friendly manner. We further construct a code-style schema library covering over $\textbf{30,000}$ types of knowledge, which is the largest one for UIE, to the best of our knowledge. To ease the learning process of LLMs, KnowCoder contains a two-phase learning framework that enhances its schema understanding ability via code pretraining and its schema following ability via instruction tuning. After code pretraining on around $1.5$B automatically constructed data, KnowCoder already attains remarkable generalization ability and achieves relative improvements by $\textbf{49.8%}$ F1, compared to LLaMA2, under the few-shot setting. After instruction tuning, KnowCoder further exhibits strong generalization ability on unseen schemas and achieves up to $\textbf{12.5%}$ and $\textbf{21.9%}$, compared to sota baselines, under the zero-shot setting and the low resource setting, respectively. Additionally, based on our unified schema representations, various human-annotated datasets can simultaneously be utilized to refine KnowCoder, which achieves significant improvements up to $\textbf{7.5%}$ under the supervised setting.
- Published
- 2024
31. Learning to Ask Critical Questions for Assisting Product Search
- Author
-
Li, Zixuan, Liao, Lizi, and Chua, Tat-Seng
- Subjects
Computer Science - Information Retrieval - Abstract
Product search plays an essential role in eCommerce. It was treated as a special type of information retrieval problem. Most existing works make use of historical data to improve the search performance, which do not take the opportunity to ask for user's current interest directly. Some session-aware methods take the user's clicks within the session as implicit feedback, but it is still just a guess on user's preference. To address this problem, recent conversational or question-based search models interact with users directly for understanding the user's interest explicitly. However, most users do not have a clear picture on what to buy at the initial stage. Asking critical attributes that the user is looking for after they explored for a while should be a more efficient way to help them searching for the target items. In this paper, we propose a dual-learning model that hybrids the best from both implicit session feedback and proactively clarifying with users on the most critical questions. We first establish a novel utility score to measure whether a clicked item provides useful information for finding the target. Then we develop the dual Selection Net and Ranking Net for choosing the critical questions and ranking the items. It innovatively links traditional click-stream data and text-based questions together. To verify our proposal, we did extensive experiments on a public dataset, and our model largely outperformed other state-of-the-art methods., Comment: SIGIR eCom'22
- Published
- 2024
32. Contrastive Pre-training for Deep Session Data Understanding
- Author
-
Li, Zixuan, Liao, Lizi, Ma, Yunshan, and Chua, Tat-Seng
- Subjects
Computer Science - Information Retrieval - Abstract
Session data has been widely used for understanding user's behavior in e-commerce. Researchers are trying to leverage session data for different tasks, such as purchase intention prediction, remaining length prediction, recommendation, etc., as it provides context clues about the user's dynamic interests. However, online shopping session data is semi-structured and complex in nature, which contains both unstructured textual data about the products, search queries, and structured user action sequences. Most existing works focus on leveraging the coarse-grained item sequences for specific tasks, while largely ignore the fine-grained information from text and user action details. In this work, we delve into deep session data understanding via scrutinizing the various clues inside the rich information in user sessions. Specifically, we propose to pre-train a general-purpose User Behavior Model (UBM) over large-scale session data with rich details, such as product title, attributes and various kinds of user actions. A two-stage pre-training scheme is introduced to encourage the model to self-learn from various augmentations with contrastive learning objectives, which spans different granularity levels of session data. Then the well-trained session understanding model can be easily fine-tuned for various downstream tasks. Extensive experiments show that UBM better captures the complex intra-item semantic relations, inter-item connections and inter-interaction dependencies, leading to large performance gains as compared to the baselines on several downstream tasks. And it also demonstrates strong robustness when data is sparse.
- Published
- 2024
33. Unlocking the Power of Large Language Models for Entity Alignment
- Author
-
Jiang, Xuhui, Shen, Yinghan, Shi, Zhichao, Xu, Chengjin, Li, Wei, Li, Zixuan, Guo, Jian, Shen, Huawei, and Wang, Yuanzhuo
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Entity Alignment (EA) is vital for integrating diverse knowledge graph (KG) data, playing a crucial role in data-driven AI applications. Traditional EA methods primarily rely on comparing entity embeddings, but their effectiveness is constrained by the limited input KG data and the capabilities of the representation learning techniques. Against this backdrop, we introduce ChatEA, an innovative framework that incorporates large language models (LLMs) to improve EA. To address the constraints of limited input KG data, ChatEA introduces a KG-code translation module that translates KG structures into a format understandable by LLMs, thereby allowing LLMs to utilize their extensive background knowledge to improve EA accuracy. To overcome the over-reliance on entity embedding comparisons, ChatEA implements a two-stage EA strategy that capitalizes on LLMs' capability for multi-step reasoning in a dialogue format, thereby enhancing accuracy while preserving efficiency. Our experimental results verify ChatEA's superior performance, highlighting LLMs' potential in facilitating EA tasks.
- Published
- 2024
34. Impacts of landscape patterns on habitat quality in coal resource-exhausted cities: Spatial–temporal dynamics and non-stationary scale effects
- Author
-
Li, Zixuan, Xu, Ziqi, Chen, Yedong, Gu, Sihao, and Li, Cheng
- Published
- 2025
- Full Text
- View/download PDF
35. The construction technology and finite element analysis of the whole lifting of large grid
- Author
-
Deng Deyuan, Zhang Weixiong, Li Zixuan, Li Guangjun, Mo TianFang, Chen Zhou, and Lu Hanwen
- Subjects
Environmental sciences ,GE1-350 - Abstract
Taking a large grid frame in a maintenance hangar of Guangzhou Baiyun Airport as the research object, the construction steps of lifting the grid structure are briefly described, and the static mechanical characteristics, dynamic mechanical characteristics and stability of the large grid frame were studied by using Midas/Civil finite element analysis software, which meets the design and the practical application requirements. Since its first six orders safety factors are greater than 4. The stress state of the nodes at the important supports in the grid is mainly controlled by axial force.
- Published
- 2021
- Full Text
- View/download PDF
36. Retrieval-Augmented Code Generation for Universal Information Extraction
- Author
-
Guo, Yucan, Li, Zixuan, Jin, Xiaolong, Liu, Yantao, Zeng, Yutao, Liu, Wenxuan, Li, Xiang, Yang, Pan, Bai, Long, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Information Retrieval - Abstract
Information Extraction (IE) aims to extract structural knowledge (e.g., entities, relations, events) from natural language texts, which brings challenges to existing methods due to task-specific schemas and complex text expressions. Code, as a typical kind of formalized language, is capable of describing structural knowledge under various schemas in a universal way. On the other hand, Large Language Models (LLMs) trained on both codes and texts have demonstrated powerful capabilities of transforming texts into codes, which provides a feasible solution to IE tasks. Therefore, in this paper, we propose a universal retrieval-augmented code generation framework based on LLMs, called Code4UIE, for IE tasks. Specifically, Code4UIE adopts Python classes to define task-specific schemas of various structural knowledge in a universal way. By so doing, extracting knowledge under these schemas can be transformed into generating codes that instantiate the predefined Python classes with the information in texts. To generate these codes more precisely, Code4UIE adopts the in-context learning mechanism to instruct LLMs with examples. In order to obtain appropriate examples for different tasks, Code4UIE explores several example retrieval strategies, which can retrieve examples semantically similar to the given texts. Extensive experiments on five representative IE tasks across nine datasets demonstrate the effectiveness of the Code4UIE framework.
- Published
- 2023
37. An In-Context Schema Understanding Method for Knowledge Base Question Answering
- Author
-
Liu, Yantao, Li, Zixuan, Jin, Xiaolong, Guo, Yucan, Bai, Long, Guan, Saiping, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Computation and Language - Abstract
The Knowledge Base Question Answering (KBQA) task aims to answer natural language questions based on a given knowledge base. Recently, Large Language Models (LLMs) have shown strong capabilities in language understanding and can be used to solve this task. In doing so, a major challenge for LLMs is to overcome the immensity and heterogeneity of knowledge base schemas.Existing methods bypass this challenge by initially employing LLMs to generate drafts of logic forms without schema-specific details.Then, an extra module is used to inject schema information to these drafts.In contrast, in this paper, we propose a simple In-Context Schema Understanding (ICSU) method that enables LLMs to directly understand schemas by leveraging in-context learning. Specifically, ICSU provides schema information to LLMs using schema-related annotated examples. We investigate three example retrieval strategies based on raw questions, anonymized questions, and generated SPARQL queries. Experimental results show that ICSU demonstrates competitive performance compared to baseline methods on both the KQA Pro and WebQSP datasets.
- Published
- 2023
38. DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning
- Author
-
Xiong, Jing, Li, Zixuan, Zheng, Chuanyang, Guo, Zhijiang, Yin, Yichun, Xie, Enze, Yang, Zhicheng, Cao, Qingxing, Wang, Haiming, Han, Xiongwei, Tang, Jing, Li, Chengming, and Liang, Xiaodan
- Subjects
Computer Science - Computation and Language - Abstract
Recent advances in natural language processing, primarily propelled by Large Language Models (LLMs), have showcased their remarkable capabilities grounded in in-context learning. A promising avenue for guiding LLMs in intricate reasoning tasks involves the utilization of intermediate reasoning steps within the Chain-of-Thought (CoT) paradigm. Nevertheless, the central challenge lies in the effective selection of exemplars for facilitating in-context learning. In this study, we introduce a framework that leverages Dual Queries and Low-rank approximation Re-ranking (DQ-LoRe) to automatically select exemplars for in-context learning. Dual Queries first query LLM to obtain LLM-generated knowledge such as CoT, then query the retriever to obtain the final exemplars via both question and the knowledge. Moreover, for the second query, LoRe employs dimensionality reduction techniques to refine exemplar selection, ensuring close alignment with the input question's knowledge. Through extensive experiments, we demonstrate that DQ-LoRe significantly outperforms prior state-of-the-art methods in the automatic selection of exemplars for GPT-4, enhancing performance from 92.5% to 94.2%. Our comprehensive analysis further reveals that DQ-LoRe consistently outperforms retrieval-based approaches in terms of both performance and adaptability, especially in scenarios characterized by distribution shifts. DQ-LoRe pushes the boundary of in-context learning and opens up new avenues for addressing complex reasoning challenges. Our code is released at https://github.com/AI4fun/DQ-LoRe}{https://github.com/AI4fun/DQ-LoRe., Comment: Accepted in ICLR 2024
- Published
- 2023
39. Effect of Normalizing Treatment on Microstructure and Mechanical Properties of Non-oriented Fe-3.0% Si Steel
- Author
-
Xu, Cheng, Xu, Haijie, Shu, Xuedao, Lu, Xubeng, Jiang, Lulan, Li, Zixuan, and Zhang, Yuanxiang
- Published
- 2024
- Full Text
- View/download PDF
40. Nested Event Extraction upon Pivot Element Recogniton
- Author
-
Ren, Weicheng, Li, Zixuan, Jin, Xiaolong, Bai, Long, Su, Miao, Liu, Yantao, Guan, Saiping, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Computation and Language - Abstract
Nested Event Extraction (NEE) aims to extract complex event structures where an event contains other events as its arguments recursively. Nested events involve a kind of Pivot Elements (PEs) that simultaneously act as arguments of outer-nest events and as triggers of inner-nest events, and thus connect them into nested structures. This special characteristic of PEs brings challenges to existing NEE methods, as they cannot well cope with the dual identities of PEs. Therefore, this paper proposes a new model, called PerNee, which extracts nested events mainly based on recognizing PEs. Specifically, PerNee first recognizes the triggers of both inner-nest and outer-nest events and further recognizes the PEs via classifying the relation type between trigger pairs. The model uses prompt learning to incorporate information from both event types and argument roles for better trigger and argument representations to improve NEE performance. Since existing NEE datasets (e.g., Genia11) are limited to specific domains and contain a narrow range of event types with nested structures, we systematically categorize nested events in the generic domain and construct a new NEE dataset, called ACE2005-Nest. Experimental results demonstrate that PerNee consistently achieves state-of-the-art performance on ACE2005-Nest, Genia11, and Genia13. The ACE2005-Nest dataset and the code of the PerNee model are available at https://github.com/waysonren/PerNee., Comment: Accepted at LREC-COLING 2024
- Published
- 2023
41. ProtoEM: A Prototype-Enhanced Matching Framework for Event Relation Extraction
- Author
-
Hu, Zhilei, Li, Zixuan, Xu, Daozhu, Bai, Long, Jin, Cheng, Jin, Xiaolong, Guo, Jiafeng, and Cheng, Xueqi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Event Relation Extraction (ERE) aims to extract multiple kinds of relations among events in texts. However, existing methods singly categorize event relations as different classes, which are inadequately capturing the intrinsic semantics of these relations. To comprehensively understand their intrinsic semantics, in this paper, we obtain prototype representations for each type of event relation and propose a Prototype-Enhanced Matching (ProtoEM) framework for the joint extraction of multiple kinds of event relations. Specifically, ProtoEM extracts event relations in a two-step manner, i.e., prototype representing and prototype matching. In the first step, to capture the connotations of different event relations, ProtoEM utilizes examples to represent the prototypes corresponding to these relations. Subsequently, to capture the interdependence among event relations, it constructs a dependency graph for the prototypes corresponding to these relations and utilized a Graph Neural Network (GNN)-based module for modeling. In the second step, it obtains the representations of new event pairs and calculates their similarity with those prototypes obtained in the first step to evaluate which types of event relations they belong to. Experimental results on the MAVEN-ERE dataset demonstrate that the proposed ProtoEM framework can effectively represent the prototypes of event relations and further obtain a significant improvement over baseline models., Comment: Work in progress
- Published
- 2023
42. Observation of odd-parity superconductivity in UTe2
- Author
-
Li, Zixuan, Moir, Camilla M., McKee, Nathan J., Lee-Wong, Eric, Baumbach, Ryan E., Maple, M. Brian, and Liu, Ying
- Subjects
Condensed Matter - Superconductivity - Abstract
Symmetry properties of the order parameter are among the most fundamental characteristics of a superconductor. The pairing symmetry of recently discovered heavy fermion superconductor UTe2 featuring an exceedingly large upper critical field has attracted a great deal of attention. Even though it is widely believed that UTe2 possesses an odd-parity, spin-triplet pairing symmetry, direct evidence for it is lacking, especially at zero or low magnetic fields. We report here the selection-rule results of Josephson coupling between In, an s-wave superconductor, and UTe2. The orientation dependence of the Josephson coupling suggests very strongly that UTe2 possess an odd-parity pairing state of B_1u in zero magnetic fields. We also report the formation of Andreev surface bound states on the (1-10) surface of UTe2., Comment: 16 pages, 4 figures, 2 tables
- Published
- 2023
43. A Ground Segmentation Method Based on Point Cloud Map for Unstructured Roads
- Author
-
Li, Zixuan, Lin, Haiying, Wang, Zhangyu, Li, Huazhi, Yu, Miao, and Wang, Jie
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Ground segmentation, as the basic task of unmanned intelligent perception, provides an important support for the target detection task. Unstructured road scenes represented by open-pit mines have irregular boundary lines and uneven road surfaces, which lead to segmentation errors in current ground segmentation methods. To solve this problem, a ground segmentation method based on point cloud map is proposed, which involves three parts: region of interest extraction, point cloud registration and background subtraction. Firstly, establishing boundary semantic associations to obtain regions of interest in unstructured roads. Secondly, establishing the location association between point cloud map and the real-time point cloud of region of interest by semantics information. Thirdly, establishing a background model based on Gaussian distribution according to location association, and segments the ground in real-time point cloud by the background substraction method. Experimental results show that the correct segmentation rate of ground points is 99.95%, and the running time is 26ms. Compared with state of the art ground segmentation algorithm Patchwork++, the average accuracy of ground point segmentation is increased by 7.43%, and the running time is increased by 17ms. Furthermore, the proposed method is practically applied to unstructured road scenarios represented by open pit mines.
- Published
- 2023
44. A time series study of the association between extreme temperature and ozone on varicella incidence
- Author
-
Zhang, Juan, Wang, Binhao, Li, Zixuan, Zhang, Wanze, Yan, Siyao, Geng, Qiaoling, Guo, Xian, Zhao, Zitong, Cai, Jianning, Liu, Lijuan, and Zhang, Xiaolin
- Published
- 2024
- Full Text
- View/download PDF
45. Comparative effectiveness of physical exercise interventions on sociability and communication in children and adolescents with autism: a systematic review and network meta-analysis
- Author
-
Kou, Ruijie, Li, Zixuan, Li, Ming, Zhou, Rui, Zhu, Feilong, Ruan, Weiqi, and Zhang, Jia
- Published
- 2024
- Full Text
- View/download PDF
46. Chemical inhibition of stomatal differentiation by perturbation of the master-regulatory bHLH heterodimer via an ACT-Like domain
- Author
-
Nakagawa, Ayami, Sepuru, Krishna Mohan, Yip, Shu Jan, Seo, Hyemin, Coffin, Calvin M., Hashimoto, Kota, Li, Zixuan, Segawa, Yasutomo, Iwasaki, Rie, Kato, Hiroe, Kurihara, Daisuke, Aihara, Yusuke, Kim, Stephanie, Kinoshita, Toshinori, Itami, Kenichiro, Han, Soon-Ki, Murakami, Kei, and Torii, Keiko U.
- Published
- 2024
- Full Text
- View/download PDF
47. Dietary patterns related to attention and physiological function in high-altitude migrants
- Author
-
Su, Rui, Zhang, Wenrui, Huang, Jie, Fan, Jing, Peng, Ping, Li, Hao, Zhang, Delong, Li, Yong, Ma, Hailin, Nie, Lijuan, and Li, Zixuan
- Published
- 2024
- Full Text
- View/download PDF
48. Airway management for right thoracoscopic tracheal tumour resection after left pneumonectomy assisted by cardiopulmonary bypass: a case report
- Author
-
Jiang, Xue, Li, Zixuan, Xu, Rukun, Wang, Xiaoliang, and Xu, Lei
- Published
- 2024
- Full Text
- View/download PDF
49. An integrated micromachined flexible ultrasonic-inductive sensor for pipe contaminant multiparameter detection
- Author
-
Yuan, Zheng, Wu, Xiaoyu, Li, Zhikang, Yuan, Jiawei, Zhao, Yihe, Li, Zixuan, Qin, Shaohui, Ma, Qi, Shi, Xuan, Zhao, Zilong, Li, Jiazhu, Zhang, Shiwang, Jing, Weixuan, Wang, Xiaozhang, and Zhao, Libo
- Published
- 2024
- Full Text
- View/download PDF
50. Author Correction: Recent northward shift of tropical cyclone economic risk in China
- Author
-
Qin, Lianjie, Zhu, Laiyin, Liao, Xinli, Meng, Chenna, Han, Qinmei, Li, Zixuan, Shen, Shifei, Xu, Wei, and Chen, Jianguo
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.