Author: "Lu, Jiaying" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Lu, Jiaying"' showing total 417 results

Start Over Author "Lu, Jiaying"

417 results on '"Lu, Jiaying"'

1. Measuring Spiritual Values and Bias of Large Language Models

Author: Liu, Songyuan, Zhang, Ziyang, Yan, Runze, Wu, Wei, Yang, Carl, and Lu, Jiaying
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have become integral tool for users from various backgrounds. LLMs, trained on vast corpora, reflect the linguistic and cultural nuances embedded in their pre-training data. However, the values and perspectives inherent in this data can influence the behavior of LLMs, leading to potential biases. As a result, the use of LLMs in contexts involving spiritual or moral values necessitates careful consideration of these underlying biases. Our work starts with verification of our hypothesis by testing the spiritual values of popular LLMs. Experimental results show that LLMs' spiritual values are quite diverse, as opposed to the stereotype of atheists or secularists. We then investigate how different spiritual values affect LLMs in social-fairness scenarios e.g., hate speech identification). Our findings reveal that different spiritual values indeed lead to different sensitivity to different hate target groups. Furthermore, we propose to continue pre-training LLMs on spiritual texts, and empirical results demonstrate the effectiveness of this approach in mitigating spiritual bias., Comment: 9 pages including appendix; 5 figures; 5 tables; submitted to ARR - Octobor 2024
Published: 2024

2. Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling

Author: Zhao, Shifan, Lu, Jiaying, Yang, Ji, Chow, Edmond, and Xi, Yuanzhe
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Probability, Statistics - Machine Learning, G.3, J.3
Abstract: Gaussian Process Regression (GPR) is widely used in statistics and machine learning for prediction tasks requiring uncertainty measures. Its efficacy depends on the appropriate specification of the mean function, covariance kernel function, and associated hyperparameters. Severe misspecifications can lead to inaccurate results and problematic consequences, especially in safety-critical applications. However, a systematic approach to handle these misspecifications is lacking in the literature. In this work, we propose a general framework to address these issues. Firstly, we introduce a flexible two-stage GPR framework that separates mean prediction and uncertainty quantification (UQ) to prevent mean misspecification, which can introduce bias into the model. Secondly, kernel function misspecification is addressed through a novel automatic kernel search algorithm, supported by theoretical analysis, that selects the optimal kernel from a candidate set. Additionally, we propose a subsampling-based warm-start strategy for hyperparameter initialization to improve efficiency and avoid hyperparameter misspecification. With much lower computational cost, our subsampling-based strategy can yield competitive or better performance than training exclusively on the full dataset. Combining all these components, we recommend two GPR methods-exact and scalable-designed to match available computational resources and specific UQ requirements. Extensive evaluation on real-world datasets, including UCI benchmarks and a safety-critical medical case study, demonstrates the robustness and precision of our methods.
Published: 2024

3. PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking

Author: Xie, Yuzhang, Lu, Jiaying, Ho, Joyce, Nahab, Fadi, Hu, Xiao, and Yang, Carl
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Linking (aligning) biomedical concepts across diverse data sources enables various integrative analyses, but it is challenging due to the discrepancies in concept naming conventions. Various strategies have been developed to overcome this challenge, such as those based on string-matching rules, manually crafted thesauri, and machine learning models. However, these methods are constrained by limited prior biomedical knowledge and can hardly generalize beyond the limited amounts of rules, thesauri, or training samples. Recently, large language models (LLMs) have exhibited impressive results in diverse biomedical NLP tasks due to their unprecedentedly rich prior knowledge and strong zero-shot prediction abilities. However, LLMs suffer from issues including high costs, limited context length, and unreliable predictions. In this research, we propose PromptLink, a novel biomedical concept linking framework that leverages LLMs. It first employs a biomedical-specialized pre-trained language model to generate candidate concepts that can fit in the LLM context windows. Then it utilizes an LLM to link concepts through two-stage prompts, where the first-stage prompt aims to elicit the biomedical prior knowledge from the LLM for the concept linking task and the second-stage prompt enforces the LLM to reflect on its own predictions to further enhance their reliability. Empirical results on the concept linking task between two EHR datasets and an external biomedical KG demonstrate the effectiveness of PromptLink. Furthermore, PromptLink is a generic framework without reliance on additional prior knowledge, context, or training data, making it well-suited for concept linking across various types of data sources. The source code is available at https://github.com/constantjxyz/PromptLink.
Published: 2024
Full Text: View/download PDF

4. AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

Author: Pan, Bo, Lu, Jiaying, Wang, Ke, Zheng, Li, Wen, Zhen, Feng, Yingchaojie, Zhu, Minfeng, and Chen, Wei
Subjects: Computer Science - Human-Computer Interaction
Abstract: The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with existing coordination frameworks. This difficulty stems from the inherent ambiguity of natural language for specifying the collaboration process and the significant cognitive effort required to extract crucial information (e.g. agent relationship, task dependency, result correspondence) from a vast amount of text-form content during exploration. In this work, we present a visual exploration framework to facilitate the design of coordination strategies in multi-agent collaboration. We first establish a structured representation for LLM-based multi-agent coordination strategy to regularize the ambiguity of natural language. Based on this structure, we devise a three-stage generation method that leverages LLMs to convert a user's general goal into an executable initial coordination strategy. Users can further intervene at any stage of the generation process, utilizing LLMs and a set of interactions to explore alternative strategies. Whenever a satisfactory strategy is identified, users can commence the collaboration and examine the visually enhanced execution result. We develop AgentCoord, a prototype interactive system, and conduct a formal user study to demonstrate the feasibility and effectiveness of our approach.
Published: 2024

5. LogicPrpBank: A Corpus for Logical Implication and Equivalence

Author: Liu, Zhexiong, Zhang, Jing, Lu, Jiaying, Ma, Wenjing, and Ho, Joyce C
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Logic reasoning has been critically needed in problem-solving and decision-making. Although Language Models (LMs) have demonstrated capabilities of handling multiple reasoning tasks (e.g., commonsense reasoning), their ability to reason complex mathematical problems, specifically propositional logic, remains largely underexplored. This lack of exploration can be attributed to the limited availability of annotated corpora. Here, we present a well-labeled propositional logic corpus, LogicPrpBank, containing 7093 Propositional Logic Statements (PLSs) across six mathematical subjects, to study a brand-new task of reasoning logical implication and equivalence. We benchmark LogicPrpBank with widely-used LMs to show that our corpus offers a useful resource for this challenging task and there is ample room for model improvement., Comment: In the 5th AI4ED Workshop, held in conjunction with The 38th AAAI Conference on Artificial Intelligence, February 2024
Published: 2024

6. AgentLens: Visual Analysis for Agent Behaviors in LLM-based Autonomous Systems

Author: Lu, Jiaying, Pan, Bo, Chen, Jieyi, Feng, Yingchaojie, Hu, Jingyuan, Peng, Yuchen, and Chen, Wei
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence
Abstract: Recently, Large Language Model based Autonomous system(LLMAS) has gained great popularity for its potential to simulate complicated behaviors of human societies. One of its main challenges is to present and analyze the dynamic events evolution of LLMAS. In this work, we present a visualization approach to explore detailed statuses and agents' behavior within LLMAS. We propose a general pipeline that establishes a behavior structure from raw LLMAS execution events, leverages a behavior summarization algorithm to construct a hierarchical summary of the entire structure in terms of time sequence, and a cause trace method to mine the causal relationship between agent behaviors. We then develop AgentLens, a visual analysis system that leverages a hierarchical temporal visualization for illustrating the evolution of LLMAS, and supports users to interactively investigate details and causes of agents' behaviors. Two usage scenarios and a user study demonstrate the effectiveness and usability of our AgentLens.
Published: 2024

7. Evaluation of General Large Language Models in Contextually Assessing Semantic Concepts Extracted from Adult Critical Care Electronic Health Record Notes

Author: Liu, Darren, Ding, Cheng, Bold, Delgersuren, Bouvier, Monique, Lu, Jiaying, Shickel, Benjamin, Jabaley, Craig S., Zhang, Wenhui, Park, Soojin, Young, Michael J., Wainwright, Mark S., Clermont, Gilles, Rashidi, Parisa, Rosenthal, Eric S., Dimisko, Laurie, Xiao, Ran, Yoon, Joo Heung, Yang, Carl, and Hu, Xiao
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Software Engineering
Abstract: The field of healthcare has increasingly turned its focus towards Large Language Models (LLMs) due to their remarkable performance. However, their performance in actual clinical applications has been underexplored. Traditional evaluations based on question-answering tasks don't fully capture the nuanced contexts. This gap highlights the need for more in-depth and practical assessments of LLMs in real-world healthcare settings. Objective: We sought to evaluate the performance of LLMs in the complex clinical context of adult critical care medicine using systematic and comprehensible analytic methods, including clinician annotation and adjudication. Methods: We investigated the performance of three general LLMs in understanding and processing real-world clinical notes. Concepts from 150 clinical notes were identified by MetaMap and then labeled by 9 clinicians. Each LLM's proficiency was evaluated by identifying the temporality and negation of these concepts using different prompts for an in-depth analysis. Results: GPT-4 showed overall superior performance compared to other LLMs. In contrast, both GPT-3.5 and text-davinci-003 exhibit enhanced performance when the appropriate prompting strategies are employed. The GPT family models have demonstrated considerable efficiency, evidenced by their cost-effectiveness and time-saving capabilities. Conclusion: A comprehensive qualitative performance evaluation framework for LLMs is developed and operationalized. This framework goes beyond singular performance aspects. With expert annotations, this methodology not only validates LLMs' capabilities in processing complex medical data but also establishes a benchmark for future LLM evaluations across specialized domains.
Published: 2024

8. Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models

Author: Bai, Guangji, Chai, Zheng, Ling, Chen, Wang, Shiyu, Lu, Jiaying, Zhang, Nan, Shi, Tingwei, Yu, Ziyang, Zhu, Mengdan, Zhang, Yifei, Yang, Carl, Cheng, Yue, and Zhao, Liang
Subjects: Computer Science - Machine Learning
Abstract: The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims to systematically address these challenges by reviewing a broad spectrum of techniques designed to enhance the resource efficiency of LLMs. We categorize methods based on their optimization focus: computational, memory, energy, financial, and network resources and their applicability across various stages of an LLM's lifecycle, including architecture design, pretraining, finetuning, and system design. Additionally, the survey introduces a nuanced categorization of resource efficiency techniques by their specific resource types, which uncovers the intricate relationships and mappings between various resources and corresponding optimization techniques. A standardized set of evaluation metrics and datasets is also presented to facilitate consistent and fair comparisons across different models and techniques. By offering a comprehensive overview of the current sota and identifying open research avenues, this survey serves as a foundational reference for researchers and practitioners, aiding them in developing more sustainable and efficient LLMs in a rapidly evolving landscape., Comment: Preprint. GitHub repo: https://github.com/tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers
Published: 2023

9. Diagnostic and prognostic nomograms for laryngeal carcinoma patients with lung metastasis: a SEER-based study

Author: Qu, Wanxi, Qin, Zhaohui, Cui, Li, Yuan, Shiwang, Yao, Nan, Ma, Ji, Lu, Jiaying, Wang, Jiang, Wang, Minhan, and Yao, Yuanhu
Published: 2024
Full Text: View/download PDF

10. Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models

Author: Lu, Jiaying, Rao, Jinmeng, Chen, Kezhen, Guo, Xiaoyuan, Zhang, Yawen, Sun, Baochen, Yang, Carl, and Yang, Jie
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Large Vision-Language Models (LVLMs) offer remarkable benefits for a variety of vision-language tasks. However, a challenge hindering their application in real-world scenarios, particularly regarding safety, robustness, and reliability, is their constrained semantic grounding ability, which pertains to connecting language to the physical-world entities or concepts referenced in images. Therefore, a crucial need arises for a comprehensive study to assess the semantic grounding ability of widely used LVLMs. Despite the significance, sufficient investigation in this direction is currently lacking. Our work bridges this gap by designing a pipeline for generating large-scale evaluation datasets covering fine-grained semantic information, such as color, number, material, etc., along with a thorough assessment of seven popular LVLMs' semantic grounding ability. Results highlight prevalent misgrounding across various aspects and degrees. To address this issue, we propose a data-centric enhancement method that aims to improve LVLMs' semantic grounding ability through multimodal instruction tuning on fine-grained conversations. Experiments on enhanced LVLMs demonstrate notable improvements in addressing misgrounding issues., Comment: This paper has been accepted to the AAAI'24 Workshop on Responsible Language Models (ReLM 2024)
Published: 2023

11. A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

Author: Yang, Carl, Cui, Hejie, Lu, Jiaying, Wang, Shiyu, Xu, Ran, Ma, Wenjing, Yu, Yue, Yu, Shaojun, Kan, Xuan, Ling, Chen, Fu, Tianfan, Zhao, Liang, Ho, Joyce, and Wang, Fei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: Healthcare knowledge graphs (HKGs) are valuable tools for organizing biomedical concepts and their relationships with interpretable structures. The recent advent of large language models (LLMs) has paved the way for building more comprehensive and accurate HKGs. This, in turn, can improve the reliability of generated content and enable better evaluation of LLMs. However, the challenges of HKGs such as regarding data heterogeneity and limited coverage are not fully understood, highlighting the need for detailed reviews. This work provides the first comprehensive review of HKGs. It summarizes the pipeline and key techniques for HKG construction, as well as the common utilization approaches, i.e., model-free and model-based. The existing HKG resources are also organized based on the data types they capture and application domains they cover, along with relevant statistical information (Resource available at https://github.com/lujiaying/Awesome-HealthCare-KnowledgeBase). At the application level, we delve into the successful integration of HKGs across various health domains, ranging from fine-grained basic science research to high-level clinical decision support and public health. Lastly, the paper highlights the opportunities for HKGs in the era of LLMs. This work aims to serve as a valuable resource for understanding the potential and opportunities of HKG in health research., Comment: 21 pages, preprint submitted to ACM
Published: 2023

12. Effects of brain radiotherapy strategies on survival in the era of MRI for patients with limited stage small cell lung cancer

Author: Yao, Nan, Qin, Zhaohui, Chen, Meng, Hu, Lingling, Ma, Ji, Lu, Jiaying, Tong, Shaodong, Li, Na, and Yao, Yuanhu
Published: 2024
Full Text: View/download PDF

13. Diagnostic performance of artificial intelligence-assisted PET imaging for Parkinson’s disease: a systematic review and meta-analysis

Author: Wang, Jing, Xue, Le, Jiang, Jiehui, Liu, Fengtao, Wu, Ping, Lu, Jiaying, Zhang, Huiwei, Bao, Weiqi, Xu, Qian, Ju, Zizhao, Chen, Li, Jiao, Fangyang, Lin, Huamei, Ge, Jingjie, Zuo, Chuantao, and Tian, Mei
Published: 2024
Full Text: View/download PDF

14. Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

Author: Ling, Chen, Zhao, Xujiang, Lu, Jiaying, Deng, Chengyuan, Zheng, Can, Wang, Junxiang, Chowdhury, Tanmoy, Li, Yun, Cui, Hejie, Zhang, Xuchao, Zhao, Tianjiao, Panalkar, Amit, Mehta, Dhagash, Pasquali, Stefano, Cheng, Wei, Wang, Haoyu, Liu, Yanchi, Chen, Zhengzhang, Chen, Haifeng, White, Chris, Gu, Quanquan, Pei, Jian, Yang, Carl, and Zhao, Liang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of domain objectives, and the diversity of the constraints (e.g., various social norms, cultural conformity, religious beliefs, and ethical standards in the domain applications). Domain specification techniques are key to make large language models disruptive in many applications. Specifically, to solve these hurdles, there has been a notable increase in research and practices conducted in recent years on the domain specialization of LLMs. This emerging field of study, with its substantial potential for impact, necessitates a comprehensive and systematic review to better summarize and guide ongoing work in this area. In this article, we present a comprehensive survey on domain specification techniques for large language models, an emerging direction critical for large language model applications. First, we propose a systematic taxonomy that categorizes the LLM domain-specialization techniques based on the accessibility to LLMs and summarizes the framework for all the subcategories as well as their relations and differences to each other. Second, we present an extensive taxonomy of critical application domains that can benefit dramatically from specialized LLMs, discussing their practical significance and open challenges. Last, we offer our insights into the current research status and future trends in this area.
Published: 2023

15. HiPrompt: Few-Shot Biomedical Knowledge Fusion via Hierarchy-Oriented Prompting

Author: Lu, Jiaying, Shen, Jiaming, Xiong, Bo, Ma, Wenjing, Staab, Steffen, and Yang, Carl
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language
Abstract: Medical decision-making processes can be enhanced by comprehensive biomedical knowledge bases, which require fusing knowledge graphs constructed from different sources via a uniform index system. The index system often organizes biomedical terms in a hierarchy to provide the aligned entities with fine-grained granularity. To address the challenge of scarce supervision in the biomedical knowledge fusion (BKF) task, researchers have proposed various unsupervised methods. However, these methods heavily rely on ad-hoc lexical and structural matching algorithms, which fail to capture the rich semantics conveyed by biomedical entities and terms. Recently, neural embedding models have proved effective in semantic-rich tasks, but they rely on sufficient labeled data to be adequately trained. To bridge the gap between the scarce-labeled BKF and neural embedding models, we propose HiPrompt, a supervision-efficient knowledge fusion framework that elicits the few-shot reasoning ability of large language models through hierarchy-oriented prompts. Empirical results on the collected KG-Hi-BKF benchmark datasets demonstrate the effectiveness of HiPrompt.
Published: 2023
Full Text: View/download PDF

16. MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields

Author: Lu, Jiaying, Qian, Yongchen, Zhao, Shifan, Xi, Yuanzhe, and Yang, Carl
Subjects: Computer Science - Machine Learning
Abstract: Previous research has demonstrated the advantages of integrating data from multiple sources over traditional unimodal data, leading to the emergence of numerous novel multimodal applications. We propose a multimodal classification benchmark MuG with eight datasets that allows researchers to evaluate and improve their models. These datasets are collected from four various genres of games that cover tabular, textual, and visual modalities. We conduct multi-aspect data analysis to provide insights into the benchmark, including label balance ratios, percentages of missing features, distributions of data within each modality, and the correlations between labels and input modalities. We further present experimental results obtained by several state-of-the-art unimodal classifiers and multimodal classifiers, which demonstrate the challenging and multimodal-dependent properties of the benchmark. MuG is released at https://github.com/lujiaying/MUG-Bench with the data, tutorials, and implemented baselines.
Published: 2023
Full Text: View/download PDF

17. MetaAD: Metabolism-Aware Anomaly Detection for Parkinson’s Disease in F-FDG PET

Author: Huang, Haolin, Shen, Zhenrong, Wang, Jing, Wang, Xinyu, Lu, Jiaying, Lin, Huamei, Ge, Jingjie, Zuo, Chuantao, Wang, Qian, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Linguraru, Marius George, editor, Dou, Qi, editor, Feragen, Aasa, editor, Giannarou, Stamatia, editor, Glocker, Ben, editor, Lekadir, Karim, editor, and Schnabel, Julia A., editor
Published: 2024
Full Text: View/download PDF

18. Closed-book Question Generation via Contrastive Learning

Author: Dong, Xiangjue, Lu, Jiaying, Wang, Jianling, and Caverlee, James
Subjects: Computer Science - Computation and Language
Abstract: Question Generation (QG) is a fundamental NLP task for many downstream applications. Recent studies on open-book QG, where supportive answer-context pairs are provided to models, have achieved promising progress. However, generating natural questions under a more practical closed-book setting that lacks these supporting documents still remains a challenge. In this work, we propose a new QG model for this closed-book setting that is designed to better understand the semantics of long-form abstractive answers and store more information in its parameters through contrastive learning and an answer reconstruction module. Through experiments, we validate the proposed QG model on both public datasets and a new WikiCQA dataset. Empirical results show that the proposed QG model outperforms baselines in both automatic evaluation and human evaluation. In addition, we show how to leverage the proposed model to improve existing question-answering systems. These results further indicate the effectiveness of our QG model for enhancing closed-book question-answering tasks., Comment: To appear in EACL 2023
Published: 2022

19. International consensus on clinical use of presynaptic dopaminergic positron emission tomography imaging in parkinsonism

Author: Tian, Mei, Zuo, Chuantao, Cahid Civelek, A., Carrio, Ignasi, Watanabe, Yasuyoshi, Kang, Keon Wook, Murakami, Koji, Prior, John O., Zhong, Yan, Dou, Xiaofeng, Yu, Congcong, Jin, Chentao, Zhou, Rui, Liu, Fengtao, Li, Xinyi, Lu, Jiaying, Zhang, Hong, and Wang, Jian
Published: 2024
Full Text: View/download PDF

20. Influence of Gender on Tau Precipitation in Alzheimer’s Disease According to ATN Research Framework

Author: Zhang, Ying, Lu, Jiaying, Wang, Min, Zuo, Chuantao, and Jiang, Jiehui
Published: 2023
Full Text: View/download PDF

21. How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation

Author: Cui, Hejie, Lu, Jiaying, Ge, Yao, and Yang, Carl
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language, Computer Science - Machine Learning, 68T50, 68T37, 68T01, 68P20, H.3.3, I.7, I.2.7, I.2.6, I.2.4
Abstract: Graph neural networks (GNNs), as a group of powerful tools for representation learning on irregular data, have manifested superiority in various downstream tasks. With unstructured texts represented as concept maps, GNNs can be exploited for tasks like document retrieval. Intrigued by how can GNNs help document retrieval, we conduct an empirical study on a large-scale multi-discipline dataset CORD-19. Results show that instead of the complex structure-oriented GNNs such as GINs and GATs, our proposed semantics-oriented graph functions achieve better and more stable performance based on the BM25 retrieved candidates. Our insights in this case study can serve as a guideline for future work to develop effective GNNs with appropriate semantics-oriented inductive biases for textual reasoning tasks like document retrieval and classification. All code for this case study is available at https://github.com/HennyJie/GNN-DocRetrieval., Comment: This paper has been accepted to the 44th European Conference on Information Retrieval (ECIR) 2022
Published: 2022

22. HammingVis: A visual analytics approach for understanding erroneous outcomes of quantum computing in hamming space

Author: Chen, Jieyi, Wen, Zhen, Zheng, Li, Lu, Jiaying, Lu, Hui, Ren, Yiwen, and Chen, Wei
Published: 2024
Full Text: View/download PDF

23. Weakly Supervised Concept Map Generation through Task-Guided Graph Translation

Author: Lu, Jiaying, Dong, Xiangjue, and Yang, Carl
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Recent years have witnessed the rapid development of concept map generation techniques due to their advantages in providing well-structured summarization of knowledge from free texts. Traditional unsupervised methods do not generate task-oriented concept maps, whereas deep generative models require large amounts of training data. In this work, we present GT-D2G (Graph Translation-based Document To Graph), an automatic concept map generation framework that leverages generalized NLP pipelines to derive semantic-rich initial graphs, and translates them into more concise structures under the weak supervision of downstream task labels. The concept maps generated by GT-D2G can provide interpretable summarization of structured knowledge for the input texts, which are demonstrated through human evaluation and case studies on three real-world corpora. Further experiments on the downstream task of document classification show that GT-D2G beats other concept map generation methods. Moreover, we specifically validate the labeling efficiency of GT-D2G in the label-efficient learning setting and the flexibility of generated graph sizes in controlled hyper-parameter studies., Comment: Accepted by IEEE TKDE. All code and data available at https://github.com/lujiaying/GT-doc2graph
Published: 2021
Full Text: View/download PDF

24. Turbulence-assisted shear controllable synthesis of silicon oxide micro/nano particles using a counter axial-swirling impinging jet flow reactor

Author: Lu, Jiaying, Guo, Yanqing, Dong, Bin, Yang, Xiaogang, and Li, Jiusheng
Published: 2024
Full Text: View/download PDF

25. Head-to-head comparison of plasma and PET imaging ATN markers in subjects with cognitive complaints

Author: Lu, Jiaying, Ma, Xiaoxi, Zhang, Huiwei, Xiao, Zhenxu, Li, Ming, Wu, Jie, Ju, Zizhao, Chen, Li, Zheng, Li, Ge, Jingjie, Liang, Xiaoniu, Bao, Weiqi, Wu, Ping, Ding, Ding, Yen, Tzu-Chen, Guan, Yihui, Zuo, Chuantao, and Zhao, Qianhua
Published: 2023
Full Text: View/download PDF

26. Cellcano: supervised cell type identification for single cell ATAC-seq data

Author: Ma, Wenjing, Lu, Jiaying, and Wu, Hao
Published: 2023
Full Text: View/download PDF

27. Observational evidence for detrimental impact of inhaled ozone on human respiratory system

Author: Lu, Jiaying and Yao, Ling
Published: 2023
Full Text: View/download PDF

28. International Nuclear Medicine Consensus on the Clinical Use of Amyloid Positron Emission Tomography in Alzheimer’s Disease

Author: Tian, Mei, Zuo, Chuantao, Civelek, Ali Cahid, Carrio, Ignasi, Watanabe, Yasuyoshi, Kang, Keon Wook, Murakami, Koji, Garibotto, Valentina, Prior, John O., Barthel, Henryk, Guan, Yihui, Lu, Jiaying, Zhou, Rui, Jin, Chentao, Wu, Shuang, Zhang, Xiaohui, Zhong, Yan, and Zhang, Hong
Published: 2023
Full Text: View/download PDF

29. Feasibility of 18F-florzolotau quantification in patients with Alzheimer’s disease based on an MRI-free tau PET template

Author: Lu, Jiaying, Ju, Zizhao, Wang, Min, Sun, Xun, Jia, Chenhao, Li, Ling, Bao, Weiqi, Zhang, Huiwei, Jiao, Fangyang, Lin, Huamei, Yen, Tzu-Chen, Cui, Ruixue, Lan, Xiaoli, Zhao, Qianhua, Guan, Yihui, and Zuo, Chuantao
Published: 2023
Full Text: View/download PDF

30. Detection of individual brain tau deposition in Alzheimer's disease based on latent feature-enhanced generative adversarial network

Author: Jiang, Jiehui, Shi, Rong, Lu, Jiaying, Wang, Min, Zhang, Qi, Zhang, Shuoyan, Wang, Luyao, Alberts, Ian, Rominger, Axel, Zuo, Chuantao, and Shi, Kuangyu
Published: 2024
Full Text: View/download PDF

31. Evaluation of Unsupervised Entity and Event Salience Estimation

Author: Lu, Jiaying and Choi, Jinho D.
Subjects: Computer Science - Computation and Language
Abstract: Salience Estimation aims to predict term importance in documents. Due to few existing human-annotated datasets and the subjective notion of salience, previous studies typically generate pseudo-ground truth for evaluation. However, our investigation reveals that the evaluation protocol proposed by prior work is difficult to replicate, thus leading to few follow-up studies existing. Moreover, the evaluation process is problematic: the entity linking tool used for entity matching is very noisy, while the ignorance of event argument for event evaluation leads to boosted performance. In this work, we propose a light yet practical entity and event salience estimation evaluation protocol, which incorporates the more reliable syntactic dependency parser. Furthermore, we conduct a comprehensive analysis among popular entity and event definition standards, and present our own definition for the Salience Estimation task to reduce noise during the pseudo-ground truth generation process. Furthermore, we construct dependency-based heterogeneous graphs to capture the interactions of entities and events. The empirical results show that both baseline methods and the novel GNN method utilizing the heterogeneous graph consistently outperform the previous SOTA model in all proposed metrics.
Published: 2021

32. Colocalization of Increased Midbrain Signals in Neuroinflammation and Tau PET Imaging Suggests the Diagnosis of Progressive Supranuclear Palsy

Author: Lu, Jiaying, Ge, Jingjie, Yu, Hai, Zhao, Guixian, and Chen, Xiangjun
Published: 2024
Full Text: View/download PDF

33. MetaViT: Metabolism-Aware Vision Transformer for Differential Diagnosis of Parkinsonism with F-FDG PET

Author: Zhao, Lin, Dong, Hexin, Wu, Ping, Lu, Jiaying, Lu, Le, Zhou, Jingren, Liu, Tianming, Zhang, Li, Zhang, Ling, Tang, Yuxing, Zuo, Chuantao, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Frangi, Alejandro, editor, de Bruijne, Marleen, editor, Wassermann, Demian, editor, and Navab, Nassir, editor
Published: 2023
Full Text: View/download PDF

34. Uncovering distinct progression patterns of tau deposition in progressive supranuclear palsy using [18F]Florzolotau PET imaging and subtype/stage inference algorithm

Author: Wang, Jian, Liu, Fengtao, Zuo, Chuantao, Wu, Jianjun, Sun, Yimin, Wu, Ping, Tang, Yilin, Zhao, Jue, Wu, Bin, Shen, Bo, Lu, Jiaying, Zhou, Xinyue, Li, Xinyi, Zhang, Huiwei, Ge, Jingjie, Chen, Minjia, Ju, Zizhao, Hong, Jimin, Wang, Min, Clement, Christoph, Lopes, Leonor, Brendel, Matthias, Rominger, Axel, Yen, Tzu-Chen, Guan, Yihui, Tian, Mei, and Shi, Kuangyu
Published: 2023
Full Text: View/download PDF

35. Adjustment for the Age- and Gender-Related Metabolic Changes Improves the Differential Diagnosis of Parkinsonism

Author: Lu, Jiaying, Wang, Min, Wu, Ping, Yakushev, Igor, Zhang, Huiwei, Ziegler, Sibylle, Jiang, Jiehui, Förster, Stefan, Wang, Jian, Schwaiger, Markus, Rominger, Axel, Huang, Sung-Cheng, Liu, Fengtao, Zuo, Chuantao, and Shi, Kuangyu
Published: 2023
Full Text: View/download PDF

36. Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning

Author: Lu, Jiaying, Ye, Xin, Ren, Yi, and Yang, Yezhou
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Multiple-choice VQA has drawn increasing attention from researchers and end-users recently. As the demand for automatically constructing large-scale multiple-choice VQA data grows, we introduce a novel task called textual Distractors Generation for VQA (DG-VQA) focusing on generating challenging yet meaningful distractors given the context image, question, and correct answer. The DG-VQA task aims at generating distractors without ground-truth training samples since such resources are rarely available. To tackle the DG-VQA unsupervisedly, we propose Gobbet, a reinforcement learning(RL) based framework that utilizes pre-trained VQA models as an alternative knowledge base to guide the distractor generation process. In Gobbet, a pre-trained VQA model serves as the environment in RL setting to provide feedback for the input multi-modal query, while a neural distractor generator serves as the agent to take actions accordingly. We propose to use existing VQA models' performance degradation as indicators of the quality of generated distractors. On the other hand, we show the utility of generated distractors through data augmentation experiments, since robustness is more and more important when AI models apply to unpredictable open-domain scenarios or security-sensitive applications. We further conduct a manual case study on the factors why distractors generated by Gobbet can fool existing models.
Published: 2019

37. Improved interpretation of 18F-florzolotau PET in progressive supranuclear palsy using a normalization-free deep-learning classifier

Author: Lu, Jiaying, Clement, Christoph, Hong, Jimin, Wang, Min, Li, Xinyi, Cavinato, Lara, Yen, Tzu-Chen, Jiao, Fangyang, Wu, Ping, Wu, Jianjun, Ge, Jingjie, Sun, Yimin, Brendel, Matthias, Lopes, Leonor, Rominger, Axel, Wang, Jian, Liu, Fengtao, Zuo, Chuantao, Guan, Yihui, Zhao, Qianhua, and Shi, Kuangyu
Published: 2023
Full Text: View/download PDF

38. High-dose versus standard-dose radiotherapy in concurrent chemoradiotherapy for inoperable esophageal cancer: A systematic review and meta-analysis

Author: Yao, Yuanhu, Lu, Jiaying, Qin, Zhaohui, Li, Na, Ma, Ji, Yao, Nan, Qu, Wanxi, Cui, Li, Yuan, Shiwang, Jiang, Aijun, and Liu, Xiaoxiao
Published: 2023
Full Text: View/download PDF

39. Satellite-derived aridity index reveals China's drying in recent two decades

Author: Yao, Ling, Lu, Jiaying, Jiang, Hou, Liu, Tang, Qin, Jun, and Zhou, Chenghu
Published: 2023
Full Text: View/download PDF

40. The heterogeneity of asymmetric tau distribution is associated with an early age at onset and poor prognosis in Alzheimer’s disease

Author: Lu, Jiaying, Zhang, Zhengwei, Wu, Ping, Liang, Xiaoniu, Zhang, Huiwei, Hong, Jimin, Clement, Christoph, Yen, Tzu-Chen, Ding, Saineng, Wang, Min, Xiao, Zhenxu, Rominger, Axel, Shi, Kuangyu, Guan, Yihui, Zuo, Chuantao, and Zhao, Qianhua
Published: 2023
Full Text: View/download PDF

41. Sensitive, Semiquantitative, and Portable Nucleic Acid Detection of Rabies Virus Using a Personal Glucose Meter

Author: Lu, Jiaying, primary, Bai, Yujie, additional, Wang, Xuejin, additional, Huang, Pei, additional, Liu, Meihui, additional, Wang, Ruijia, additional, Zhang, Haili, additional, Wang, Hualei, additional, and Li, Yuanyuan, additional
Published: 2024
Full Text: View/download PDF

42. Decoding the dopamine transporter imaging for the differential diagnosis of parkinsonism using deep learning

Author: Zhao, Yu, Wu, Ping, Wu, Jianjun, Brendel, Matthias, Lu, Jiaying, Ge, Jingjie, Tang, Chunmeng, Hong, Jimin, Xu, Qian, Liu, Fengtao, Sun, Yimin, Ju, Zizhao, Lin, Huamei, Guan, Yihui, Bassetti, Claudio, Schwaiger, Markus, Huang, Sung-Cheng, Rominger, Axel, Wang, Jian, Zuo, Chuantao, and Shi, Kuangyu
Published: 2022
Full Text: View/download PDF

43. Image-level trajectory inference of tau pathology using variational autoencoder for Flortaucipir PET

Author: Hong, Jimin, Kang, Seung Kwan, Alberts, Ian, Lu, Jiaying, Sznitman, Raphael, Lee, Jae Sung, Rominger, Axel, Choi, Hongyoon, and Shi, Kuangyu
Published: 2022
Full Text: View/download PDF

44. Dual-Model Radiomic Biomarkers Predict Development of Mild Cognitive Impairment Progression to Alzheimer’s Disease

Author: Zhou, Hucheng, Jiang, Jiehui, Lu, Jiaying, Wang, Min, Zhang, Huiwei, Zuo, Chuantao, and Initiative, Alzheimer’s Disease Neuroimaging
Subjects: Biomedical and Clinical Sciences, Clinical Sciences, Bioengineering, Biomedical Imaging, Aging, Alzheimer's Disease, Brain Disorders, Alzheimer's Disease including Alzheimer's Disease Related Dementias (AD/ADRD), Clinical Research, Neurodegenerative, Neurosciences, Acquired Cognitive Impairment, Dementia, 4.2 Evaluation of markers and technologies, 4.1 Discovery and preclinical testing of markers and technologies, Neurological, Good Health and Well Being, Alzheimer's disease, mild cognitive impairment, radiomics, image fusion, Cox model, Alzheimer’s Disease Neuroimaging Initiative, Alzheimer’s disease, Psychology, Cognitive Sciences, Biological psychology
Abstract: Predicting progression of mild cognitive impairment (MCI) to Alzheimer's disease (AD) is clinically important. In this study, we propose a dual-model radiomic analysis with multivariate Cox proportional hazards regression models to investigate promising risk factors associated with MCI conversion to AD. T1 structural magnetic resonance imaging (MRI) and 18F-Fluorodeoxyglucose (FDG) positron emission tomography (PET) data, from the AD Neuroimaging Initiative database, were collected from 131 patients with MCI who converted to AD within 3 years and 132 patients with MCI without conversion within 3 years. These subjects were randomly partition into 70% training dataset and 30% test dataset with multiple times. We fused MRI and PET images by wavelet method. In a subset of subjects, a group comparison was performed using a two-sample t-test to determine regions of interest (ROIs) associated with MCI conversion. 172 radiomic features from ROIs for each individual were established using a published radiomics tool. Finally, L1-penalized Cox model was constructed and Harrell's C index (C-index) was used to evaluate prediction accuracy of the model. To evaluate the efficacy of our proposed method, we used a same analysis framework to evaluate MRI and PET data separately. We constructed prognostic Cox models with: clinical data, MRI images, PET images, fused MRI/PET images, and clinical variables and fused MRI/PET images in combination. The experimental results showed that captured ROIs significantly associated with conversion to AD, such as gray matter atrophy in the bilateral hippocampus and hypometabolism in the temporoparietal cortex. Imaging model (MRI/PET/fused) provided significant enhancement in prediction of conversion compared to clinical models, especially the fused-modality Cox model. Moreover, the combination of fused-modality imaging and clinical variables resulted in the greatest accuracy of prediction. The average C-index for the clinical/MRI/PET/fused/combined model in the test dataset was 0.69, 0.73, 0.73 and 0.75, and 0.78, respectively. These results suggested that a combination of radiomic analysis and Cox model analyses could be used successfully in survival analysis and may be powerful tools for personalized precision medicine patients with potential to undergo conversion from MCI to AD.
Published: 2019

45. The impact of probable rapid eye movement sleep behavior disorder on Parkinson's disease: A dual-tracer PET imaging study

Author: Xu, Qian, Jiang, Chengfeng, Ge, Jingjie, Lu, Jiaying, Li, Ling, Yu, Huan, Wu, Jianjun, Wang, Jian, Wu, Ping, and Zuo, Chuantao
Published: 2022
Full Text: View/download PDF

46. Optimized Cingulate Island Sign in Discriminating Dementia With Lewy Bodies From Alzheimer Disease

Author: Ge, Jingjie, Lin, Huamei, Chen, Keliang, Wang, Min, He, Zhijie, Lu, Jiaying, Ju, Zizhao, Sun, Yimin, Liu, Fengtao, Guan, Yihui, Zhao, Qianhua, Zuo, Chuantao, and Wu, Ping
Published: 2023
Full Text: View/download PDF

47. Observed causative impact of fine particulate matter on acute upper respiratory disease: a comparative study in two typical cities in China

Author: Xia, Xiaolin, Yao, Ling, Lu, Jiaying, Liu, Yangxiaoyue, Jing, Wenlong, and Li, Yong
Published: 2022
Full Text: View/download PDF

48. Dual-Model Radiomic Biomarkers Predict Development of Mild Cognitive Impairment Progression to Alzheimer's Disease.

Author: Zhou, Hucheng, Jiang, Jiehui, Lu, Jiaying, Wang, Min, Zhang, Huiwei, Zuo, Chuantao, and Alzheimer’s Disease Neuroimaging Initiative
Subjects: Alzheimer’s Disease Neuroimaging Initiative, Alzheimer’s disease, Cox model, image fusion, mild cognitive impairment, radiomics, Alzheimer's disease, Acquired Cognitive Impairment, Aging, Neurosciences, Biomedical Imaging, Bioengineering, Prevention, Alzheimer's Disease, Neurodegenerative, Brain Disorders, Clinical Research, Dementia, Alzheimer's Disease including Alzheimer's Disease Related Dementias (AD/ADRD), 4.1 Discovery and preclinical testing of markers and technologies, 4.2 Evaluation of markers and technologies, Neurological, Psychology, Cognitive Sciences
Abstract: Predicting progression of mild cognitive impairment (MCI) to Alzheimer's disease (AD) is clinically important. In this study, we propose a dual-model radiomic analysis with multivariate Cox proportional hazards regression models to investigate promising risk factors associated with MCI conversion to AD. T1 structural magnetic resonance imaging (MRI) and 18F-Fluorodeoxyglucose (FDG) positron emission tomography (PET) data, from the AD Neuroimaging Initiative database, were collected from 131 patients with MCI who converted to AD within 3 years and 132 patients with MCI without conversion within 3 years. These subjects were randomly partition into 70% training dataset and 30% test dataset with multiple times. We fused MRI and PET images by wavelet method. In a subset of subjects, a group comparison was performed using a two-sample t-test to determine regions of interest (ROIs) associated with MCI conversion. 172 radiomic features from ROIs for each individual were established using a published radiomics tool. Finally, L1-penalized Cox model was constructed and Harrell's C index (C-index) was used to evaluate prediction accuracy of the model. To evaluate the efficacy of our proposed method, we used a same analysis framework to evaluate MRI and PET data separately. We constructed prognostic Cox models with: clinical data, MRI images, PET images, fused MRI/PET images, and clinical variables and fused MRI/PET images in combination. The experimental results showed that captured ROIs significantly associated with conversion to AD, such as gray matter atrophy in the bilateral hippocampus and hypometabolism in the temporoparietal cortex. Imaging model (MRI/PET/fused) provided significant enhancement in prediction of conversion compared to clinical models, especially the fused-modality Cox model. Moreover, the combination of fused-modality imaging and clinical variables resulted in the greatest accuracy of prediction. The average C-index for the clinical/MRI/PET/fused/combined model in the test dataset was 0.69, 0.73, 0.73 and 0.75, and 0.78, respectively. These results suggested that a combination of radiomic analysis and Cox model analyses could be used successfully in survival analysis and may be powerful tools for personalized precision medicine patients with potential to undergo conversion from MCI to AD.
Published: 2018

49. Hollow-structured amorphous prussian blue decorated on graphitic carbon nitride for photo-assisted activation of peroxymonosulfate

Author: Lu, Jiaying, Chen, Chaofa, Qian, Mengying, Xiao, Peiyuan, Ge, Peng, Shen, Cailiang, Wu, Xi-Lin, and Chen, Jianrong
Published: 2021
Full Text: View/download PDF

50. Uncertainty-Aware Pre-Trained Foundation Models for Patient Risk Prediction via Gaussian Process

Author: Lu, Jiaying, primary, Zhao, Shifan, additional, Ma, Wenjing, additional, Shao, Hui, additional, Hu, Xiao, additional, Xi, Yuanzhe, additional, and Yang, Carl, additional
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

417 results on '"Lu, Jiaying"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources