Author: "Lu, Zhiyong" / Topic: deep learning - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Lu, Zhiyong"' showing total 38 results

Start Over Author "Lu, Zhiyong" Topic deep learning

38 results on '"Lu, Zhiyong"'

1. Advancing entity recognition in biomedicine via instruction tuning of large language models.

Author: Keloth VK, Hu Y, Xie Q, Peng X, Wang Y, Zheng A, Selek M, Raja K, Wei CH, Jin Q, Lu Z, Chen Q, and Xu H
Subjects: Animals, Language, Natural Language Processing, Camelids, New World, Deep Learning
Abstract: Motivation: Large Language Models (LLMs) have the potential to revolutionize the field of Natural Language Processing, excelling not only in text generation and reasoning tasks but also in their ability for zero/few-shot learning, swiftly adapting to new tasks with minimal fine-tuning. LLMs have also demonstrated great promise in biomedical and healthcare applications. However, when it comes to Named Entity Recognition (NER), particularly within the biomedical domain, LLMs fall short of the effectiveness exhibited by fine-tuned domain-specific models. One key reason is that NER is typically conceptualized as a sequence labeling task, whereas LLMs are optimized for text generation and reasoning tasks., Results: We developed an instruction-based learning paradigm that transforms biomedical NER from a sequence labeling task into a generation task. This paradigm is end-to-end and streamlines the training and evaluation process by automatically repurposing pre-existing biomedical NER datasets. We further developed BioNER-LLaMA using the proposed paradigm with LLaMA-7B as the foundational LLM. We conducted extensive testing on BioNER-LLaMA across three widely recognized biomedical NER datasets, consisting of entities related to diseases, chemicals, and genes. The results revealed that BioNER-LLaMA consistently achieved higher F1-scores ranging from 5% to 30% compared to the few-shot learning capabilities of GPT-4 on datasets with different biomedical entities. We show that a general-domain LLM can match the performance of rigorously fine-tuned PubMedBERT models and PMC-LLaMA, biomedical-specific language model. Our findings underscore the potential of our proposed paradigm in developing general-domain LLMs that can rival SOTA performances in multi-task, multi-domain scenarios in biomedical and health applications., Availability and Implementation: Datasets and other resources are available at https://github.com/BIDS-Xu-Lab/BioNER-LLaMA., (© The Author(s) 2024. Published by Oxford University Press.)
Published: 2024
Full Text: View/download PDF

2. A scoping review on multimodal deep learning in biomedical images and texts.

Author: Sun Z, Lin M, Zhu Q, Xie Q, Wang F, Lu Z, and Peng Y
Subjects: Diagnostic Imaging, Semantics, Natural Language Processing, Diagnosis, Computer-Assisted, Deep Learning
Abstract: Objective: Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and text, has the potential to revolutionize the analysis and interpretation of biomedical data. However, it only caught researchers' attention recently. To this end, there is a critical need to conduct a systematic review on this topic, identify the limitations of current work, and explore future directions., Methods: In this scoping review, we aim to provide a comprehensive overview of the current state of the field and identify key concepts, types of studies, and research gaps with a focus on biomedical images and texts joint learning, mainly because these two were the most commonly available data types in MDL research., Result: This study reviewed the current uses of multimodal deep learning on five tasks: (1) Report generation, (2) Visual question answering, (3) Cross-modal retrieval, (4) Computer-aided diagnosis, and (5) Semantic segmentation., Conclusion: Our results highlight the diverse applications and potential of MDL and suggest directions for future research in the field. We hope our review will facilitate the collaboration of natural language processing (NLP) and medical imaging communities and support the next generation of decision-making and computer-assisted diagnostic system development., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 Elsevier Inc. All rights reserved.)
Published: 2023
Full Text: View/download PDF

3. AIONER: all-in-one scheme-based biomedical named entity recognition using deep learning.

Author: Luo L, Wei CH, Lai PT, Leaman R, Chen Q, and Lu Z
Subjects: Data Mining methods, Software, Language, PubMed, Deep Learning
Abstract: Motivation: Biomedical named entity recognition (BioNER) seeks to automatically recognize biomedical entities in natural language text, serving as a necessary foundation for downstream text mining tasks and applications such as information extraction and question answering. Manually labeling training data for the BioNER task is costly, however, due to the significant domain expertise required for accurate annotation. The resulting data scarcity causes current BioNER approaches to be prone to overfitting, to suffer from limited generalizability, and to address a single entity type at a time (e.g. gene or disease)., Results: We therefore propose a novel all-in-one (AIO) scheme that uses external data from existing annotated resources to enhance the accuracy and stability of BioNER models. We further present AIONER, a general-purpose BioNER tool based on cutting-edge deep learning and our AIO schema. We evaluate AIONER on 14 BioNER benchmark tasks and show that AIONER is effective, robust, and compares favorably to other state-of-the-art approaches such as multi-task learning. We further demonstrate the practical utility of AIONER in three independent tasks to recognize entity types not previously seen in training data, as well as the advantages of AIONER over existing methods for processing biomedical text at a large scale (e.g. the entire PubMed data)., Availability and Implementation: The source code, trained models and data for AIONER are freely available at https://github.com/ncbi/AIONER., (Published by Oxford University Press 2023.)
Published: 2023
Full Text: View/download PDF

4. DeepLensNet: Deep Learning Automated Diagnosis and Quantitative Classification of Cataract Type and Severity.

Author: Keenan TDL, Chen Q, Agrón E, Tham YC, Goh JHL, Lei X, Ng YP, Liu Y, Xu X, Cheng CY, Bikbov MM, Jonas JB, Bhandari S, Broadhead GK, Colyer MH, Corsini J, Cousineau-Krieger C, Gensheimer W, Grasic D, Lamba T, Magone MT, Maiberger M, Oshinsky A, Purt B, Shin SY, Thavikulwat AT, Lu Z, and Chew EY
Subjects: Humans, Photography, Cataract diagnosis, Cataract Extraction, Deep Learning
Abstract: Purpose: To develop deep learning models to perform automated diagnosis and quantitative classification of age-related cataract from anterior segment photographs., Design: DeepLensNet was trained by applying deep learning models to the Age-Related Eye Disease Study (AREDS) dataset., Participants: A total of 18 999 photographs (6333 triplets) from longitudinal follow-up of 1137 eyes (576 AREDS participants)., Methods: Deep learning models were trained to detect and quantify nuclear sclerosis (NS; scale 0.9-7.1) from 45-degree slit-lamp photographs and cortical lens opacity (CLO; scale 0%-100%) and posterior subcapsular cataract (PSC; scale 0%-100%) from retroillumination photographs. DeepLensNet performance was compared with that of 14 ophthalmologists and 24 medical students., Main Outcome Measures: Mean squared error (MSE)., Results: On the full test set, mean MSE for DeepLensNet was 0.23 (standard deviation [SD], 0.01) for NS, 13.1 (SD, 1.6) for CLO, and 16.6 (SD, 2.4) for PSC. On a subset of the test set (substantially enriched for positive cases of CLO and PSC), for NS, mean MSE for DeepLensNet was 0.23 (SD, 0.02), compared with 0.98 (SD, 0.24; P = 0.000001) for the ophthalmologists and 1.24 (SD, 0.34; P = 0.000005) for the medical students. For CLO, mean MSE was 53.5 (SD, 14.8), compared with 134.9 (SD, 89.9; P = 0.003) for the ophthalmologists and 433.6 (SD, 962.1; P = 0.0007) for the medical students. For PSC, mean MSE was 171.9 (SD, 38.9), compared with 176.8 (SD, 98.0; P = 0.67) for the ophthalmologists and 398.2 (SD, 645.4; P = 0.18) for the medical students. In external validation on the Singapore Malay Eye Study (sampled to reflect the cataract severity distribution in AREDS), the MSE for DeepSeeNet was 1.27 for NS and 25.5 for PSC., Conclusions: DeepLensNet performed automated and quantitative classification of cataract severity for all 3 types of age-related cataract. For the 2 most common types (NS and CLO), the accuracy was significantly superior to that of ophthalmologists; for the least common type (PSC), it was similar. DeepLensNet may have wide potential applications in both clinical and research domains. In the future, such approaches may increase the accessibility of cataract assessment globally. The code and models are available at https://github.com/ncbi/deeplensnet., (Copyright © 2022. Published by Elsevier Inc.)
Published: 2022
Full Text: View/download PDF

5. Detecting visually significant cataract using retinal photograph-based deep learning.

Author: Tham YC, Goh JHL, Anees A, Lei X, Rim TH, Chee ML, Wang YX, Jonas JB, Thakur S, Teo ZL, Cheung N, Hamzah H, Tan GSW, Husain R, Sabanayagam C, Wang JJ, Chen Q, Lu Z, Keenan TD, Chew EY, Tan AG, Mitchell P, Goh RSM, Xu X, Liu Y, Wong TY, and Cheng CY
Subjects: Humans, Aged, Retina diagnostic imaging, ROC Curve, Algorithms, Deep Learning, Cataract diagnosis
Abstract: Age-related cataracts are the leading cause of visual impairment among older adults. Many significant cases remain undiagnosed or neglected in communities, due to limited availability or accessibility to cataract screening. In the present study, we report the development and validation of a retinal photograph-based, deep-learning algorithm for automated detection of visually significant cataracts, using more than 25,000 images from population-based studies. In the internal test set, the area under the receiver operating characteristic curve (AUROC) was 96.6%. External testing performed across three studies showed AUROCs of 91.6-96.5%. In a separate test set of 186 eyes, we further compared the algorithm's performance with 4 ophthalmologists' evaluations. The algorithm performed comparably, if not being slightly more superior (sensitivity of 93.3% versus 51.7-96.6% by ophthalmologists and specificity of 99.0% versus 90.7-97.9% by ophthalmologists). Our findings show the potential of a retinal photograph-based screening tool for visually significant cataracts among older adults, providing more appropriate referrals to tertiary eye centers., (© 2022. The Author(s).)
Published: 2022
Full Text: View/download PDF

6. Multi-task deep learning-based survival analysis on the prognosis of late AMD using the longitudinal data in AREDS.

Author: Ghahramani G, Brendel M, Lin M, Chen Q, Keenan T, Chen K, Chew E, Lu Z, Peng Y, and Wang F
Subjects: Disease Progression, Fundus Oculi, Humans, Prognosis, Survival Analysis, Deep Learning, Macular Degeneration diagnostic imaging
Abstract: Age-related macular degeneration (AMD) is the leading cause of vision loss. Some patients experience vision loss over a delayed timeframe, others at a rapid pace. Physicians analyze time-of-visit fundus photographs to predict patient risk of developing late-AMD, the most severe form of AMD. Our study hypothesizes that 1) incorporating historical data improves predictive strength of developing late-AMD and 2) state-of-the-art deep-learning techniques extract more predictive image features than clinicians do. We incorporate longitudinal data from the Age-Related Eye Disease Studies and deep-learning extracted image features in survival settings to predict development of late- AMD. To extract image features, we used multi-task learning frameworks to train convolutional neural networks. Our findings show 1) incorporating longitudinal data improves prediction of late-AMD for clinical standard features, but only the current visit is informative when using complex features and 2) "deep-features" are more informative than clinician derived features. We make codes publicly available at https://github.com/bionlplab/AMD_prognosis_amia2021., (©2021 AMIA - All rights reserved.)
Published: 2022

7. Multimodal, multitask, multiattention (M3) deep learning detection of reticular pseudodrusen: Toward automated and accessible classification of age-related macular degeneration.

Author: Chen Q, Keenan TDL, Allot A, Peng Y, Agrón E, Domalpally A, Klaver CCW, Luttikhuizen DT, Colyer MH, Cukras CA, Wiley HE, Teresa Magone M, Cousineau-Krieger C, Wong WT, Zhu Y, Chew EY, and Lu Z
Subjects: Aged, Computer Simulation, Datasets as Topic, Female, Fundus Oculi, Humans, Macular Degeneration diagnosis, Male, Deep Learning, Diagnosis, Computer-Assisted, Retinal Drusen diagnosis
Abstract: Objective: Reticular pseudodrusen (RPD), a key feature of age-related macular degeneration (AMD), are poorly detected by human experts on standard color fundus photography (CFP) and typically require advanced imaging modalities such as fundus autofluorescence (FAF). The objective was to develop and evaluate the performance of a novel multimodal, multitask, multiattention (M3) deep learning framework on RPD detection., Materials and Methods: A deep learning framework (M3) was developed to detect RPD presence accurately using CFP alone, FAF alone, or both, employing >8000 CFP-FAF image pairs obtained prospectively (Age-Related Eye Disease Study 2). The M3 framework includes multimodal (detection from single or multiple image modalities), multitask (training different tasks simultaneously to improve generalizability), and multiattention (improving ensembled feature representation) operation. Performance on RPD detection was compared with state-of-the-art deep learning models and 13 ophthalmologists; performance on detection of 2 other AMD features (geographic atrophy and pigmentary abnormalities) was also evaluated., Results: For RPD detection, M3 achieved an area under the receiver-operating characteristic curve (AUROC) of 0.832, 0.931, and 0.933 for CFP alone, FAF alone, and both, respectively. M3 performance on CFP was very substantially superior to human retinal specialists (median F1 score = 0.644 vs 0.350). External validation (the Rotterdam Study) demonstrated high accuracy on CFP alone (AUROC, 0.965). The M3 framework also accurately detected geographic atrophy and pigmentary abnormalities (AUROC, 0.909 and 0.912, respectively), demonstrating its generalizability., Conclusions: This study demonstrates the successful development, robust evaluation, and external validation of a novel deep learning framework that enables accessible, accurate, and automated AMD diagnosis and prognosis., (Published by Oxford University Press on behalf of the American Medical Informatics Association 2021. This work is written by US Government employees and is in the public domain in the US.)
Published: 2021
Full Text: View/download PDF

8. Deep Learning Automated Detection of Reticular Pseudodrusen from Fundus Autofluorescence Images or Color Fundus Photographs in AREDS2.

Author: Keenan TDL, Chen Q, Peng Y, Domalpally A, Agrón E, Hwang CK, Thavikulwat AT, Lee DH, Li D, Wong WT, Lu Z, and Chew EY
Subjects: Aged, Aged, 80 and over, Area Under Curve, Datasets as Topic, Female, Humans, Macular Degeneration, Male, Middle Aged, Ophthalmologists, ROC Curve, Reproducibility of Results, Sensitivity and Specificity, Deep Learning, Fluorescein Angiography, Optical Imaging, Retinal Drusen diagnostic imaging
Abstract: Purpose: To develop deep learning models for detecting reticular pseudodrusen (RPD) using fundus autofluorescence (FAF) images or, alternatively, color fundus photographs (CFP) in the context of age-related macular degeneration (AMD)., Design: Application of deep learning models to the Age-Related Eye Disease Study 2 (AREDS2) dataset., Participants: FAF and CFP images (n = 11 535) from 2450 AREDS2 participants. Gold standard labels from reading center grading of the FAF images were transferred to the corresponding CFP images., Methods: A deep learning model was trained to detect RPD in eyes with intermediate to late AMD using FAF images (FAF model). Using label transfer from FAF to CFP images, a deep learning model was trained to detect RPD from CFP (CFP model). Performance was compared with 4 ophthalmologists using a random subset from the full test set., Main Outcome Measures: Area under the receiver operating characteristic curve (AUC), κ value, accuracy, and F1 score., Results: The FAF model had an AUC of 0.939 (95% confidence interval [CI], 0.927-0.950), a κ value of 0.718 (95% CI, 0.685-0.751), and accuracy of 0.899 (95% CI, 0.887-0.911). The CFP model showed equivalent values of 0.832 (95% CI, 0.812-0.851), 0.470 (95% CI, 0.426-0.511), and 0.809 (95% CI, 0.793-0.825), respectively. The FAF model demonstrated superior performance to 4 ophthalmologists, showing a higher κ value of 0.789 (95% CI, 0.675-0.875) versus a range of 0.367 to 0.756 and higher accuracy of 0.937 (95% CI, 0.907-0.963) versus a range of 0.696 to 0.933. The CFP model demonstrated substantially superior performance to 4 ophthalmologists, showing a higher κ value of 0.471 (95% CI, 0.330-0.606) versus a range of 0.105 to 0.180 and higher accuracy of 0.844 (95% CI, 0.798-0.886) versus a range of 0.717 to 0.814., Conclusions: Deep learning-enabled automated detection of RPD presence from FAF images achieved a high level of accuracy, equal or superior to that of ophthalmologists. Automated RPD detection using CFP achieved a lower accuracy that still surpassed that of ophthalmologists. Deep learning models can assist, and even augment, the detection of this clinically important AMD-associated lesion., (Published by Elsevier Inc.)
Published: 2020
Full Text: View/download PDF

9. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records.

Author: Chen Q, Du J, Kim S, Wilbur WJ, and Lu Z
Subjects: Data Mining, Humans, Language, PubMed, Deep Learning, Electronic Health Records, Information Storage and Retrieval methods, Machine Learning
Abstract: Background: Capturing sentence semantics plays a vital role in a range of text mining applications. Despite continuous efforts on the development of related datasets and models in the general domain, both datasets and models are limited in biomedical and clinical domains. The BioCreative/OHNLP2018 organizers have made the first attempt to annotate 1068 sentence pairs from clinical notes and have called for a community effort to tackle the Semantic Textual Similarity (BioCreative/OHNLP STS) challenge., Methods: We developed models using traditional machine learning and deep learning approaches. For the post challenge, we focused on two models: the Random Forest and the Encoder Network. We applied sentence embeddings pre-trained on PubMed abstracts and MIMIC-III clinical notes and updated the Random Forest and the Encoder Network accordingly., Results: The official results demonstrated our best submission was the ensemble of eight models. It achieved a Person correlation coefficient of 0.8328 - the highest performance among 13 submissions from 4 teams. For the post challenge, the performance of both Random Forest and the Encoder Network was improved; in particular, the correlation of the Encoder Network was improved by ~ 13%. During the challenge task, no end-to-end deep learning models had better performance than machine learning models that take manually-crafted features. In contrast, with the sentence embeddings pre-trained on biomedical corpora, the Encoder Network now achieves a correlation of ~ 0.84, which is higher than the original best model. The ensembled model taking the improved versions of the Random Forest and Encoder Network as inputs further increased performance to 0.8528., Conclusions: Deep learning models with sentence embeddings pre-trained on biomedical corpora achieve the highest performance on the test set. Through error analysis, we find that end-to-end deep learning models and traditional machine learning models with manually-crafted features complement each other by finding different types of sentences. We suggest a combination of these models can better find similar sentences in practice.
Published: 2020
Full Text: View/download PDF

10. BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.

Author: Chen Q, Lee K, Yan S, Kim S, Wei CH, and Lu Z
Subjects: Algorithms, Databases, Protein, Drug Interactions, Electronic Health Records, Humans, Protein Interaction Mapping, PubMed, Semantics, Computational Biology methods, Data Mining methods, Deep Learning, Publications
Abstract: A massive number of biological entities, such as genes and mutations, are mentioned in the biomedical literature. The capturing of the semantic relatedness of biological entities is vital to many biological applications, such as protein-protein interaction prediction and literature-based discovery. Concept embeddings-which involve the learning of vector representations of concepts using machine learning models-have been employed to capture the semantics of concepts. To develop concept embeddings, named-entity recognition (NER) tools are first used to identify and normalize concepts from the literature, and then different machine learning models are used to train the embeddings. Despite multiple attempts, existing biomedical concept embeddings generally suffer from suboptimal NER tools, small-scale evaluation, and limited availability. In response, we employed high-performance machine learning-based NER tools for concept recognition and trained our concept embeddings, BioConceptVec, via four different machine learning models on ~30 million PubMed abstracts. BioConceptVec covers over 400,000 biomedical concepts mentioned in the literature and is of the largest among the publicly available biomedical concept embeddings to date. To evaluate the validity and utility of BioConceptVec, we respectively performed two intrinsic evaluations (identifying related concepts based on drug-gene and gene-gene interactions) and two extrinsic evaluations (protein-protein interaction prediction and drug-drug interaction extraction), collectively using over 25 million instances from nine independent datasets (17 million instances from six intrinsic evaluation tasks and 8 million instances from three extrinsic evaluation tasks), which is, by far, the most comprehensive to our best knowledge. The intrinsic evaluation results demonstrate that BioConceptVec consistently has, by a large margin, better performance than existing concept embeddings in identifying similar and related concepts. More importantly, the extrinsic evaluation results demonstrate that using BioConceptVec with advanced deep learning models can significantly improve performance in downstream bioinformatics studies and biomedical text-mining applications. Our BioConceptVec embeddings and benchmarking datasets are publicly available at https://github.com/ncbi-nlp/BioConceptVec., Competing Interests: The authors have declared that no competing interests exist.
Published: 2020
Full Text: View/download PDF

11. A Deep Learning Approach for Automated Detection of Geographic Atrophy from Color Fundus Photographs.

Author: Keenan TD, Dharssi S, Peng Y, Chen Q, Agrón E, Wong WT, Lu Z, and Chew EY
Subjects: Aged, Aged, 80 and over, Area Under Curve, Female, Humans, Male, Middle Aged, Photography methods, Physical Examination, Reproducibility of Results, Sensitivity and Specificity, Deep Learning, Diagnostic Techniques, Ophthalmological, Geographic Atrophy diagnosis, Image Processing, Computer-Assisted methods
Abstract: Purpose: To assess the utility of deep learning in the detection of geographic atrophy (GA) from color fundus photographs and to explore potential utility in detecting central GA (CGA)., Design: A deep learning model was developed to detect the presence of GA in color fundus photographs, and 2 additional models were developed to detect CGA in different scenarios., Participants: A total of 59 812 color fundus photographs from longitudinal follow-up of 4582 participants in the Age-Related Eye Disease Study (AREDS) dataset. Gold standard labels were from human expert reading center graders using a standardized protocol., Methods: A deep learning model was trained to use color fundus photographs to predict GA presence from a population of eyes with no AMD to advanced AMD. A second model was trained to predict CGA presence from the same population. A third model was trained to predict CGA presence from the subset of eyes with GA. For training and testing, 5-fold cross-validation was used. For comparison with human clinician performance, model performance was compared with that of 88 retinal specialists., Main Outcome Measures: Area under the curve (AUC), accuracy, sensitivity, specificity, and precision., Results: The deep learning models (GA detection, CGA detection from all eyes, and centrality detection from GA eyes) had AUCs of 0.933-0.976, 0.939-0.976, and 0.827-0.888, respectively. The GA detection model had accuracy, sensitivity, specificity, and precision of 0.965 (95% confidence interval [CI], 0.959-0.971), 0.692 (0.560-0.825), 0.978 (0.970-0.985), and 0.584 (0.491-0.676), respectively, compared with 0.975 (0.971-0.980), 0.588 (0.468-0.707), 0.982 (0.978-0.985), and 0.368 (0.230-0.505) for the retinal specialists. The CGA detection model had values of 0.966 (0.957-0.975), 0.763 (0.641-0.885), 0.971 (0.960-0.982), and 0.394 (0.341-0.448). The centrality detection model had values of 0.762 (0.725-0.799), 0.782 (0.618-0.945), 0.729 (0.543-0.916), and 0.799 (0.710-0.888)., Conclusions: A deep learning model demonstrated high accuracy for the automated detection of GA. The AUC was noninferior to that of human retinal specialists. Deep learning approaches may also be applied to the identification of CGA. The code and pretrained models are publicly available at https://github.com/ncbi-nlp/DeepSeeNet., (Published by Elsevier Inc.)
Published: 2019
Full Text: View/download PDF

12. ML-Net: multi-label classification of biomedical texts with deep neural networks.

Author: Du J, Chen Q, Peng Y, Xiang Y, Tao C, and Lu Z
Subjects: Benchmarking, Classification methods, Computational Biology, Data Mining methods, Machine Learning, Deep Learning, Natural Language Processing, Neural Networks, Computer
Abstract: Objective: In multi-label text classification, each textual document is assigned 1 or more labels. As an important task that has broad applications in biomedicine, a number of different computational methods have been proposed. Many of these methods, however, have only modest accuracy or efficiency and limited success in practical use. We propose ML-Net, a novel end-to-end deep learning framework, for multi-label classification of biomedical texts., Materials and Methods: ML-Net combines a label prediction network with an automated label count prediction mechanism to provide an optimal set of labels. This is accomplished by leveraging both the predicted confidence score of each label and the deep contextual information (modeled by ELMo) in the target document. We evaluate ML-Net on 3 independent corpora in 2 text genres: biomedical literature and clinical notes. For evaluation, we use example-based measures, such as precision, recall, and the F measure. We also compare ML-Net with several competitive machine learning and deep learning baseline models., Results: Our benchmarking results show that ML-Net compares favorably to state-of-the-art methods in multi-label classification of biomedical text. ML-Net is also shown to be robust when evaluated on different text genres in biomedicine., Conclusion: ML-Net is able to accuractely represent biomedical document context and dynamically estimate the label count in a more systematic and accurate manner. Unlike traditional machine learning methods, ML-Net does not require human effort for feature engineering and is a highly efficient and scalable approach to tasks with a large set of labels, so there is no need to build individual classifiers for each separate label., (Published by Oxford University Press on behalf of the American Medical Informatics Association 2019.)
Published: 2019
Full Text: View/download PDF

13. DeepSeeNet: A Deep Learning Model for Automated Classification of Patient-based Age-related Macular Degeneration Severity from Color Fundus Photographs.

Author: Peng Y, Dharssi S, Chen Q, Keenan TD, Agrón E, Wong WT, Chew EY, and Lu Z
Subjects: Aged, Aged, 80 and over, Area Under Curve, Disease Progression, Female, Humans, Male, Middle Aged, Prospective Studies, Reproducibility of Results, Retinal Drusen classification, Retinal Drusen diagnosis, Risk Factors, Sensitivity and Specificity, Severity of Illness Index, Deep Learning, Diagnosis, Computer-Assisted methods, Diagnostic Techniques, Ophthalmological, Geographic Atrophy classification, Geographic Atrophy diagnosis, Models, Theoretical, Photography methods
Abstract: Purpose: In assessing the severity of age-related macular degeneration (AMD), the Age-Related Eye Disease Study (AREDS) Simplified Severity Scale predicts the risk of progression to late AMD. However, its manual use requires the time-consuming participation of expert practitioners. Although several automated deep learning systems have been developed for classifying color fundus photographs (CFP) of individual eyes by AREDS severity score, none to date has used a patient-based scoring system that uses images from both eyes to assign a severity score., Design: DeepSeeNet, a deep learning model, was developed to classify patients automatically by the AREDS Simplified Severity Scale (score 0-5) using bilateral CFP., Participants: DeepSeeNet was trained on 58 402 and tested on 900 images from the longitudinal follow-up of 4549 participants from AREDS. Gold standard labels were obtained using reading center grades., Methods: DeepSeeNet simulates the human grading process by first detecting individual AMD risk factors (drusen size, pigmentary abnormalities) for each eye and then calculating a patient-based AMD severity score using the AREDS Simplified Severity Scale., Main Outcome Measures: Overall accuracy, specificity, sensitivity, Cohen's kappa, and area under the curve (AUC). The performance of DeepSeeNet was compared with that of retinal specialists., Results: DeepSeeNet performed better on patient-based classification (accuracy = 0.671; kappa = 0.558) than retinal specialists (accuracy = 0.599; kappa = 0.467) with high AUC in the detection of large drusen (0.94), pigmentary abnormalities (0.93), and late AMD (0.97). DeepSeeNet also outperformed retinal specialists in the detection of large drusen (accuracy 0.742 vs. 0.696; kappa 0.601 vs. 0.517) and pigmentary abnormalities (accuracy 0.890 vs. 0.813; kappa 0.723 vs. 0.535) but showed lower performance in the detection of late AMD (accuracy 0.967 vs. 0.973; kappa 0.663 vs. 0.754)., Conclusions: By simulating the human grading process, DeepSeeNet demonstrated high accuracy with increased transparency in the automated assignment of individual patients to AMD risk categories based on the AREDS Simplified Severity Scale. These results highlight the potential of deep learning to assist and enhance clinical decision-making in patients with AMD, such as early AMD detection and risk prediction for developing late AMD. DeepSeeNet is publicly available on https://github.com/ncbi-nlp/DeepSeeNet., (Copyright © 2018 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.)
Published: 2019
Full Text: View/download PDF

14. Using deep learning to identify translational research in genomic medicine beyond bench to bedside.

Author: Hsu YY, Clyne M, Wei CH, Khoury MJ, and Lu Z
Subjects: Neural Networks, Computer, ROC Curve, Reproducibility of Results, Statistics as Topic, Support Vector Machine, Task Performance and Analysis, Deep Learning, Genomics, Translational Research, Biomedical
Abstract: Tracking scientific research publications on the evaluation, utility and implementation of genomic applications is critical for the translation of basic research to impact clinical and population health. In this work, we utilize state-of-the-art machine learning approaches to identify translational research in genomics beyond bench to bedside from the biomedical literature. We apply the convolutional neural networks (CNNs) and support vector machines (SVMs) to the bench/bedside article classification on the weekly manual annotation data of the Public Health Genomics Knowledge Base database. Both classifiers employ salient features to determine the probability of curation-eligible publications, which can effectively reduce the workload of manual triage and curation process. We applied the CNNs and SVMs to an independent test set (n = 400), and the models achieved the F-measure of 0.80 and 0.74, respectively. We further tested the CNNs, which perform better results, on the routine annotation pipeline for 2 weeks and significantly reduced the effort and retrieved more appropriate research articles. Our approaches provide direct insight into the automated curation of genomic translational research beyond bench to bedside. The machine learning classifiers are found to be helpful for annotators to enhance the efficiency of manual curation.
Published: 2019
Full Text: View/download PDF

15. Opportunities and obstacles for deep learning in biology and medicine.

Author: Ching T, Himmelstein DS, Beaulieu-Jones BK, Kalinin AA, Do BT, Way GP, Ferrero E, Agapow PM, Zietz M, Hoffman MM, Xie W, Rosen GL, Lengerich BJ, Israeli J, Lanchantin J, Woloszynek S, Carpenter AE, Shrikumar A, Xu J, Cofer EM, Lavender CA, Turaga SC, Alexandari AM, Lu Z, Harris DJ, DeCaprio D, Qi Y, Kundaje A, Peng Y, Wiley LK, Segler MHS, Boca SM, Swamidass SJ, Huang A, Gitter A, and Greene CS
Subjects: Algorithms, Biomedical Research methods, Decision Making, Delivery of Health Care methods, Delivery of Health Care trends, Disease genetics, Drug Design, Electronic Health Records trends, Humans, Terminology as Topic, Biomedical Research trends, Biomedical Technology trends, Deep Learning trends
Abstract: Deep learning describes a class of machine learning algorithms that are capable of combining raw inputs into layers of intermediate features. These algorithms have recently shown impressive results across a variety of domains. Biology and medicine are data-rich disciplines, but the data are complex and often ill-understood. Hence, deep learning techniques may be particularly well suited to solve problems of these fields. We examine applications of deep learning to a variety of biomedical problems-patient classification, fundamental biological processes and treatment of patients-and discuss whether deep learning will be able to transform these tasks or if the biomedical sphere poses unique challenges. Following from an extensive literature review, we find that deep learning has yet to revolutionize biomedicine or definitively resolve any of the most pressing challenges in the field, but promising advances have been made on the prior state of the art. Even though improvements over previous baselines have been modest in general, the recent progress indicates that deep learning methods will provide valuable means for speeding up or aiding human investigation. Though progress has been made linking a specific neural network's prediction to input features, understanding how users should interpret these models to make testable hypotheses about the system under study remains an open challenge. Furthermore, the limited amount of labelled data for training presents problems in some domains, as do legal and privacy constraints on work with sensitive health records. Nonetheless, we foresee deep learning enabling changes at both bench and bedside with the potential to transform several areas of biology and medicine., (© 2018 The Authors.)
Published: 2018
Full Text: View/download PDF

16. Comparison of State-of-the-Art Neural Network Survival Models with the Pooled Cohort Equations for Cardiovascular Disease Risk Prediction

Author: Deng, Yu, Liu, Lei, Jiang, Hongmei, Peng, Yifan, Wei, Yishu, Zhou, Zhiyang, Zhong, Yizhen, Zhao, Yun, Yang, Xiaoyun, Yu, Jingzhi, Lu, Zhiyong, Kho, Abel, Ning, Hongyan, Allen, Norrina B., Wilkins, John T., Liu, Kiang, Lloyd-Jones, Donald M., and Zhao, Lihui
Published: 2023
Full Text: View/download PDF

17. Universal lymph node detection in T2 MRI using neural networks

Author: Mathai, Tejas Sudharshan, Lee, Sungwon, Shen, Thomas C., Lu, Zhiyong, and Summers, Ronald M.
Published: 2023
Full Text: View/download PDF

18. Predicting Age-related Macular Degeneration Progression with Longitudinal Fundus Images Using Deep Learning

Author: Lee, Junghwan, Wanyan, Tingyi, Chen, Qingyu, Keenan, Tiarnan D. L., Glicksberg, Benjamin S., Chew, Emily Y., Lu, Zhiyong, Wang, Fei, Peng, Yifan, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lian, Chunfeng, editor, Cao, Xiaohuan, editor, Rekik, Islem, editor, Xu, Xuanang, editor, and Cui, Zhiming, editor
Published: 2022
Full Text: View/download PDF

19. Dadnet: dual-attention detection network for crack segmentation on tomb murals.

Author: Wu, Meng, Chai, Ruochang, Zhang, Yongqin, and Lu, Zhiyong
Subjects: DEEP learning, TANG dynasty, China, 618-907, FEATURE extraction, TOMBS, NEIGHBORHOODS
Abstract: Many tomb murals have punctate losses, cracks, and craquelure due to underground subsidence and changes in their physical support. Visual non-destructive detection techniques enable rapid assessment of how much tomb murals are affected by cracking, providing suggestions for their conservation. However, tomb murals are typically created by sketching outlines and then colored. Detailed sketches can easily interfere with crack detection work, requiring the use of deep learning network to better learn crack features and improve detection accuracy. At the same time the limited data of tomb mural presents a challenge to build a deep learning network. To address these issues, this paper introduces a novel dual-attention detection network (DADNet) for crack segmentation of tomb murals. In this work, a customized dataset is first constructed by collecting mural images from the Tang Dynasty tombs. Then the ConvNeXt framework serves as the basis for feature extraction, enhancing the process. Lastly, a dual-attention module utilizing neighborhood attention and biaxial attention is employed to accurately identify the crack regions. Neighborhood attention performs a local self-attention operation around the pixel point, addressing the limitations of self-attention. This approach significantly reduces computational demands as the image size increases. Biaxial attention performs attention calculations in the horizontal and vertical directions. This compensates for the limitation of neighborhood attention in capturing global dependencies. Our DADNet outperformed the competing methods, achieving the highest recorded scores of 78.95% for MIoU and 61.05% for the Jaccard index. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Artificial Intelligence in Age-Related Macular Degeneration (AMD)

Author: Peng, Yifan, Chen, Qingyu, Keenan, Tiarnan D. L., Chew, Emily Y., Lu, Zhiyong, and Grzybowski, Andrzej, editor
Published: 2021
Full Text: View/download PDF

21. Detection of Lymph Nodes in T2 MRI Using Neural Network Ensembles

Author: Mathai, Tejas Sudharshan, Lee, Sungwon, Elton, Daniel C., Shen, Thomas C., Peng, Yifan, Lu, Zhiyong, Summers, Ronald M., Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lian, Chunfeng, editor, Cao, Xiaohuan, editor, Rekik, Islem, editor, Xu, Xuanang, editor, and Yan, Pingkun, editor
Published: 2021
Full Text: View/download PDF

22. Closing the gap between open source and commercial large language models for medical evidence summarization.

Author: Zhang, Gongbo, Jin, Qiao, Zhou, Yiliang, Wang, Song, Idnay, Betina, Luo, Yiming, Park, Elizabeth, Nestor, Jordan G., Spotnitz, Matthew E., Soroush, Ali, Campion Jr., Thomas R., Lu, Zhiyong, Weng, Chunhua, and Peng, Yifan
Subjects: COMPUTER simulation, PEARSON correlation (Statistics), RESEARCH funding, T-test (Statistics), TASK performance, PROBABILITY theory, EVALUATION of human services programs, NATURAL language processing, DESCRIPTIVE statistics, MEDICAL databases, ARTIFICIAL neural networks, DEEP learning, COMPUTER networks, EVIDENCE-based medicine, CONFIDENCE intervals, COMPUTER assisted instruction, COMPARATIVE studies
Abstract: Large language models (LLMs) hold great promise in summarizing medical evidence. Most recent studies focus on the application of proprietary LLMs. Using proprietary LLMs introduces multiple risk factors, including a lack of transparency and vendor dependency. While open-source LLMs allow better transparency and customization, their performance falls short compared to the proprietary ones. In this study, we investigated to what extent fine-tuning open-source LLMs can further improve their performance. Utilizing a benchmark dataset, MedReview, consisting of 8161 pairs of systematic reviews and summaries, we fine-tuned three broadly-used, open-sourced LLMs, namely PRIMERA, LongT5, and Llama-2. Overall, the performance of open-source models was all improved after fine-tuning. The performance of fine-tuned LongT5 is close to GPT-3.5 with zero-shot settings. Furthermore, smaller fine-tuned models sometimes even demonstrated superior performance compared to larger zero-shot models. The above trends of improvement were manifested in both a human evaluation and a larger-scale GPT4-simulated evaluation. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling.

Author: Holste, Gregory, Lin, Mingquan, Zhou, Ruiwen, Wang, Fei, Liu, Lei, Yan, Qi, Van Tassel, Sarah H., Kovacs, Kyle, Chew, Emily Y., Lu, Zhiyong, Wang, Zhangyang, and Peng, Yifan
Subjects: GLAUCOMA, PREDICTION models, DIAGNOSTIC imaging, RESEARCH funding, T-test (Statistics), DATA analysis, RETINAL degeneration, PROBABILITY theory, EYE diseases, PHOTOGRAPHY, LONGITUDINAL method, OPHTHALMOLOGY, DEEP learning, STATISTICS, AUTOMATION, SURVIVAL analysis (Biometry), CONFIDENCE intervals, PROPORTIONAL hazards models, NONPARAMETRIC statistics
Abstract: Deep learning has enabled breakthroughs in automated diagnosis from medical imaging, with many successful applications in ophthalmology. However, standard medical image classification approaches only assess disease presence at the time of acquisition, neglecting the common clinical setting of longitudinal imaging. For slow, progressive eye diseases like age-related macular degeneration (AMD) and primary open-angle glaucoma (POAG), patients undergo repeated imaging over time to track disease progression and forecasting the future risk of developing a disease is critical to properly plan treatment. Our proposed Longitudinal Transformer for Survival Analysis (LTSA) enables dynamic disease prognosis from longitudinal medical imaging, modeling the time to disease from sequences of fundus photography images captured over long, irregular time periods. Using longitudinal imaging data from the Age-Related Eye Disease Study (AREDS) and Ocular Hypertension Treatment Study (OHTS), LTSA significantly outperformed a single-image baseline in 19/20 head-to-head comparisons on late AMD prognosis and 18/20 comparisons on POAG prognosis. A temporal attention analysis also suggested that, while the most recent image is typically the most influential, prior imaging still provides additional prognostic value. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. Improving model fairness in image-based computer-aided diagnosis.

Author: Lin, Mingquan, Li, Tianhao, Yang, Yifan, Holste, Gregory, Ding, Ying, Van Tassel, Sarah H., Kovacs, Kyle, Shih, George, Wang, Zhangyang, Lu, Zhiyong, Wang, Fei, and Peng, Yifan
Subjects: DEEP learning, COMPUTER-aided diagnosis, IMAGE recognition (Computer vision), COMPUTER-assisted image analysis (Medicine), FAIRNESS, MEDICAL coding
Abstract: Deep learning has become a popular tool for computer-aided diagnosis using medical images, sometimes matching or exceeding the performance of clinicians. However, these models can also reflect and amplify human bias, potentially resulting inaccurate missed diagnoses. Despite this concern, the problem of improving model fairness in medical image classification by deep learning has yet to be fully studied. To address this issue, we propose an algorithm that leverages the marginal pairwise equal opportunity to reduce bias in medical image classification. Our evaluations across four tasks using four independent large-scale cohorts demonstrate that our proposed algorithm not only improves fairness in individual and intersectional subgroups but also maintains overall performance. Specifically, the relative change in pairwise fairness difference between our proposed model and the baseline model was reduced by over 35%, while the relative change in AUC value was typically within 1%. By reducing the bias generated by deep learning models, our proposed approach can potentially alleviate concerns about the fairness and reliability of image-based computer-aided diagnosis. Deep learning models can reflect and amplify human bias, potentially resulting inaccurate missed diagnoses. Here, the authors show that by leveraging the marginal pairwise equal opportunity, their model reduces bias in medical image classification by over 35% compared to baseline models, with minimal impact on AUC values. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

25. GNorm2: an improved gene name recognition and normalization system.

Author: Wei, Chih-Hsuan, Luo, Ling, Islamaj, Rezarta, Lai, Po-Ting, and Lu, Zhiyong
Subjects: DEEP learning, GENES, TEXT mining, RECOGNITION (Psychology), SCIENTIFIC community
Abstract: Motivation Gene name normalization is an important yet highly complex task in biomedical text mining research, as gene names can be highly ambiguous and may refer to different genes in different species or share similar names with other bioconcepts. This poses a challenge for accurately identifying and linking gene mentions to their corresponding entries in databases such as NCBI Gene or UniProt. While there has been a body of literature on the gene normalization task, few have addressed all of these challenges or make their solutions publicly available to the scientific community. Results Building on the success of GNormPlus, we have created GNorm2: a more advanced tool with optimized functions and improved performance. GNorm2 integrates a range of advanced deep learning-based methods, resulting in the highest levels of accuracy and efficiency for gene recognition and normalization to date. Our tool is freely available for download. Availability and implementation https://github.com/ncbi/GNorm2. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

26. Author Correction: Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling.

Author: Holste, Gregory, Lin, Mingquan, Zhou, Ruiwen, Wang, Fei, Liu, Lei, Yan, Qi, Van Tassel, Sarah H., Kovacs, Kyle, Chew, Emily Y., Lu, Zhiyong, Wang, Zhangyang, and Peng, Yifan
Subjects: DIAGNOSTIC imaging, EYE diseases, DEEP learning
Published: 2024
Full Text: View/download PDF

27. Generalized Zero-Shot Chest X-Ray Diagnosis Through Trait-Guided Multi-View Semantic Embedding With Self-Training.

Author: Paul, Angshuman, Shen, Thomas C., Lee, Sungwon, Balachandar, Niranjan, Peng, Yifan, Lu, Zhiyong, and Summers, Ronald M.
Subjects: RADIOSCOPIC diagnosis, DEEP learning, X-rays, CHEST X rays, DIAGNOSIS, MACHINE learning
Abstract: Zero-shot learning (ZSL) is one of the most promising avenues of annotation-efficient machine learning. In the era of deep learning, ZSL techniques have achieved unprecedented success. However, the developments of ZSL methods have taken place mostly for natural images. ZSL for medical images has remained largely unexplored. We design a novel strategy for generalized zero-shot diagnosis of chest radiographs. In doing so, we leverage the potential of multi-view semantic embedding, a useful yet less-explored direction for ZSL. Our design also incorporates a self-training phase to tackle the problem of noisy labels alongside improving the performance for classes not seen during training. Through rigorous experiments, we show that our model trained on one dataset can produce consistent performance across test datasets from different sources including those with very different quality. Comparisons with a number of state-of-the-art techniques show the superiority of the proposed method for generalized zero-shot chest x-ray diagnosis. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

28. PhenoTagger: a hybrid method for phenotype concept recognition using human phenotype ontology.

Author: Luo, Ling, Yan, Shankai, Lai, Po-Ting, Veltri, Daniel, Oler, Andrew, Xirasagar, Sandhya, Ghosh, Rajarshi, Similuk, Morgan, Robinson, Peter N, and Lu, Zhiyong
Subjects: HUMAN phenotype, PHENOTYPES, GENE ontology, DEEP learning, MACHINE learning, SOURCE code, APPLICATION program interfaces
Abstract: Motivation Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous works that address the task typically use dictionary-based matching methods, which can achieve high precision but suffer from lower recall. Recently, machine learning-based methods have been proposed to identify biomedical concepts, which can recognize more unseen concept synonyms by automatic feature learning. However, most methods require large corpora of manually annotated data for model training, which is difficult to obtain due to the high cost of human annotation. Results In this article, we propose PhenoTagger, a hybrid method that combines both dictionary and machine learning-based methods to recognize Human Phenotype Ontology (HPO) concepts in unstructured biomedical text. We first use all concepts and synonyms in HPO to construct a dictionary, which is then used to automatically build a distantly supervised training dataset for machine learning. Next, a cutting-edge deep learning model is trained to classify each candidate phrase (n -gram from input sentence) into a corresponding concept label. Finally, the dictionary and machine learning-based prediction results are combined for improved performance. Our method is validated with two HPO corpora, and the results show that PhenoTagger compares favorably to previous methods. In addition, to demonstrate the generalizability of our method, we retrained PhenoTagger using the disease ontology MEDIC for disease concept recognition to investigate the effect of training on different ontologies. Experimental results on the NCBI disease corpus show that PhenoTagger without requiring manually annotated training data achieves competitive performance as compared with state-of-the-art supervised methods. Availabilityand implementation The source code, API information and data for PhenoTagger are freely available at https://github.com/ncbi-nlp/PhenoTagger. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

29. Recent advances in biomedical literature mining.

Author: Zhao, Sendong, Su, Chang, Lu, Zhiyong, and Wang, Fei
Subjects: DATA mining, COMPUTER science, COMMUNITIES, NATURAL language processing
Abstract: The recent years have witnessed a rapid increase in the number of scientific articles in biomedical domain. These literature are mostly available and readily accessible in electronic format. The domain knowledge hidden in them is critical for biomedical research and applications, which makes biomedical literature mining (BLM) techniques highly demanding. Numerous efforts have been made on this topic from both biomedical informatics (BMI) and computer science (CS) communities. The BMI community focuses more on the concrete application problems and thus prefer more interpretable and descriptive methods, while the CS community chases more on superior performance and generalization ability, thus more sophisticated and universal models are developed. The goal of this paper is to provide a review of the recent advances in BLM from both communities and inspire new research directions. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

30. Automated abnormality classification of chest radiographs using deep convolutional neural networks.

Author: Tang, Yu-Xing, Tang, You-Bao, Peng, Yifan, Yan, Ke, Bagheri, Mohammadhadi, Redd, Bernadette A., Brandon, Catherine J., Lu, Zhiyong, Han, Mei, Xiao, Jing, and Summers, Ronald M.
Subjects: CHEST X rays, CONVOLUTIONAL neural networks, PNEUMONIA diagnosis, DEEP learning, RADIOLOGISTS
Abstract: As one of the most ubiquitous diagnostic imaging tests in medical practice, chest radiography requires timely reporting of potential findings and diagnosis of diseases in the images. Automated, fast, and reliable detection of diseases based on chest radiography is a critical step in radiology workflow. In this work, we developed and evaluated various deep convolutional neural networks (CNN) for differentiating between normal and abnormal frontal chest radiographs, in order to help alert radiologists and clinicians of potential abnormal findings as a means of work list triaging and reporting prioritization. A CNN-based model achieved an AUC of 0.9824 ± 0.0043 (with an accuracy of 94.64 ± 0.45%, a sensitivity of 96.50 ± 0.36% and a specificity of 92.86 ± 0.48%) for normal versus abnormal chest radiograph classification. The CNN model obtained an AUC of 0.9804 ± 0.0032 (with an accuracy of 94.71 ± 0.32%, a sensitivity of 92.20 ± 0.34% and a specificity of 96.34 ± 0.31%) for normal versus lung opacity classification. Classification performance on the external dataset showed that the CNN model is likely to be highly generalizable, with an AUC of 0.9444 ± 0.0029. The CNN model pre-trained on cohorts of adult patients and fine-tuned on pediatric patients achieved an AUC of 0.9851 ± 0.0046 for normal versus pneumonia classification. Pretraining with natural images demonstrates benefit for a moderate-sized training image set of about 8500 images. The remarkable performance in diagnostic accuracy observed in this study shows that deep CNNs can accurately and effectively differentiate normal and abnormal chest radiographs, thereby providing potential benefits to radiology workflow and patient care. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

31. Extracting chemical–protein relations with ensembles of SVM and deep learning models.

Author: Peng, Yifan, Rios, Anthony, Kavuluru, Ramakanth, and Lu, Zhiyong
Subjects: DATA mining, SUPPORT vector machines, DEEP learning
Abstract: Mining relations between chemicals and proteins from the biomedical literature is an increasingly important task. The CHEMPROT track at BioCreative VI aims to promote the development and evaluation of systems that can automatically detect the chemical–protein relations in running text (PubMed abstracts). This work describes our CHEMPROT track entry, which is an ensemble of three systems, including a support vector machine, a convolutional neural network, and a recurrent neural network. Their output is combined using majority voting or stacking for final predictions. Our CHEMPROT system obtained 0.7266 in precision and 0.5735 in recall for an F-score of 0.6410 during the challenge, demonstrating the effectiveness of machine learning-based approaches for automatic relation extraction from biomedical literature and achieving the highest performance in the task during the 2017 challenge. Database URL: http://www.biocreative.org/tasks/biocreative-vi/track-5/ [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

32. An Updated Simplified Severity Scale for Age-Related Macular Degeneration Incorporating Reticular Pseudodrusen: Age-Related Eye Disease Study Report Number 42.

Author: Agrón, Elvira, Domalpally, Amitha, Chen, Qingyu, Lu, Zhiyong, Chew, Emily Y., and Keenan, Tiarnan D.L.
Subjects: *MACULAR degeneration, *EYE diseases, *DEEP learning, *REPORTING of diseases, *ATROPHY
Abstract: To update the Age-Related Eye Disease Study (AREDS) simplified severity scale for risk of late age-related macular degeneration (AMD), including incorporation of reticular pseudodrusen (RPD), and to perform external validation on the Age-Related Eye Disease Study 2 (AREDS2). Post hoc analysis of 2 clinical trial cohorts: AREDS and AREDS2. Participants with no late AMD in either eye at baseline in AREDS (n = 2719) and AREDS2 (n = 1472). Five-year rates of progression to late AMD were calculated according to levels 0 to 4 on the simplified severity scale after 2 updates: (1) noncentral geographic atrophy (GA) considered part of the outcome, rather than a risk feature, and (2) scale separation according to RPD status (determined by validated deep learning grading of color fundus photographs). Five-year rate of progression to late AMD (defined as neovascular AMD or any GA). In the AREDS, after the first scale update, the 5-year rates of progression to late AMD for levels 0 to 4 were 0.3%, 4.5%, 12.9%, 32.2%, and 55.6%, respectively. As the final simplified severity scale, the 5-year progression rates for levels 0 to 4 were 0.3%, 4.3%, 11.6%, 26.7%, and 50.0%, respectively, for participants without RPD at baseline and 2.8%, 8.0%, 29.0%, 58.7%, and 72.2%, respectively, for participants with RPD at baseline. In external validation on the AREDS2, for levels 2 to 4, the progression rates were similar: 15.0%, 27.7%, and 45.7% (RPD absent) and 26.2%, 46.0%, and 73.0% (RPD present), respectively. The AREDS AMD simplified severity scale has been modernized with 2 important updates. The new scale for individuals without RPD has 5-year progression rates of approximately 0.5%, 4%, 12%, 25%, and 50%, such that the rates on the original scale remain accurate. The new scale for individuals with RPD has 5-year progression rates of approximately 3%, 8%, 30%, 60%, and 70%, that is, approximately double for most levels. This scale fits updated definitions of late AMD, has increased prognostic accuracy, seems generalizable to similar populations, but remains simple for broad risk categorization. Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

33. NLM-Gene, a richly annotated gold standard dataset for gene entities that addresses ambiguity and multi-species gene recognition.

Author: Islamaj, Rezarta, Wei, Chih-Hsuan, Cissel, David, Miliaras, Nicholas, Printseva, Olga, Rodionov, Oleg, Sekiya, Keiko, Ward, Janice, and Lu, Zhiyong
Abstract: The automatic recognition of gene names and their corresponding database identifiers in biomedical text is an important first step for many downstream text-mining applications. While current methods for tagging gene entities have been developed for biomedical literature, their performance on species other than human is substantially lower due to the lack of annotation data. We therefore present the NLM-Gene corpus, a high-quality manually annotated corpus for genes developed at the US National Library of Medicine (NLM), covering ambiguous gene names, with an average of 29 gene mentions (10 unique identifiers) per document, and a broader representation of different species (including Homo sapiens, Mus musculus, Rattus norvegicus, Drosophila melanogaster, Arabidopsis thaliana, Danio rerio, etc.) when compared to previous gene annotation corpora. NLM-Gene consists of 550 PubMed abstracts from 156 biomedical journals, doubly annotated by six experienced NLM indexers, randomly paired for each document to control for bias. The annotators worked in three annotation rounds until they reached complete agreement. This gold-standard corpus can serve as a benchmark to develop & test new gene text mining algorithms. Using this new resource, we have developed a new gene finding algorithm based on deep learning which improved both on precision and recall from existing tools. The NLM-Gene annotated corpus is freely available at ftp://ftp.ncbi.nlm.nih.gov/pub/lu/NLMGene. We have also applied this tool to the entire PubMed/PMC with their results freely accessible through our web-based tool PubTator (www.ncbi.nlm.nih.gov/research/pubtator). [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

34. Predicting risk of late age-related macular degeneration using deep learning.

Author: Peng, Yifan, Keenan, Tiarnan D., Chen, Qingyu, Agrón, Elvira, Allot, Alexis, Wong, Wai T., Chew, Emily Y., and Lu, Zhiyong
Subjects: DEEP learning, RETINAL degeneration, MEDICAL decision making, DISEASE progression, CLINICAL trials
Abstract: By 2040, age-related macular degeneration (AMD) will affect ~288 million people worldwide. Identifying individuals at high risk of progression to late AMD, the sight-threatening stage, is critical for clinical actions, including medical interventions and timely monitoring. Although deep learning has shown promise in diagnosing/screening AMD using color fundus photographs, it remains difficult to predict individuals' risks of late AMD accurately. For both tasks, these initial deep learning attempts have remained largely unvalidated in independent cohorts. Here, we demonstrate how deep learning and survival analysis can predict the probability of progression to late AMD using 3298 participants (over 80,000 images) from the Age-Related Eye Disease Studies AREDS and AREDS2, the largest longitudinal clinical trials in AMD. When validated against an independent test data set of 601 participants, our model achieved high prognostic accuracy (5-year C-statistic 86.4 (95% confidence interval 86.2–86.6)) that substantially exceeded that of retinal specialists using two existing clinical standards (81.3 (81.1–81.5) and 82.0 (81.8–82.3), respectively). Interestingly, our approach offers additional strengths over the existing clinical standards in AMD prognosis (e.g., risk ascertainment above 50%) and is likely to be highly generalizable, given the breadth of training data from 82 US retinal specialty clinics. Indeed, during external validation through training on AREDS and testing on AREDS2 as an independent cohort, our model retained substantially higher prognostic accuracy than existing clinical standards. These results highlight the potential of deep learning systems to enhance clinical decision-making in AMD patients. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

35. Using deep learning to identify translational research in genomic medicine beyond bench to bedside.

Author: Hsu, Yi-Yu, Clyne, Mindy, Wei, Chih-Hsuan, Khoury, Muin J, and Lu, Zhiyong
Subjects: TRANSLATIONAL research, ARTIFICIAL neural networks, MEDICAL research, DEEP learning, SUPPORT vector machines, RESEARCH evaluation
Abstract: Tracking scientific research publications on the evaluation, utility and implementation of genomic applications is critical for the translation of basic research to impact clinical and population health. In this work, we utilize state-of-the-art machine learning approaches to identify translational research in genomics beyond bench to bedside from the biomedical literature. We apply the convolutional neural networks (CNNs) and support vector machines (SVMs) to the bench/bedside article classification on the weekly manual annotation data of the Public Health Genomics Knowledge Base database. Both classifiers employ salient features to determine the probability of curation-eligible publications, which can effectively reduce the workload of manual triage and curation process. We applied the CNNs and SVMs to an independent test set (n = 400), and the models achieved the F -measure of 0.80 and 0.74, respectively. We further tested the CNNs, which perform better results, on the routine annotation pipeline for 2 weeks and significantly reduced the effort and retrieved more appropriate research articles. Our approaches provide direct insight into the automated curation of genomic translational research beyond bench to bedside. The machine learning classifiers are found to be helpful for annotators to enhance the efficiency of manual curation. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

36. Reticular Pseudodrusen: The Third Macular Risk Feature for Progression to Late Age-Related Macular Degeneration: Age-Related Eye Disease Study 2 Report 30.

Author: Agrón, Elvira, Domalpally, Amitha, Cukras, Catherine A., Clemons, Traci E., Chen, Qingyu, Lu, Zhiyong, Chew, Emily Y., and Keenan, Tiarnan D.L.
Subjects: *MACULAR degeneration, *EYE diseases, *DEEP learning
Abstract: To analyze reticular pseudodrusen (RPD) as an independent risk factor for progression to late age-related macular degeneration (AMD), alongside traditional macular risk factors (soft drusen and pigmentary abnormalities) considered simultaneously. Post hoc analysis of 2 clinical trial cohorts: Age-Related Eye Disease Study (AREDS) and AREDS2. Eyes with no late AMD at baseline in AREDS (6959 eyes, 3780 participants) and AREDS2 (3355 eyes, 2056 participants). Color fundus photographs (CFPs) from annual visits were graded for soft drusen, pigmentary abnormalities, and late AMD. Presence of RPD was from grading of fundus autofluorescence images (AREDS2) and deep learning grading of CFPs (AREDS). Proportional hazards regression analyses were performed, considering AREDS AMD severity scales (modified simplified severity scale [person] and 9-step scale [eye]) and RPD presence simultaneously. Progression to late AMD, geographic atrophy (GA), and neovascular AMD. In AREDS, for late AMD analyses by person, in a model considering the simplified severity scale simultaneously, RPD presence was associated with a higher risk of progression: hazard ratio (HR), 2.15 (95% confidence interval [CI], 1.75–2.64). However, the risk associated with RPD presence differed at different severity scale levels: HR, 3.23 (95% CI, 1.60–6.51), HR, 3.81 (95% CI, 2.38–6.10), HR, 2.28 (95% CI, 1.59–3.27), and HR, 1.64 (95% CI, 1.20–2.24), at levels 0–1, 2, 3, and 4, respectively. Considering the 9-step scale (by eye), RPD presence was associated with higher risk: HR, 2.54 (95% CI, 2.07–3.13). The HRs were 5.11 (95% CI, 3.93–6.66) at levels 1–6 and 1.78 (95% CI, 1.43–2.22) at levels 7 and 8. In AREDS2, by person, RPD presence was not associated with higher risk: HR, 1.18 (95% CI, 0.90–1.56); by eye, it was HR, 1.57 (95% CI, 1.31–1.89). In both cohorts, RPD presence carried a higher risk for GA than neovascular AMD. Reticular pseudodrusen represent an important risk factor for progression to late AMD, particularly GA. However, the added risk varies markedly by severity level, with highly increased risk at lower/moderate levels and less increased risk at higher levels. Reticular pseudodrusen status should be included in updated AMD classification systems, risk calculators, and clinical trials. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

37. Detection of abdominopelvic lymph nodes in multi-parametric MRI.

Author: Mathai, Tejas Sudharshan, Shen, Thomas C., Elton, Daniel C., Lee, Sungwon, Lu, Zhiyong, and Summers, Ronald M.
Subjects: *LYMPH nodes, *COMPUTER-aided diagnosis, *MAGNETIC resonance imaging, *BETA distribution, *TUMOR classification, *LYMPHANGIOGRAPHY, *OPTICAL scanners
Abstract: Reliable localization of lymph nodes (LNs) in multi-parametric MRI (mpMRI) studies plays a major role in the assessment of lymphadenopathy and staging of metastatic disease. Radiologists routinely measure the nodal size in order to distinguish benign from malignant nodes, which require subsequent cancer staging. However, identification of lymph nodes is a cumbersome task due to their myriad appearances in mpMRI studies. Multiple sequences are acquired in mpMRI studies, including T2 fat suppressed (T2FS) and diffusion weighted imaging (DWI) sequences among others; consequently, the sizing of LNs is rendered challenging due to the variety of signal intensities in these sequences. Furthermore, radiologists can miss potentially metastatic LNs during a busy clinical day. To lighten these imaging and workflow challenges, we propose a computer-aided detection (CAD) pipeline to detect both benign and malignant LNs in the body for their subsequent measurement. We employed the recently proposed Dynamic Head (DyHead) neural network to detect LNs in mpMRI studies that were acquired using a variety of scanners and exam protocols. The T2FS and DWI series were co-registered, and a selective augmentation technique called Intra-Label LISA (ILL) was used to blend the two volumes with the interpolation factor drawn from a Beta distribution. In this way, ILL diversified the samples that the model encountered during the training phase, while the requirement for both sequences to be present at test time was nullified. Our results showed a mean average precision (mAP) of 53.5% and a sensitivity of ∼ 78% with ILL at 4 FP/vol. This corresponded to an improvement of ≥ 10% in mAP and ≥ 12% in sensitivity at 4FP (p ¡ 0.05) respectively over current LN detection approaches evaluated on the same dataset. We also established the out-of-distribution robustness of the DyHead model by training it on data acquired by a Siemens Aera scanner and testing it on data from the Siemens Verio, Siemens Biograph mMR, and Philips Achieva scanners. Our pilot work represents an important first step towards automated detection, segmentation, and classification of lymph nodes in mpMRI. • We propose a pipeline for lymph node detection in multi-parametric MRI studies. • T2 fat suppressed (T2FS) and DWI series were linearly interpolated to preserve their traits during training. • We used a Dynamic Head (DyHead) neural network integrated with the complete IoU loss. • At test time, either T2FS can be used alone or blended with any available DWI series (after co-registration). • A 53.5% precision and ∼ 78% sensitivity at 4 FP/vol was seen with ≥ 10% improvement (p < 0.05) over prior works. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. Global-Local attention network with multi-task uncertainty loss for abnormal lymph node detection in MR images.

Author: Wang, Shuai, Zhu, Yingying, Lee, Sungwon, Elton, Daniel C., Shen, Thomas C., Tang, Youbao, Peng, Yifan, Lu, Zhiyong, and Summers, Ronald M.
Subjects: *LYMPH nodes, *MAGNETIC resonance imaging, *MULTICASTING (Computer networks), *MAGNETIC resonance
Abstract: • We propose a novel network for the universal abnormal lymph node detection in MR images, which has great clinical value for the diagnosis of numerous diseases. • We design a global-local context module to encode the image global and local scale context information for the detection and utilize the channel attention mechanism to weight different contexts. • We introduce a multi-task uncertainty loss to adaptively balance the losses of different tasks, which can effectively alleviate the burden for tuning the loss weights by hand. • We build a large-scale MRI abnormal lymph node dataset, which includes a total of 821 abnormal abdominal lymph nodes of 41 types from 584 different patients. Moreover, 123 images with complete 3D volume annotations are delineated by an experienced radiologist. [Display omitted] Accurate and reliable detection of abnormal lymph nodes in magnetic resonance (MR) images is very helpful for the diagnosis and treatment of numerous diseases. However, it is still a challenging task due to similar appearances between abnormal lymph nodes and other tissues. In this paper, we propose a novel network based on an improved Mask R-CNN framework for the detection of abnormal lymph nodes in MR images. Instead of laboriously collecting large-scale pixel-wise annotated training data, pseudo masks generated from RECIST bookmarks on hand are utilized as the supervision. Different from the standard Mask R-CNN architecture, there are two main innovations in our proposed network: 1) global-local attention which encodes the global and local scale context for detection and utilizes the channel attention mechanism to extract more discriminative features and 2) multi-task uncertainty loss which adaptively weights multiple objective loss functions based on the uncertainty of each task to automatically search the optimal solution. For the experiments, we built a new abnormal lymph node dataset with 821 RECIST bookmarks of 41 different types of abnormal abdominal lymph nodes from 584 different patients. The experimental results showed the superior performance of our algorithm over compared state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

38 results on '"Lu, Zhiyong"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources