Journal: artificial intelligence in medicine / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Showing total 1,253 results

Start Over Publication Year Range Last 10 years Journal artificial intelligence in medicine

1,253 results

1. Publishing Artificial Intelligence Research Papers: A Tale of Three Journals.

Author: Shortliffe EH, Peleg M, Combi C, Chang AC, and Vinci J
Subjects: Editorial Policies, Humans, Publishing, Artificial Intelligence, Periodicals as Topic
Published: 2021
Full Text: View/download PDF

2. Editorial from the new Editor-in-Chief: Artificial Intelligence in Medicine and the forthcoming challenges.

Author: Combi, Carlo
Subjects: *ARTIFICIAL intelligence, *RESEARCH papers (Students), *EDITORIAL boards, *MEDICINE, *MEDICAL research, *NEWSLETTERS, *PUBLISHING, *BIOINFORMATICS
Published: 2017
Full Text: View/download PDF

3. Hierarchical medical image report adversarial generation with hybrid discriminator

Author: Zhang, Junsan, Cheng, Ming, Cheng, Qiaoqiao, Shen, Xiuxuan, Wan, Yao, Zhu, Jie, and Liu, Mengxuan
Published: 2024
Full Text: View/download PDF

4. Pre-trained language models in medicine: A survey.

Author: Luo, Xudong, Deng, Zhiqi, Yang, Binxia, and Luo, Michael Y.
Abstract: With the rapid progress in Natural Language Processing (NLP), Pre-trained Language Models (PLM) such as BERT, BioBERT, and ChatGPT have shown great potential in various medical NLP tasks. This paper surveys the cutting-edge achievements in applying PLMs to various medical NLP tasks. Specifically, we first brief PLMS and outline the research of PLMs in medicine. Next, we categorise and discuss the types of tasks in medical NLP, covering text summarisation, question-answering, machine translation, sentiment analysis, named entity recognition, information extraction, medical education, relation extraction, and text mining. For each type of task, we first provide an overview of the basic concepts, the main methodologies, the advantages of applying PLMs, the basic steps of applying PLMs application, the datasets for training and testing, and the metrics for task evaluation. Subsequently, a summary of recent important research findings is presented, analysing their motivations, strengths vs weaknesses, similarities vs differences, and discussing potential limitations. Also, we assess the quality and influence of the research reviewed in this paper by comparing the citation count of the papers reviewed and the reputation and impact of the conferences and journals where they are published. Through these indicators, we further identify the most concerned research topics currently. Finally, we look forward to future research directions, including enhancing models' reliability, explainability, and fairness, to promote the application of PLMs in clinical practice. In addition, this survey also collect some download links of some model codes and the relevant datasets, which are valuable references for researchers applying NLP techniques in medicine and medical professionals seeking to enhance their expertise and healthcare service through AI technology. • Brief PLMs, highlighting their effectiveness in various NLP tasks in medicine • Explore the advanced applications of PLMs in various tasks of medical NLP • Summarise each task and review recent researches, especially their pros and cons • Identify future research, focusing on reliability, explainability, and fairness • Collect download links for datasets and codes, aiding the research of AI in medicine [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Systematic literature review on reinforcement learning in non-communicable disease interventions.

Author: Zhao, Yanfeng, Chaw, Jun Kit, Liu, Lin, Chaw, Sook Hui, Ang, Mei Choo, and Ting, Tin Tin
Abstract: There is evidence that reducing modifiable risk factors and strengthening medical and health interventions can reduce early mortality and economic losses from non-communicable diseases (NCDs). Machine learning (ML) algorithms have been successfully applied to preventing and controlling NCDs. Reinforcement learning (RL) is the most promising of these approaches because of its ability to dynamically adapt interventions to NCD disease progression and its commitment to achieving long-term intervention goals. This paper reviews the preferred algorithms, data sources, design details, and obstacles to clinical application in existing studies to facilitate the early application of RL algorithms in clinical practice research for NCD interventions. We screened 40 relevant papers for quantitative and qualitative analysis using the PRISMA review flow diagram. The results show that researchers tend to use Deep Q-Network (DQN) and Actor-Critic as well as their improved or hybrid algorithms to train and validate RL models on retrospective datasets. Often, the patient's physical condition is the main defining parameter of the state space, while interventions are the main defining parameter of the action space. Mostly, changes in the patient's physical condition are used as a basis for immediate rewards to the agent. Various attempts have been made to address the challenges to clinical application, and several approaches have been proposed from existing research. However, as there is currently no universally accepted solution, the use of RL algorithms in clinical practice for NCD interventions necessitates more comprehensive responses to the issues addressed in this paper, which are safety, interpretability, training efficiency, and the technical aspect of exploitation and exploration in RL algorithms. • Reinforcement learning (RL) approaches enhance strategies for managing non-communicable diseases (NCDs). • 40 relevant papers for quantitative and qualitative analysis using the PRISMA review flow diagram. • This review focuses on the methods, data sources, and challenges when building a RL model for NCDs interventions. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. Challenges and strategies for wide-scale artificial intelligence (AI) deployment in healthcare practices: A perspective for healthcare organizations.

Author: Esmaeilzadeh, Pouyan
Abstract: Healthcare organizations have realized that Artificial intelligence (AI) can provide a competitive edge through personalized patient experiences, improved patient outcomes, early diagnosis, augmented clinician capabilities, enhanced operational efficiencies, or improved medical service accessibility. However, deploying AI-driven tools in the healthcare ecosystem could be challenging. This paper categorizes AI applications in healthcare and comprehensively examines the challenges associated with deploying AI in medical practices at scale. As AI continues to make strides in healthcare, its integration presents various challenges, including production timelines, trust generation, privacy concerns, algorithmic biases, and data scarcity. The paper highlights that flawed business models and wrong workflows in healthcare practices cannot be rectified merely by deploying AI-driven tools. Healthcare organizations should re-evaluate root problems such as misaligned financial incentives (e.g., fee-for-service models), dysfunctional medical workflows (e.g., high rates of patient readmissions), poor care coordination between different providers, fragmented electronic health records systems, and inadequate patient education and engagement models in tandem with AI adoption. This study also explores the need for a cultural shift in viewing AI not as a threat but as an enabler that can enhance healthcare delivery and create new employment opportunities while emphasizing the importance of addressing underlying operational issues. The necessity of investments beyond finance is discussed, emphasizing the importance of human capital, continuous learning, and a supportive environment for AI integration. The paper also highlights the crucial role of clear regulations in building trust, ensuring safety, and guiding the ethical use of AI, calling for coherent frameworks addressing transparency, model accuracy, data quality control, liability, and ethics. Furthermore, this paper underscores the importance of advancing AI literacy within academia to prepare future healthcare professionals for an AI-driven landscape. Through careful navigation and proactive measures addressing these challenges, the healthcare community can harness AI's transformative power responsibly and effectively, revolutionizing healthcare delivery and patient care. The paper concludes with a vision and strategic suggestions for the future of healthcare with AI, emphasizing thoughtful, responsible, and innovative engagement as the pathway to realizing its full potential to unlock immense benefits for healthcare organizations, physicians, nurses, and patients while proactively mitigating risks. • This study categorizes AI applications in healthcare and analyzes their deployment challenges at scale in medical practices. • The paper emphasizes a cultural shift to view AI as a healthcare delivery enhancer and job creator, not a threat. • To integrate AI, investments in finance and human capital, continuous learning, and a supportive environment are needed. • Clear regulatory frameworks should be developed to build trust, ensure safety, and guide the ethical use of AI in healthcare. • The study provides strategic suggestions for responsible AI use in healthcare to maximize benefits and minimize risks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. HR-BGCN [formula omitted] Predicting readmission for heart failure from electronic health records.

Author: Ma, Huiting, Li, Dengao, Zhao, Jumin, Li, Wenjing, Fu, Jian, and Li, Chunxia
Abstract: Heart failure has become a huge public health problem, and failure to accurately predict readmission will further lead to the disease's high cost and high mortality. The construction of readmission prediction model can assist doctors in making decisions to prevent patients from deteriorating and reduce the cost burden. This paper extracts the patient discharge records from the MIMIC-III database. It divides the patients into three research categories: no readmission, readmission within 30 days, and readmission after 30 days, to predict the readmission of patients. We propose the HR-BGCN model to predict the readmission of patients. First, we use the Adaptive-TMix to improve the prediction indicators of a few categories and reduce the impact of unbalanced categories. Then, the knowledge-informed graph attention mechanism is proposed. By introducing a document-level explicit diagram structure, the coding ability of graph node features is significantly improved. The paragraph-level representation obtained through graph learning is combined with the context token-level representation of BERT, and finally, the multi-classification task is carried out. We also compare several typical graph learning classification models to verify the model's effectiveness, such as the IA-GCN model, GAT model, etc. The results show that the average F1 score of the HR-BGCN model proposed in this paper for 30-day readmission of heart failure patients is 88.26%, and the average accuracy is 90.47%. The HR-BGCN model is significantly better than the graph learning classification model for predicting heart failure readmission. It can help doctors predict the 30-day readmission of patients, then reduce the readmission rate of patients. • The clinical notes (unstructured data) are used for prognostic prediction. • The Adaptive-TMix is proposed to reduce the impact of unbalanced categories. • The proposed knowledge-informed graph attention mechanism can improve coding ability of graph node features. • The model can complete a multi-class prediction task and perform well. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

8. ECG-based cardiac arrhythmias detection through ensemble learning and fusion of deep spatial–temporal and long-range dependency features.

Author: Din, Sadia, Qaraqe, Marwa, Mourad, Omar, Qaraqe, Khalid, and Serpedin, Erchin
Abstract: Cardiac arrhythmia is one of the prime reasons for death globally. Early diagnosis of heart arrhythmia is crucial to provide timely medical treatment. Heart arrhythmias are diagnosed by analyzing the electrocardiogram (ECG) of patients. Manual analysis of ECG is time-consuming and challenging. Hence, effective automated detection of heart arrhythmias is important to produce reliable results. Different deep-learning techniques to detect heart arrhythmias such as Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Transformer, and Hybrid CNN–LSTM were proposed. However, these techniques, when used individually, are not sufficient to effectively learn multiple features from the ECG signal. The fusion of CNN and LSTM overcomes the limitations of CNN in the existing studies as CNN–LSTM hybrids can extract spatiotemporal features. However, LSTMs suffer from long-range dependency issues due to which certain features may be ignored. Hence, to compensate for the drawbacks of the existing models, this paper proposes a more comprehensive feature fusion technique by merging CNN, LSTM, and Transformer models. The fusion of these models facilitates learning spatial, temporal, and long-range dependency features, hence, helping to capture different attributes of the ECG signal. These features are subsequently passed to a majority voting classifier equipped with three traditional base learners. The traditional learners are enriched with deep features instead of handcrafted features. Experiments are performed on the MIT-BIH arrhythmias database and the model performance is compared with that of the state-of-art models. Results reveal that the proposed model performs better than the existing models yielding an accuracy of 99.56%. • This paper proposes a rich feature fusion technique for cardiac arrhythmia detection from ECG by merging three deep learning models, namely, CNN, CNN–LSTM, and Transformer. • Integrating these models combines the advantages of all these individual models, overcomes their limitations, and learns significant features from the input ECG signal. • The deep spatial, temporal, and long-range dependency patterns learned by the aforementioned models are fused via concatenation and fed to a majority voting classifier with three traditional base learners: SVM, LR, and RF. • MIT-BIH Arrhythmia database is used for experimentation. A comparison of the results with the baseline models and state-of-the-art models shows that the proposed model outdoes the existing models achieving an accuracy of 95.56% and an F-score of 99.34%. • The results show that the proposed model outperforms the existing models in terms of precision, recall, F-score, and accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

9. The Concordance Index decomposition: A measure for a deeper understanding of survival prediction models.

Author: Alabdallah, Abdallah, Ohlsson, Mattias, Pashami, Sepideh, and Rögnvaldsson, Thorsteinn
Abstract: The Concordance Index (C-index) is a commonly used metric in Survival Analysis for evaluating the performance of a prediction model. In this paper, we propose a decomposition of the C-index into a weighted harmonic mean of two quantities: one for ranking observed events versus other observed events, and the other for ranking observed events versus censored cases. This decomposition enables a finer-grained analysis of the relative strengths and weaknesses between different survival prediction methods. The usefulness of this decomposition is demonstrated through benchmark comparisons against classical models and state-of-the-art methods, together with the new variational generative neural-network-based method (SurVED) proposed in this paper. The performance of the models is assessed using four publicly available datasets with varying levels of censoring. Using the C-index decomposition and synthetic censoring, the analysis shows that deep learning models utilize the observed events more effectively than other models. This allows them to keep a stable C-index in different censoring levels. In contrast to such deep learning methods, classical machine learning models deteriorate when the censoring level decreases due to their inability to improve on ranking the events versus other events. • The C-Index Decomposition is proposed as a weighted harmonic mean of the C-index of the events vs other events and the C-index of events vs censored cases. • The C-Index Decomposition gives a more detailed image of the survival model performance. • SurVED; a new survival model based on Variational inference formulation was shown to be better at improving performance with more observed event. • Models which seem to perform similarly using the C-index, showed different behavior with respect to the terms of the C-index decomposition. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. Diseases diagnosis based on artificial intelligence and ensemble classification.

Author: Rabie, Asmaa H. and Saleh, Ahmed I.
Abstract: In recent years, Computer Aided Diagnosis (CAD) has become an important research area that attracted a lot of researchers. In medical diagnostic systems, several attempts have been made to build and enhance CAD applications to avoid errors that can cause dangerously misleading medical treatments. The most exciting opportunity for promoting the performance of CAD system can be accomplished by integrating Artificial Intelligence (AI) in medicine. This allows the effective automation of traditional manual workflow, which is slow, inaccurate and affected by human errors. This paper aims to provide a complete Computer Aided Disease Diagnosis (CAD2) strategy based on Machine Learning (ML) techniques that can help clinicians to make better medical decisions. The proposed CAD2 consists of three main sequential phases, namely; (i) Outlier Rejection Phase (ORP), (ii) Feature Selection Phase (FSP), and (iii) Classification Phase (CP). ORP is implemented to reject outliers using new Outlier Rejection Technique (ORT) that contains two sequential stages called Fast Outlier Rejection (FOR) and Accurate Outlier Rejection (AOR). The most informative features are selected through FSP using Hybrid Selection Technique (HST). HST includes two main stages called Quick Selection Stage (QS2) using fisher score as a filter method and Precise Selection Stage (PS2) using a Hybrid Bio-inspired Optimization (HBO) technique as a wrapper method. Finally, actual diagnose takes place through CP, which relies on Ensemble Classification Technique (ECT). The proposed CAD2 has been tested experimentally against recent disease diagnostic strategies using two different datasets in which the first contains several diseases, while the second includes data for Covid-19 patients only. Experimental results have proven the high efficiency of the proposed CAD2 in terms of accuracy, error, precision, and recall compared with other competitors. Additionally, CAD2 strategy provides the best Wilcoxon signed rank test and Friedman test measurements against other strategies according to both datasets. It is concluded that CAD2 strategy based on ORP, FSP, and CP gave an accurate diagnosis compared to other strategies because it gave the highest accuracy and the lowest error and implementation time. • This paper provides Computer Aided Disease Diagnosis (CAD2) strategy by AI methods. • CAD2 includes outlier rejection, feature selection, and classification phases. • Rejection phase eliminates outliers but selection phase selects the best features. • Actual diagnosis takes place through classification phase using AI techniques. • The provided CAD2 introduces the maximum diagnose accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. BGRL: Basal Ganglia inspired Reinforcement Learning based framework for deep brain stimulators.

Author: Agarwal, Harsh and Rathore, Heena
Abstract: Deep Brain Stimulation (DBS) is an implantable medical device used for electrical stimulation to treat neurological disorders. Traditional DBS devices provide fixed frequency pulses, but personalized adjustment of stimulation parameters is crucial for optimal treatment. This paper introduces a Basal Ganglia inspired Reinforcement Learning (BGRL) approach, incorporating a closed-loop feedback mechanism to suppress neural synchrony during neurological fluctuations. The BGRL approach leverages the resemblance between the Basal Ganglia region of brain by incorporating the actor–critic architecture of reinforcement learning (RL). Simulation results demonstrate that BGRL significantly reduces synchronous electrical pulses compared to other standard RL algorithms. BGRL algorithm outperforms existing RL methods in terms of suppression capability and energy consumption, validated through comparisons using ensemble oscillators. Results shown in the paper demonstrate BGRL suppressed the synchronous electrical pulses across three signaling regimes namely regular, chaotic and bursting by 40%, 146% and 40% respectively as compared to soft actor–critic model. BGRL shows promise in effectively suppressing neural synchrony in DBS therapy, providing an efficient alternative to open-loop methodologies. • Stabilize the synchronization and convergence points allowing for more accurate estimation. • Avoid overestimated bias to generalize the model well. • Adds noise to the target action to exploit the Q-function errors by smoothing out Q-values. • Ran simulations which capture the characteristics of closed loop DBS. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. Value function assessment to different RL algorithms for heparin treatment policy of patients with sepsis in ICU.

Author: Liu, Jiang, Xie, Yihao, Shu, Xin, Chen, Yuwen, Sun, Yizhu, Zhong, Kunhua, Liang, Hao, Li, Yujie, Yang, Chunyong, Han, Yan, Zou, Yuwei, Zhuyi, Ziting, Huang, Jiahao, Li, Junhong, Hu, Xiaoyan, and Yi, Bin
Abstract: Heparin is a critical aspect of managing sepsis after abdominal surgery, which can improve microcirculation, protect organ function, and reduce mortality. However, there is no clinical evidence to support decision-making for heparin dosage. This paper proposes a model called SOFA-MDP, which utilizes SOFA scores as states of MDP, to investigate clinic policies. Different algorithms provide different value functions, making it challenging to determine which value function is more reliable. Due to ethical restrictions, we cannot test all policies on patients. To address this issue, we proposed two value function assessment methods: action similarity rate and relative gain. We experimented with heparin treatment policies for sepsis patients after abdominal surgery using MIMIC-IV. In the experiments, TD (0) shows the most reliable performance. Using the action similarity rate and relative gain to assess AI policy from TD (0) , the agreement rates between AI policy and "good" physician's actual treatment are 64.6% and 73.2%, while the agreement rates between AI policy and "bad" physician's actual treatment are 44.1% and 35.8%, the gaps are 20.5% and 37.4%, respectively. External validation using action similarity rate and relative gain based on eICU resulted in agreement rates of 61.5% and 69.1% with the "good" physician's treatment, and 45.2% and 38.3% with the "bad" physician's treatment, with gaps of 16.3% and 30.8%, respectively. In conclusion, the model provides instructive support for clinical decisions, and the evaluation methods accurately distinguish reliable and unreasonable outcomes. • This paper proposes a model called SOFA-MDP , which utilizes SOFA scores as states and the difference of SOFA scores of successive states as reward function, to investigate heparin treatment policy of patients with sepsis in ICU. • Moreover, we present two novel assessment methods: action similarity rate and relative gain , to estimate the clinic policies. The experimental results show that our SOFA-MDP model offers instructive support for clinical decisions of heparin treatment and our assessment methods can distinguish between reliable and unreasonable outcomes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Depression detection for twitter users using sentiment analysis in English and Arabic tweets.

Author: Helmy, AbdelMoniem, Nassar, Radwa, and Ramdan, Nagy
Abstract: Since depression often results in suicidal thoughts and leaves a person severely disabled daily, there is an elevated risk of premature mortality due to mental problems caused by depression. Therefore, it's crucial to identify the patient's mental illness as soon as possible. People are increasingly using social media platforms to express their opinions and share daily activities, which makes online platforms rich sources of early depression detection. The contribution of this paper is multifold. First , it presents five machine-learning models for Arabic and English depression detection using Twitter text. The best model for Arabic text achieved an f1-score of 96.6 % for binary classification to depressed and Non_dep. For English text without negation, the model achieved 92 % for binary classification and 88 % for multi-classification (depressed, indifferent, happy). For English text with negation, an 87 %, and 85 % f1 score was achieved for binary and multi-classification respectively. Second , the work introduced a manually annotated Arabic_Dep_tweets_10,000 corpus of 10.000 Arabic tweets, which covered neutral tweets as well as a variety of depressed and happy terms. In addition, two automatically annotated English corpora, Eng_without_negation_60.000 corpus of 60,172 English tweets and Eng_with_negation_57.000 corpus of 57,392 English tweets. Both covered a wide range of depressed and cheerful terms; however, Negation was included in the Eng_with_negation_57.000 corpus. Finally , this paper exposes a depression-detection web application which implements our optimal models to detect tweets that contain depression symptoms and predict depression trends for a person either using English or Arabic language. • More than two-thirds of suicides each year are caused by depression, which is the most common mental illness. • Social media posts can be a useful tool for tracking a variety of mental health conditions, including depression. • The lack of adequate Arabic corpora makes it difficult to use machine learning techniques in depression detection for the Arabic people. • Existing models for English language use binary classification only and does not handle negation in the text. • Twittpy is a web application developed to detect depression symptoms for persons using their text from Arabic or English tweets. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. Machine intelligence and medical cyber-physical system architectures for smart healthcare: Taxonomy, challenges, opportunities, and possible solutions.

Author: Shaikh, Tawseef Ayoub, Rasool, Tabasum, and Verma, Prabal
Abstract: Hospitals use medical cyber-physical systems (MCPS) more often to give patients quality continuous care. MCPS isa life-critical, context-aware, networked system of medical equipment. It has been challenging to achieve high assurance in system software, interoperability, context-aware intelligence, autonomy, security and privacy, and device certifiability due to the necessity to create complicated MCPS that are safe and efficient. The MCPS system is shown in the paper as a newly developed application case study of artificial intelligence in healthcare. Applications for various CPS-based healthcare systems are discussed, such as telehealthcare systems for managing chronic diseases (cardiovascular diseases, epilepsy, hearing loss, and respiratory diseases), supporting medication intake management, and tele-homecare systems. The goal of this study is to provide a thorough overview of the essential components of the MCPS from several angles, including design, methodology, and important enabling technologies, including sensor networks, the Internet of Things (IoT), cloud computing, and multi-agent systems. Additionally, some significant applications are investigated, such as smart cities, which are regarded as one of the key applications that will offer new services for industrial systems, transportation networks, energy distribution, monitoring of environmental changes, business and commerce applications, emergency response, and other social and recreational activities.The four levels of an MCPS's general architecture—data collecting, data aggregation, cloud processing, and action—are shown in this study. Different encryption techniques must be employed to ensure data privacy inside each layer due to the variations in hardware and communication capabilities of each layer. We compare established and new encryption techniques based on how well they support safe data exchange, secure computing, and secure storage. Our thorough experimental study of each method reveals that, although enabling innovative new features like secure sharing and safe computing, developing encryption approaches significantly increases computational and storage overhead. To increase the usability of newly developed encryption schemes in an MCPS and to provide a comprehensive list of tools and databases to assist other researchers, we provide a list of opportunities and challenges for incorporating machine intelligence-based MCPS in healthcare applications in our paper's conclusion. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

15. MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering.

Author: Alonso, Iñigo, Oronoz, Maite, and Agerri, Rodrigo
Abstract: Large Language Models (LLMs) have the potential of facilitating the development of Artificial Intelligence technology to assist medical experts for interactive decision support. This potential has been illustrated by the state-of-the-art performance obtained by LLMs in Medical Question Answering, with striking results such as passing marks in licensing medical exams. However, while impressive, the required quality bar for medical applications remains far from being achieved. Currently, LLMs remain challenged by outdated knowledge and by their tendency to generate hallucinated content. Furthermore, most benchmarks to assess medical knowledge lack reference gold explanations which means that it is not possible to evaluate the reasoning of LLMs predictions. Finally, the situation is particularly grim if we consider benchmarking LLMs for languages other than English which remains, as far as we know, a totally neglected topic. In order to address these shortcomings, in this paper we present MedExpQA, the first multilingual benchmark based on medical exams to evaluate LLMs in Medical Question Answering. To the best of our knowledge, MedExpQA includes for the first time reference gold explanations, written by medical doctors, of the correct and incorrect options in the exams. Comprehensive multilingual experimentation using both the gold reference explanations and Retrieval Augmented Generation (RAG) approaches show that performance of LLMs, with best results around 75 accuracy for English, still has large room for improvement, especially for languages other than English, for which accuracy drops 10 points. Therefore, despite using state-of-the-art RAG methods, our results also demonstrate the difficulty of obtaining and integrating readily available medical knowledge that may positively impact results on downstream evaluations for Medical Question Answering. Data, code, and fine-tuned models will be made publicly available. 1 1 https://huggingface.co/datasets/HiTZ/MedExpQA. • MedExpQA: the first multilingual benchmark for MedicalQA including gold reference explanations. • Comparison of gold and automatically extracted medical knowledge via RAG techniques. • Fine-tuning makes redundant the external knowledge obtained via RAG. • Overall performance of LLMs with or without RAG still has large room for improvement. • Performance for French, Italian and Spanish lower for every LLM in every setting. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. CHNet: A multi-task global–local Collaborative Hybrid Network for KRAS mutation status prediction in colorectal cancer.

Author: Cai, Meiling, Zhao, Lin, Qiang, Yan, Wang, Long, and Zhao, Juanjuan
Abstract: Accurate prediction of Kirsten rat sarcoma (KRAS) mutation status is crucial for personalized treatment of advanced colorectal cancer patients. However, despite the excellent performance of deep learning models in certain aspects, they often overlook the synergistic promotion among multiple tasks and the consideration of both global and local information, which can significantly reduce prediction accuracy. To address these issues, this paper proposes an innovative method called the Multi-task Global–Local Collaborative Hybrid Network (CHNet) aimed at more accurately predicting patients' KRAS mutation status. CHNet consists of two branches that can extract global and local features from segmentation and classification tasks, respectively, and exchange complementary information to collaborate in executing these tasks. Within the two branches, we have designed a Channel-wise Hybrid Transformer (CHT) and a Spatial-wise Hybrid Transformer (SHT). These transformers integrate the advantages of both Transformer and CNN, employing cascaded hybrid attention and convolution to capture global and local information from the two tasks. Additionally, we have created an Adaptive Collaborative Attention (ACA) module to facilitate the collaborative fusion of segmentation and classification features through guidance. Furthermore, we introduce a novel Class Activation Map (CAM) loss to encourage CHNet to learn complementary information between the two tasks. We evaluate CHNet on the T2-weighted MRI dataset, and achieve an accuracy of 88.93% in KRAS mutation status prediction, which outperforms the performance of representative KRAS mutation status prediction methods. The results suggest that our CHNet can more accurately predict KRAS mutation status in patients via a multi-task collaborative facilitation and considering global–local information way, which can assist doctors in formulating more personalized treatment strategies for patients. • Two tasks collaborate to predict KRAS mutation status in colorectal cancer. • New Hybrid Transformers capture global and local information. • Fusion Mechanisms for the guidance of advanced semantic segmentation features. • A novel Class Activation Map loss based on modeling the correlation between global features. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. A systematic literature review on the significance of deep learning and machine learning in predicting Alzheimer's disease.

Author: Kaur, Arshdeep, Mittal, Meenakshi, Bhatti, Jasvinder Singh, Thareja, Suresh, and Singh, Satwinder
Abstract: Alzheimer's disease (AD) is the most prevalent cause of dementia, characterized by a steady decline in mental, behavioral, and social abilities and impairs a person's capacity for independent functioning. It is a fatal neurodegenerative disease primarily affecting older adults. The purpose of this literature review is to investigate various AD detection techniques, datasets, input modalities, algorithms, libraries, and performance evaluation metrics used to determine which model or strategy may provide superior performance. The initial search yielded 807 papers, but only 100 research articles were chosen after applying the inclusion-exclusion criteria. This SLR analyzed research items published between January 2019 and December 2022. The ACM, Elsevier, IEEE Xplore Digital Library, PubMed, Springer and Taylor & Francis were systematically searched. The current study considers articles that used Magnetic Resonance Imaging (MRI), Positron Emission Tomography (PET), APOe4 genotype, Diffusion Tensor Imaging (DTI) and Cerebrospinal Fluid (CSF) biomarkers. The study was performed following Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. According to the literature survey, most studies (n = 76) used the DL strategy. The datasets used by studies were primarily derived from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. The majority of studies (n = 73) used single-modality neuroimaging data, while the remaining used multi-modal input data. In a multi-modality approach, the combination of MRI and PET scans is commonly preferred. Also, Regarding the algorithm used, Convolution Neural Network (CNN) showed the highest accuracy, 100 %, in classifying AD vs. CN subjects whereas the SVM was the most common ML algorithm, with a maximum accuracy of 99.82 %. • The review of 100 studies were selected for Classification of AD, CN and MCI with ML and DL Algo. • Studies on Machine Learning, Deep Learning for Neuroimaging (MRI, PET), Apoe4 genetic, and CSF biomarkers for disease detection are compared. • MRI and PET are the most widely used biomarkers. The ADNI database is the most prominent dataset for detecting AD • Study results showed that CNN achieved 100% accuracy for ADD. Other evaluation parameters discussed include Precision, Sensitivity, Specificity, F1 score, and AUC. • This review is valuable for researchers working on AI and medical applications with ML/DL-based AD detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. EHR coding with hybrid attention and features propagation on disease knowledge graph.

Author: Xu, Tianhan, Li, Bin, Chen, Ling, Yang, Chao, Gu, Yixun, and Gu, Xiang
Abstract: And sentences associated with these attributes and relationships have been neglected. in this paper ►We propose an end-to-end model called K nowledge G raph E nhanced neural net work (KGENet) to address the above shortcomings. specifically ►We first construct a disease knowledge graph that focuses on the multi-view disease attributes of ICD codes and the disease relationships between these codes. we also use a long sequence encoder to get EHR document representation. most importantly ►KGENet leverages multi-view disease attributes and structured disease relationships for knowledge enhancement through hybrid attention and graph propagation ►Respectively. furthermore ►The above processes can provide attribute-aware and relationship-augmented explainability for the model prediction results based on our disease knowledge graph. experiments conducted on the MIMIC-III benchmark dataset show that KGENet outperforms state-of-the-art models in both model effectiveness and explainability Electronic health record (EHR) coding assigns International Classification of Diseases (ICD) codes to each EHR document. These standard medical codes represent diagnoses or procedures and play a critical role in medical applications. However, EHR is a long medical text that is difficult to represent, the ICD code label space is large, and the labels have an extremely unbalanced distribution. These factors pose challenges to automatic EHR coding. Previous studies have not explored the disease attributes (e.g., symptoms, tests, medications) of ICD codes and the disease relationships (e.g., causes, risk factors, comorbidities) between them. In addition, the important roles of medical • Disease Knowledge Graph with multi-view attributes and relationships. • Hybrid Attention Module for label-specific feature extraction. • Graph Propagation Module for interactive feature extraction. • Attribute-aware and relationship-augmented explainability. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. A joint entity Relation Extraction method for document level Traditional Chinese Medicine texts.

Author: Xu, Wenxuan, Wang, Lin, Zhang, Mingchuan, Zhu, Junlong, Yan, Junqiang, and Wu, Qingtao
Abstract: Chinese medicine is a unique and complex medical system with complete and rich scientific theories. The textual data of Traditional Chinese Medicine (TCM) contains a large amount of relevant knowledge in the field of TCM, which can serve as guidance for accurate disease diagnosis as well as efficient disease prevention and treatment. Existing TCM texts are disorganized and lack a uniform standard. For this reason, this paper proposes a joint extraction framework by using graph convolutional networks to extract joint entity relations on document-level TCM texts to achieve TCM entity relation mining. More specifically, we first finetune the pre-trained language model by using the TCM domain knowledge to obtain the task-specific model. Taking the integrity of TCM into account, we extract the complete entities as well as the relations corresponding to diagnosis and treatment from the document-level medical cases by using multiple features such as word fusion coding, TCM lexicon information, and multi-relational graph convolutional networks. The experimental results show that the proposed method outperforms the state-of-the-art methods. It has an F1-score of 90.7% for Name Entity Recognization and 76.14% for Relation Extraction on the TCM dataset, which significantly improves the ability to extract entity relations from TCM texts. Code is available at https://github.com/xxxxwx/TCMERE. • The proposed method could alleviate entity nesting and relation overlapping. • A large amount data is collected to obtain model in TCM domain. • The multi-type TCM dictionaries are built. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. Transformers and large language models in healthcare: A review.

Author: Nerella, Subhash, Bandyopadhyay, Sabyasachi, Zhang, Jiaqing, Contreras, Miguel, Siegel, Scott, Bumin, Aysegul, Silva, Brandon, Sena, Jessica, Shickel, Benjamin, Bihorac, Azra, Khezeli, Kia, and Rashidi, Parisa
Abstract: With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, including healthcare. In this survey paper, we provide an overview of how this architecture has been adopted to analyze various forms of healthcare data, including clinical NLP, medical imaging, structured Electronic Health Records (EHR), social media, bio-physiological signals, biomolecular sequences. Furthermore, which have also include the articles that used the transformer architecture for generating surgical instructions and predicting adverse outcomes after surgeries under the umbrella of critical care. Under diverse settings, these models have been used for clinical diagnosis, report generation, data reconstruction, and drug/protein synthesis. Finally, we also discuss the benefits and limitations of using transformers in healthcare and examine issues such as computational cost, model interpretability, fairness, alignment with human values, ethical implications, and environmental impact. • Transformers in clinical NLP, Electronic Health Records, and social media data • Transformers in medical imaging (image segmentation, registration, captioning, synthesis) • Transformers for analyzing bio-signals (human activity, EEG, ECG) and biomolecular sequence. • Detailed explanation of basic Transformer Architecture and some popular variants • Discussion on computational costs and the necessity of AI alignment [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

21. Consensus modeling: Safer transfer learning for small health systems.

Author: Tourani, Roshan, Murphree, Dennis H., Sheka, Adam, Melton, Genevieve B., Kor, Daryl J., and Simon, Gyorgy J.
Abstract: Predictive modeling is becoming an essential tool for clinical decision support, but health systems with smaller sample sizes may construct suboptimal or overly specific models. Models become over-specific when beside true physiological effects, they also incorporate potentially volatile site-specific artifacts. These artifacts can change suddenly and can render the model unsafe. To obtain safer models, health systems with inadequate sample sizes may adopt one of the following options. First, they can use a generic model, such as one purchased from a vendor, but often such a model is not sufficiently specific to the patient population and is thus suboptimal. Second, they can participate in a research network. Paradoxically though, sites with smaller datasets contribute correspondingly less to the joint model, again rendering the final model suboptimal. Lastly, they can use transfer learning, starting from a model trained on a large data set and updating this model to the local population. This strategy can also result in a model that is over-specific. In this paper we present the consensus modeling paradigm, which uses the help of a large site (source) to reach a consensus model at the small site (target). We evaluate the approach on predicting postoperative complications at two health systems with 9,044 and 38,045 patients (rare outcomes at about 1% positive rate), and conduct a simulation study to understand the performance of consensus modeling relative to the other three approaches as a function of the available training sample size at the target site. We found that consensus modeling exhibited the least over-specificity at either the source or target site and achieved the highest combined predictive performance. • Health systems with inadequate data may build suboptimal or overly specific models. • Over-specialized models are not robust to institutional changes and can cause harm. • Consensus modeling reduces over-specialization for small health systems. • Consensus modeling improves equity in AI capability and can reduce health disparity. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier.

Author: Lin, Jianying, Chen, Hui, Li, Shan, Liu, Yushuang, Li, Xuan, and Yu, Bin
Subjects: *AMINO acid sequence, *RANDOM forest algorithms, *DATA mining, *PROTEINS, *TARGETED drug delivery, *DRUG development
Abstract: Discovering and accurately locating drug targets is of great significance for the research and development of new drugs. As a different approach to traditional drug development, the machine learning algorithm is used to predict the drug target by mining the data. Because of its advantages of short time and low cost, it has received more and more attention in recent years. In this paper, we propose a novel method for predicting druggable proteins. Firstly, the features of the protein sequence are extracted by combining Chou's pseudo amino acid composition (PseAAC), dipeptide composition (DPC) and reduced sequence (RS), getting the 591 dimension of drug target dataset. Then, the feature information of druggable proteins dataset is selected by genetic algorithm (GA). Finally, we use Bagging ensemble learning to improve SVM classifier to get the final prediction model. The predictive accuracy rate reaches 93.78% by using 5-fold cross-validation and compared with other state-of-the-art predictive methods. The results indicate that the method proposed in this paper has a high reference value for the prediction of potential drug targets, which will successfully play a key role in the drug research and development. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/GA-Bagging-SVM. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

23. Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.

Author: Woldaregay, Ashenafi Zebene, Årsand, Eirik, Walderhaug, Ståle, Albers, David, Mamykina, Lena, Botsis, Taxiarchis, and Hartvigsen, Gunnar
Subjects: *TYPE 1 diabetes, *BLOOD sugar, *MACHINE learning, *MACHINE dynamics, *DECISION support systems, *RECURRENT neural networks, *GLYCEMIC index
Abstract: Background: Diabetes mellitus (DM) is a metabolic disorder that causes abnormal blood glucose (BG) regulation that might result in short and long-term health complications and even death if not properly managed. Currently, there is no cure for diabetes. However, self-management of the disease, especially keeping BG in the recommended range, is central to the treatment. This includes actively tracking BG levels and managing physical activity, diet, and insulin intake. The recent advancements in diabetes technologies and self-management applications have made it easier for patients to have more access to relevant data. In this regard, the development of an artificial pancreas (a closed-loop system), personalized decision systems, and BG event alarms are becoming more apparent than ever. Techniques such as predicting BG (modeling of a personalized profile), and modeling BG dynamics are central to the development of these diabetes management technologies. The increased availability of sufficient patient historical data has paved the way for the introduction of machine learning and its application for intelligent and improved systems for diabetes management. The capability of machine learning to solve complex tasks with dynamic environment and knowledge has contributed to its success in diabetes research.Motivation: Recently, machine learning and data mining have become popular, with their expanding application in diabetes research and within BG prediction services in particular. Despite the increasing and expanding popularity of machine learning applications in BG prediction services, updated reviews that map and materialize the current trends in modeling options and strategies are lacking within the context of BG prediction (modeling of personalized profile) in type 1 diabetes.Objective: The objective of this review is to develop a compact guide regarding modeling options and strategies of machine learning and a hybrid system focusing on the prediction of BG dynamics in type 1 diabetes. The review covers machine learning approaches pertinent to the controller of an artificial pancreas (closed-loop systems), modeling of personalized profiles, personalized decision support systems, and BG alarm event applications. Generally, the review will identify, assess, analyze, and discuss the current trends of machine learning applications within these contexts.Method: A rigorous literature review was conducted between August 2017 and February 2018 through various online databases, including Google Scholar, PubMed, ScienceDirect, and others. Additionally, peer-reviewed journals and articles were considered. Relevant studies were first identified by reviewing the title, keywords, and abstracts as preliminary filters with our selection criteria, and then we reviewed the full texts of the articles that were found relevant. Information from the selected literature was extracted based on predefined categories, which were based on previous research and further elaborated through brainstorming among the authors.Results: The initial search was done by analyzing the title, abstract, and keywords. A total of 624 papers were retrieved from DBLP Computer Science (25), Diabetes Technology and Therapeutics (31), Google Scholar (193), IEEE (267), Journal of Diabetes Science and Technology (31), PubMed/Medline (27), and ScienceDirect (50). After removing duplicates from the list, 417 records remained. Then, we independently assessed and screened the articles based on the inclusion and exclusion criteria, which eliminated another 204 papers, leaving 213 relevant papers. After a full-text assessment, 55 articles were left, which were critically analyzed. The inter-rater agreement was measured using a Cohen Kappa test, and disagreements were resolved through discussion.Conclusion: Due to the complexity of BG dynamics, it remains difficult to achieve a universal model that produces an accurate prediction in every circumstance (i.e., hypo/eu/hyperglycemia events). Recently, machine learning techniques have received wider attention and increased popularity in diabetes research in general and BG prediction in particular, coupled with the ever-growing availability of a self-collected health data. The state-of-the-art demonstrates that various machine learning techniques have been tested to predict BG, such as recurrent neural networks, feed-forward neural networks, support vector machines, self-organizing maps, the Gaussian process, genetic algorithm and programs, deep neural networks, and others, using various group of input parameters and training algorithms. The main limitation of the current approaches is the lack of a well-defined approach to estimate carbohydrate intake, which is mainly done manually by individual users and is prone to an error that can severely affect the predictive performance. Moreover, a universal approach has not been established to estimate and quantify the approximate effect of physical activities, stress, and infections on the BG level. No researchers have assessed model predictive performance during stress and infection incidences in a free-living condition, which should be considered in future studies. Furthermore, a little has been done regarding model portability that can capture inter- and intra-variability among patients. It seems that the effect of time lags between the CGM readings and the actual BG levels is not well covered. However, in general, we foresee that these developments might foster the advancement of next-generation BG prediction algorithms, which will make a great contribution in the effort to develop the long-awaited, so-called artificial pancreas (a closed-loop system). [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

24. Project INSIDE: towards autonomous semi-unstructured human-robot social interaction in autism therapy.

Author: Melo, Francisco S., Sardinha, Alberto, Belo, David, Couto, Marta, Faria, Miguel, Farias, Anabela, Gambôa, Hugo, Jesus, Cátia, Kinarullathil, Mithun, Lima, Pedro, Luz, Luís, Mateus, André, Melo, Isabel, Moreno, Plinio, Osório, Daniel, Paiva, Ana, Pimentel, Jhielson, Rodrigues, João, Sequeira, Pedro, and Solera-Ureña, Rubén
Subjects: *HUMAN-robot interaction, *AUTONOMOUS robots, *SOCIAL interaction, *CHILDREN with autism spectrum disorders, *MOBILE robots, *ROBOT design & construction, *RESEARCH, *RESEARCH methodology, *EVALUATION research, *MEDICAL cooperation, *ROBOTICS, *COMPARATIVE studies, *INTERPERSONAL relations
Abstract: This paper describes the INSIDE system, a networked robot system designed to allow the use of mobile robots as active players in the therapy of children with autism spectrum disorders (ASD). While a significant volume of work has explored the impact of robots in ASD therapy, most such work comprises remotely operated robots and/or well-structured interaction dynamics. In contrast, the INSIDE system allows for complex, semi-unstructured interaction in ASD therapy while featuring a fully autonomous robot. In this paper we describe the hardware and software infrastructure that supports such rich form of interaction, as well as the design methodology that guided the development of the INSIDE system. We also present some results on the use of our system both in pilot and in a long-term study comprising multiple therapy sessions with children at Hospital Garcia de Orta, in Portugal, highlighting the robustness and autonomy of the system as a whole. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

25. A comparison between discrete and continuous time Bayesian networks in learning from clinical time series data with irregularity.

Author: Liu, Manxia, Stella, Fabio, Hommersom, Arjen, Lucas, Peter J.F., Boer, Lonneke, and Bischoff, Erik
Subjects: *MISSING data (Statistics), *CONTINUOUS time models, *OBSTRUCTIVE lung diseases, *DATA distribution, *TIME series analysis, *TIME-varying networks, *FIXED interest rates
Abstract: Background: Recently, mobile devices, such as smartphones, have been introduced into healthcare research to substitute paper diaries as data-collection tools in the home environment. Such devices support collecting patient data at different time points over a long period, resulting in clinical time-series data with high temporal complexity, such as time irregularities. Analysis of such time series poses new challenges for machine-learning techniques. The clinical context for the research discussed in this paper is home monitoring in chronic obstructive pulmonary disease (COPD).Objective: The goal of the present research is to find out which properties of temporal Bayesian network models allow to cope best with irregularly spaced multivariate clinical time-series data.Methods: Two mainstream temporal Bayesian network models of multivariate clinical time series are studied: dynamic Bayesian networks, where the system is described as a snapshot at discrete time points, and continuous time Bayesian networks, where transitions between states are modeled in continuous time. Their capability of learning from clinical time series that vary in nature are extensively studied. In order to compare the two temporal Bayesian network types for regularly and irregularly spaced time-series data, three typical ways of observing time-series data were investigated: (1) regularly spaced in time with a fixed rate; (2) irregularly spaced and missing completely at random at discrete time points; (3) irregularly spaced and missing at random at discrete time points. In addition, similar experiments were carried out using real-world COPD patient data where observations are unevenly spaced.Results: For regularly spaced time series, the dynamic Bayesian network models outperform the continuous time Bayesian networks. Similarly, if the data is missing completely at random, discrete-time models outperform continuous time models in most situations. For more realistic settings where data is not missing completely at random, the situation is more complicated. In simulation experiments, both models perform similarly if there is strong prior knowledge available about the missing data distribution. Otherwise, continuous time Bayesian networks perform better. In experiments with unevenly spaced real-world data, we surprisingly found that a dynamic Bayesian network where time is ignored performs similar to a continuous time Bayesian network.Conclusion: The results confirm conventional wisdom that discrete-time Bayesian networks are appropriate when learning from regularly spaced clinical time series. Similarly, we found that time series where the missingness occurs completely at random, dynamic Bayesian networks are an appropriate choice. However, for complex clinical time-series data that motivated this research, the continuous-time models are at least competitive and sometimes better than their discrete-time counterparts. Furthermore, continuous-time models provide additional benefits of being able to provide more fine-grained predictions than discrete-time models, which will be of practical relevance in clinical applications. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

26. Detection of abnormal behaviour for dementia sufferers using Convolutional Neural Networks.

Author: Arifoglu, Damla and Bouchachia, Abdelhamid
Subjects: *CONVOLUTIONAL neural networks, *NEUROFIBRILLARY tangles, *ARTIFICIAL neural networks
Abstract: In recent years, there is a rapid increase in the population of elderly people. However, elderly people may suffer from the consequences of cognitive decline, which is a mental health disorder that primarily affects cognitive abilities such as learning, memory, etc. As a result, the elderly people may get dependent on caregivers to complete daily life tasks. Detecting the early indicators of dementia before it gets worsen and warning the caregivers and medical doctors would be helpful for further diagnosis. In this paper, the problem of activity recognition and abnormal behaviour detection is investigated for elderly people with dementia. First of all, the paper presents a methodology for generating synthetic data reflecting on some behavioural difficulties of people with dementia given the difficulty of obtaining real-world data. Secondly, the paper explores Convolutional Neural Networks (CNNs) to model patterns in activity sequences and detect abnormal behaviour related to dementia. Activity recognition is considered as a sequence labelling problem, while abnormal behaviour is flagged based on the deviation from normal patterns. Moreover, the performance of CNNs is compared against the state-of-art methods such as Naïve Bayes (NB), Hidden Markov Models (HMMs), Hidden Semi-Markov Models (HSMM), Conditional Random Fields (CRFs). The results obtained indicate that CNNs are competitive with those state-of-art methods. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

27. A systematic literature review of machine learning based risk prediction models for diabetic retinopathy progression.

Author: Usman, Tiwalade Modupe, Saheed, Yakub Kayode, Nsang, Augustine, Ajibesin, Abel, and Rakshit, Sandip
Subjects: *LITERATURE reviews, *MACHINE learning, *PREDICTION models, *DIABETIC retinopathy, *ARTIFICIAL intelligence, *EVIDENCE gaps
Abstract: Diabetic Retinopathy (DR) is the most popular debilitating impairment of diabetes and it progresses symptom-free until a sudden loss of vision occurs. Understanding the progression of DR is a pressing issue in clinical research and practice. In this systematic review of articles on Machine Learning (ML) based risk prediction models for DR progression, ever since the use of Artificial Intelligence (AI) for DR detection, there have been more cross-sectional studies with different algorithms of use of AI, there haven't been many longitudinal studies for the AI based risk prediction models. This paper proposes a novel review to fill in the gaps identified in current reviews and facilitate other researchers with current research solutions for developing AI-based risk prediction models for DR progression and closely related problems; synthesize the current results from these studies and identify research challenges, limitations and gaps to inform the selection of machine learning techniques and predictors to build novel prediction models. Additionally, this paper suggested six (6) deep AI-related technical and critical discussion of the adopted strategies and approaches. The Systematic Literature Review (SLR) methodology was employed to gather relevant studies. We searched IEEE Xplore, PubMed, Springer Link, Google Scholar, and Science Direct electronic databases for papers published from January 2017 to 30th April 2023. Thirteen (13) studies were chosen on the basis of their relevance to the review questions and satisfying the selection criteria. However, findings from the literature review exposed some critical research gaps that need to be addressed in future research to improve on the performance of risk prediction models for DR progression. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

28. CEHMR: Curriculum learning enhanced hierarchical multi-label classification for medication recommendation.

Author: Sun, Mengxuan, Niu, Jinghao, Yang, Xuebing, Gu, Yifan, and Zhang, Wensheng
Subjects: *PRESCRIPTION writing, *ELECTRONIC health records, *CURRICULUM frameworks, *DRUG interactions, *MEDICAL databases, *DEEP learning
Abstract: The medication recommendation (MR) or medication combination prediction task aims to predict effective prescriptions given accurate patient representations derived from electronic health records (EHRs), which contributes to improving the quality of clinical decision-making, especially for patients with multi-morbidity. Although in recent years deep learning technology has achieved great success in MR, the performance of current multi-label based MR solutions is unsatisfactory. They mainly focus on improving the patient representation module and modeling the medication label dependencies such as drug–drug interaction (DDI) correlation and co-occurrence relationship. However, the hierarchical dependency among medication labels and diversity of difficulty among MR training examples lack sufficient consideration. In this paper, we propose a framework of Curriculum learning Enhanced Hierarchical multi-label classification for MR (CEHMR). Motivated by the category hierarchy of medications which organizes standard medication codes in a hierarchical structure, we utilize it to provide more trustworthy prior knowledge for modeling label dependency. Specifically, we design a hierarchical multi-label classifier with a learnable gate fusion layer, to simultaneously capture the level-independent (local) and level-dependent (global) hierarchical information in the medication hierarchy. In addition, to overcome the diversity of training example difficulties, and progressively achieve a smoother training process, we introduce a bootstrap-based curriculum learning strategy. Hence, the example difficulty can be measured based on the predictive performance of the MR model, and then all training examples would be retrained from easy to hard under the guidance of a predefined training scheduler. Experiments on the real-world medical MIMIC-III database demonstrate that the proposed framework can achieve state-of-the-art performance compared with seven representative baselines, and extensive ablation studies validate the effectiveness of each component of CEHMR. [Display omitted] • This paper formulates the medication recommendation task as a hierarchical multi-label classification problem. • The proposed CEHMR models the medication dependency by enabling classifiers to learn the prior hierarchy. • CEHMR measures the difficulty of each training example and progressively achieves a smoother training process. • The experimental testing performed on the MIMIC-III data set confirms the advantages of CEHMR. • Extensive experiments demonstrate its potential in dealing with the diversity of training example difficulties. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

29. Monkeypox diagnosis using ensemble classification.

Author: Rabie, Asmaa H. and Saleh, Ahmed I.
Subjects: *MONKEYPOX, *VIRUS diseases, *K-nearest neighbor classification, *FEATURE selection, *DEEP learning
Abstract: The world has recently been exposed to a fierce attack from many viral diseases, such as Covid-19, that exhausted medical systems around the world. Such attack had a negative impact not only on the health status of people or the high death rate, but also had a bad impact on the economic situation, which affected all countries of the world especially the poor and the developing ones. Monkeypox is one of the latest viral diseases that may cause a pandemic in the near future if not dealt and diagnosed with appropriately. This paper provides a new strategy for diagnosing monkeypox, which is called; Accurate Monkeypox Diagnosing Strategy (AMDS). The proposed AMDS consists of two phases, which are; (i) pre-processing and (ii) classification. During the pre-processing phase, the most effective feature are selected using Binary Tiki-Taka Algorithm (BTTA). On the other hand, in the classification phase, ensemble classification is used for diagnosing new cases, which combines evidence from three different new classifiers, namely; (a) Layered K-Nearest Neighbors (LKNN), (b) Statistical Naïve Bayes (SNB), and (c) Deep Learning Classifier (DLC). Moreover, the decisions of the proposed classifiers are merged in a new voting scheme called Fuzzified Voting Scheme (FVS). AMDS has been compared against recent diagnostic strategies. Experimental results have proven that AMDS outperforms other monkeypox diagnostic strategies as it introduces the most accurate diagnosis according to two different datasets. • This paper provides Accurate Monkeypox Diagnosing Strategy (AMDS). • AMDS consists of pre-processing and classification phases. • In pre-processing phase, binary Tiki-Taka algorithm as feature selection is used. • In the classification phase, ensemble classification is used to diagnose new cases. • Results have proven that AMDS outperforms other monkeypox diagnostic strategies. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

30. FDA-approved machine learning algorithms in neuroradiology: A systematic review of the current evidence for approval.

Author: Yearley, Alexander G., Goedmakers, Caroline M.W., Panahi, Armon, Doucette, Joanne, Rana, Aakanksha, Ranganathan, Kavitha, and Smith, Timothy R.
Subjects: *MACHINE learning, *ARTIFICIAL intelligence, *MEDICAL protocols, *NEURORADIOLOGY, *CENTRAL nervous system
Abstract: Over the past decade, machine learning (ML) and artificial intelligence (AI) have become increasingly prevalent in the medical field. In the United States, the Food and Drug Administration (FDA) is responsible for regulating AI algorithms as "medical devices" to ensure patient safety. However, recent work has shown that the FDA approval process may be deficient. In this study, we evaluate the evidence supporting FDA-approved neuroalgorithms, the subset of machine learning algorithms with applications in the central nervous system (CNS), through a systematic review of the primary literature. Articles covering the 53 FDA-approved algorithms with applications in the CNS published in PubMed, EMBASE, Google Scholar and Scopus between database inception and January 25, 2022 were queried. Initial searches identified 1505 studies, of which 92 articles met the criteria for extraction and inclusion. Studies were identified for 26 of the 53 neuroalgorithms, of which 10 algorithms had only a single peer-reviewed publication. Performance metrics were available for 15 algorithms, external validation studies were available for 24 algorithms, and studies exploring the use of algorithms in clinical practice were available for 7 algorithms. Papers studying the clinical utility of these algorithms focused on three domains: workflow efficiency, cost savings, and clinical outcomes. Our analysis suggests that there is a meaningful gap between the FDA approval of machine learning algorithms and their clinical utilization. There appears to be room for process improvement by implementation of the following recommendations: the provision of compelling evidence that algorithms perform as intended, mandating minimum sample sizes, reporting of a predefined set of performance metrics for all algorithms and clinical application of algorithms prior to widespread use. This work will serve as a baseline for future research into the ideal regulatory framework for AI applications worldwide. • Of the 52 "neuroalgorithms" studied, primary literature could be identified for 26 of them. • Fifteen of the algorithms had primary literature reporting algorithmic performance metrics. • Heterogeneity in study design between papers made definitive assessment of algorithmic performance difficult. • Clinical adoption of algorithms was hampered by the relative lack of patient-centered research testing each algorithm. • Few papers provided a comprehensive overview of all notable limitations for a single algorithm. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

31. A genetic programming-based convolutional deep learning algorithm for identifying COVID-19 cases via X-ray images.

Author: Najaran, Mohammad Hassan Tayarani
Subjects: *MACHINE learning, *COVID-19 pandemic, *X-ray imaging, *DEEP learning, *GENETIC programming, *OPTIMIZATION algorithms, *THRESHOLDING algorithms
Abstract: Evolutionary algorithms have been successfully employed to find the best structure for many learning algorithms including neural networks. Due to their flexibility and promising results, Convolutional Neural Networks (CNNs) have found their application in many image processing applications. The structure of CNNs greatly affects the performance of these algorithms both in terms of accuracy and computational cost, thus, finding the best architecture for these networks is a crucial task before they are employed. In this paper, we develop a genetic programming approach for the optimization of CNN structure in diagnosing COVID-19 cases via X-ray images. A graph representation for CNN architecture is proposed and evolutionary operators including crossover and mutation are specifically designed for the proposed representation. The proposed architecture of CNNs is defined by two sets of parameters, one is the skeleton which determines the arrangement of the convolutional and pooling operators and their connections and one is the numerical parameters of the operators which determine the properties of these operators like filter size and kernel size. The proposed algorithm in this paper optimizes the skeleton and the numerical parameters of the CNN architectures in a co-evolutionary scheme. The proposed algorithm is used to identify covid-19 cases via X-ray images. • A novel optimization algorithm is proposed for image thresholding. • Three different fitness functions are used for optimization. • Experimental analysis are performed. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

32. Probabilistic double hierarchy linguistic Maclaurin symmetric mean-MultiCriteria Border Approximation area Comparison method for multi-criteria group decision making and its application in a selection of traditional Chinese medicine prescriptions.

Author: Xian, Sidong, Qing, Ke, Li, Chaozhen, Luo, Miao, and Liu, Renping
Subjects: *GROUP decision making, *CHINESE medicine, *MULTIPLE criteria decision making, *FEATURE selection, *JUDGMENT (Psychology), *MEDICAL prescriptions
Abstract: Traditional Chinese medicine (TCM) has gradually played an indispensable role in people's health maintenance, especially in the treatment of chronic diseases. However, there is always uncertainty and hesitation in the judgment and understanding of diseases by doctors, which affects the status recognition and optimal diagnosis and treatment decision-making of patients. In order to overcome the above problems, we lead into probabilistic double hierarchy linguistic term set (PDHLTS) to accurately describe language information in traditional Chinese medicine and make decisions. In this paper, a multi-criteria group decision making (MCGDM) model is constructed based on the MSM-MCBAC (Maclaurin symmetric mean-MultiCriteria Border Approximation area Comparison) method in the PDHL environment. Firstly, a PDHL weighted Maclaurin symmetric mean (PDHLWMSM) operator is proposed to aggregate the evaluation matrices of multiple experts. Then, combined with the BWM and maximizing deviation method, a comprehensive weight determination method is put forward to calculate the weights of criteria. Furthermore, we propose PDHL MSM-MCBAC method based on the Multi-Attributive Border Approximation area Comparison (MABAC) method and the PDHLWMSM operator. Finally, an example of a selection of TCM prescriptions is used and some comparative analyses are made to verify the effectiveness and superiority of this paper. • The PDHLWMSM operator is put forward to aggregate the evaluation matrices given by multiple experts. • A comprehensive weight determination method based on the BWM and the maximizing deviation in the PDHL environment is proposed. • We propose the PDHL MSM-MCBAC method for MCGDM involving the selection of TCM prescriptions. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

33. A multicenter random forest model for effective prognosis prediction in collaborative clinical research network

Author: Li, Jin, Tian, Yu, Zhu, Yan, Zhou, Tianshu, Li, Jun, Ding, Kefeng, and Li, Jingsong
Published: 2020
Full Text: View/download PDF

34. Automatic detection of epileptic seizure based on approximate entropy, recurrence quantiﬁcation analysis and convolutional neural networks

Author: Gao, Xiaozeng, Yan, Xiaoyan, Gao, Ping, Gao, Xiujiang, and Zhang, Shubo
Published: 2020
Full Text: View/download PDF

35. Automated classification of histopathology images using transfer learning

Author: Talo, Muhammed
Published: 2019
Full Text: View/download PDF

36. Study on miR-384-5p activates TGF-β signaling pathway to promote neuronal damage in abutment nucleus of rats based on deep learning

Author: Wang, Zhen, Du, Xiaoyan, Yang, Yang, and Zhang, Guoqing
Published: 2019
Full Text: View/download PDF

37. Cosine similarity measures of bipolar neutrosophic set for diagnosis of bipolar disorder diseases

Author: Abdel-Basset, Mohamed, Mohamed, Mai, Elhoseny, Mohamed, Son, Le Hoang, Chiclana, Francisco, and Zaied, Abd El-Nasser H.
Published: 2019
Full Text: View/download PDF

38. Methods for algorithmic diagnosis of metabolic syndrome

Author: Vrbaški, Dunja, Vrbaški, Milan, Kupusinac, Aleksandar, Ivanović, Darko, Stokić, Edita, Ivetić, Dragan, and Doroslovački, Ksenija
Published: 2019
Full Text: View/download PDF

39. Early anomaly detection in smart home: A causal association rule-based approach.

Author: Hela, Sfar, Amel, Bouzeghoub, and Badran, Raddaoui
Subjects: *ANOMALY detection (Computer security), *DATA mining, *PATIENT monitoring, *COMPUTED tomography, *CLINICS
Abstract: As the world's population grows older, an increasing number of people are facing health issues. For the elderly, living alone can be difficult and dangerous. Consequently, smart homes are becoming increasingly popular. A sensor-rich environment can be exploited for healthcare applications, in particular, anomaly detection (AD). The literature review for this paper showed that few works consider environmental factors to detect anomalies. Instead, the focus is on user activity and checking whether it is abnormal, i.e., does not conform to expected behavior. Furthermore, reducing the number of anomalies using early detection is a major issue in many applications. In this context, anomaly-cause discovery may be helpful in recommending actions that may prevent risk. In this paper, we present a novel approach for detecting the risk of anomalies occurring in the environment regarding user activities. The method relies on anomaly-cause extraction from a given dataset using causal association rules mining. These anomaly causes are utilized afterward for real-time analysis to detect the risk of anomalies using the Markov logic network machine learning method. The detected risk allows the method to recommend suitable actions to perform in order to avoid the occurrence of an actual anomaly. The proposed approach is implemented, tested, and evaluated for each contribution using real data obtained from an intelligent environment platform and real data from a clinical datasets. Experimental results prove our approach to be efficient in terms of recognition rate. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

40. Automatic classification of radiological reports for clinical care.

Author: Gerevini, Alfonso Emilio, Lavelli, Alberto, Maffi, Alessandro, Maroldi, Roberto, Minard, Anne-Lyse, Serina, Ivan, and Squassina, Guido
Subjects: *AUTOMATION, *RADIOLOGIC technologists, *COMPUTED tomography, *HEURISTIC, *MACHINE learning
Abstract: Radiological reporting generates a large amount of free-text clinical narratives, a potentially valuable source of information for improving clinical care and supporting research. The use of automatic techniques to analyze such reports is necessary to make their content effectively available to radiologists in an aggregated form. In this paper we focus on the classification of chest computed tomography reports according to a classification schema proposed for this task by radiologists of the Italian hospital ASST Spedali Civili di Brescia. The proposed system is built exploiting a training data set containing reports annotated by radiologists. Each report is classified according to the schema developed by radiologists and textual evidences are marked in the report. The annotations are then used to train different machine learning based classifiers. We present in this paper a method based on a cascade of classifiers which make use of a set of syntactic and semantic features. The resulting system is a novel hierarchical classification system for the given task, that we have experimentally evaluated. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

41. Pharmacological therapy selection of type 2 diabetes based on the SWARA and modified MULTIMOORA methods under a fuzzy environment.

Author: Eghbali-Zarch, M., Tavakkoli-Moghaddam, R., Esfahanian, F., Sepehri, M.M., and Azaron, A.
Subjects: *TYPE 2 diabetes, *RATIO analysis, *HYPERGLYCEMIA, *CD26 antigen, *SULFONYLUREAS
Abstract: Medication selection for Type 2 Diabetes (T2D) is a challenging medical decision-making problem involving multiple medications that can be prescribed to control the patient's blood glucose. The wide range of hyperglycemia lowering agents with varying effects and various side effects makes the decision quite difficult. This paper presents computer-aided medical decision support using a fuzzy Multi-Criteria Decision-Making (MCDM) model that hybridizes a Step-wise Weight Assessment Ratio Analysis (SWARA) method with a modification of Fuzzy Multi-Objective Optimization on the basis of a Ratio Analysis plus the full multiplicative form (FMULTIMOORA) method for pharmacological therapy selection of T2D. It makes the use of SWARA for obtaining the relative significance of every selected criterion by soliciting experts' opinions and FMULTIMOORA method for evaluation of each alternative according to all criteria based on a published clinical guideline. In this paper, an extended reference point approach is considered in the proposed hybrid MCDM model that resolves the classic reference point limitations and improves the FMULTIMOORA ranking procedure. Computational results indicate that Metformin is confirmed as the first-line medication and Sulfonylurea as the second-line add-on therapy. The Glucagon-like peptide-1 receptor agonist, Dipeptidyl peptidase-4 inhibitor, and Insulin are placed 3rd, 4th, and 5th, respectively. A sensitivity analysis is conducted to validate the model performance by comparing its result with studies in the literature, other fuzzy MCDM techniques and an interval MULTIMOORA method based on an observational dataset. The close correspondence between the final rankings of anti-diabetic agents resulted from the proposed hybrid model and other methodologies provide significant implications for endocrinologists to refer. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

42. Temporal case-based reasoning for type 1 diabetes mellitus bolus insulin decision support.

Author: Brown, Daniel, Aldea, Arantza, Harrison, Rachel, Martin, Clare, and Bayley, Ian
Subjects: *TYPE 1 diabetes, *GLYCEMIC index, *BOLUS drug administration, *ARTIFICIAL intelligence in medicine, *BLOOD sugar
Abstract: Individuals with type 1 diabetes have to monitor their blood glucose levels, determine the quantity of insulin required to achieve optimal glycaemic control and administer it themselves subcutaneously, multiple times per day. To help with this process bolus calculators have been developed that suggest the appropriate dose. However these calculators do not automatically adapt to the specific circumstances of an individual and require fine-tuning of parameters, a process that often requires the input of an expert. To overcome the limitations of the traditional methods this paper proposes the use of an artificial intelligence technique, case-based reasoning, to personalise the bolus calculation. A novel aspect of our approach is the use of temporal sequences to take into account preceding events when recommending the bolus insulin doses rather than looking at events in isolation. The in silico results described in this paper show that given the initial conditions of the patient, the temporal retrieval algorithm identifies the most suitable case for reuse. Additionally through insulin-on-board adaptation and postprandial revision, the approach is able to learn and improve bolus predictions, reducing the blood glucose risk index by up to 27% after three revisions of a bolus solution. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

43. Activities suggestion based on emotions in AAL environments.

Author: Costa, Angelo, Rincon, Jaime Andres, Carrascosa, Carlos, Novais, Paulo, and Julian, Vicente
Subjects: *SERVICES for older people, *SENIOR housing, *LONG-term care facilities, *MEDICAL care for older people, *MULTIAGENT systems
Abstract: The elderly population is increasing and the response of the society was to provide them with services directed to them to cope with their needs. One of the oldest solutions is the retirement home, providing housing and permanent assistance for the elderly. Furthermore, most of the retirement homes are inhabited by multiple elderly people, thus creating a community of people who are somewhat related in age and medical issues. The ambient assisted living (AAL) area tries to solve some of the elderly issues by producing technological products, some of them dedicated to elderly homes. One of the identified problem is that elderly people are sometimes discontent about the activities that consume most of their day promoted by the retirement home social workers. The work presented in this paper attempts to improve how these activities are scheduled taking into account the elderlies' emotional response to these activities. The aim is to maximize the group happiness by promoting the activities the group likes, minding if they are bored due to activities repetition. In this sense, this paper presents an extension of the Cognitive Life Assistant platform incorporating a social emotional model. The proposed system has been modelled as a free time activity manager which is in charge of suggesting activities to the social workers. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

44. Different approaches for identifying important concepts in probabilistic biomedical text summarization.

Author: Moradi, Milad and Ghadiri, Nasser
Subjects: *MEDICAL textbooks, *DATA mining, *MEDICAL language, *REDUNDANCY (Linguistics), *BAYES' theorem, *ARTIFICIAL intelligence in medicine, *ABSTRACTING & indexing services, *MEDICAL information storage & retrieval systems, *MEDICAL research, *PROBABILITY theory, *READING, *SEMANTICS, *MEDICAL subject headings
Abstract: Automatic text summarization tools help users in the biomedical domain to acquire their intended information from various textual resources more efficiently. Some of biomedical text summarization systems put the basis of their sentence selection approach on the frequency of concepts extracted from the input text. However, it seems that exploring other measures rather than the raw frequency for identifying valuable contents within an input document, or considering correlations existing between concepts, may be more useful for this type of summarization. In this paper, we describe a Bayesian summarization method for biomedical text documents. The Bayesian summarizer initially maps the input text to the Unified Medical Language System (UMLS) concepts; then it selects the important ones to be used as classification features. We introduce six different feature selection approaches to identify the most important concepts of the text and select the most informative contents according to the distribution of these concepts. We show that with the use of an appropriate feature selection approach, the Bayesian summarizer can improve the performance of biomedical summarization. Using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit, we perform extensive evaluations on a corpus of scientific papers in the biomedical domain. The results show that when the Bayesian summarizer utilizes the feature selection methods that do not use the raw frequency, it can outperform the biomedical summarizers that rely on the frequency of concepts, domain-independent and baseline methods. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

45. Agent-based approaches for biological modeling in oncology: A literature review.

Author: Stephan, Simon, Galland, Stéphane, Labbani Narsis, Ouassila, Shoji, Kenji, Vachenc, Sébastien, Gerart, Stéphane, and Nicolle, Christophe
Abstract: Computational modeling involves the use of computer simulations and models to study and understand real-world phenomena. Its application is particularly relevant in the study of potential interactions between biological elements. It is a promising approach to understand complex biological processes and predict their behavior under various conditions. This paper is a review of the recent literature on computational modeling of biological systems. Our study focuses on the field of oncology and the use of artificial intelligence (AI) and, in particular, agent-based modeling (ABM), between 2010 and May 2023. Most of the articles studied focus on improving the diagnosis and understanding the behaviors of biological entities, with metaheuristic algorithms being the models most used. Several challenges are highlighted regarding increasing and structuring knowledge about biological systems, developing holistic models that capture multiple scales and levels of organization, reproducing emergent behaviors of biological systems, validating models with experimental data, improving computational performance of models and algorithms, and ensuring privacy and personal data protection are discussed. • Use of multi-agent systems to study biological interactions in oncology. • Multi-agent systems enable modeling of cell signaling network dynamics. • Review of multi-agent systems application for personalized therapeutic strategies. • Meta-heuristic algorithms found to be the most used models in recent studies. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. Hematologic cancer diagnosis and classification using machine and deep learning: State-of-the-art techniques and emerging research directives.

Author: Patel, Hema, Shah, Himal, Patel, Gayatri, and Patel, Atul
Abstract: Hematology is the study of diagnosis and treatment options for blood diseases, including cancer. Cancer is considered one of the deadliest diseases across all age categories. Diagnosing such a deadly disease at the initial stage is essential to cure the disease. Hematologists and pathologists rely on microscopic evaluation of blood or bone marrow smear images to diagnose blood-related ailments. The abundance of overlapping cells, cells of varying densities among platelets, non-illumination levels, and the amount of red and white blood cells make it more difficult to diagnose illness using blood cell images. Pathologists are required to put more effort into the traditional, time-consuming system. Nowadays, it becomes possible with machine learning and deep learning techniques, to automate the diagnostic processes, categorize microscopic blood cells, and improve the accuracy of the procedure and its speed as the models developed using these methods may guide an assisting tool. In this article, we have acquired, analyzed, scrutinized, and finally selected around 57 research papers from various machine learning and deep learning methodologies that have been employed in the diagnosis of leukemia and its classification over the past 20 years, which have been published between the years 2003 and 2023 by PubMed, IEEE, Science Direct, Google Scholar and other pertinent sources. Our primary emphasis is on evaluating the advantages and limitations of analogous research endeavors to provide a concise and valuable research directive that can be of significant utility to fellow researchers in the field. • Use of machine learning techniques for the diagnostic purpose • Diagnosis and classification of leukemia using computer aided methods • Systematic review of the work done so far in the field of leukemia research • Machine learning and deep learning are capable to proceed large amount of data for higher accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. Improving diagnosis and outcome prediction of gastric cancer via multimodal learning using whole slide pathological images and gene expression.

Author: Xie, Yuzhang, Sang, Qingqing, Da, Qian, Niu, Guoshuai, Deng, Shijie, Feng, Haoran, Chen, Yunqin, Li, Yuan-Yuan, Liu, Bingya, Yang, Yang, and Dai, Wentao
Abstract: For the diagnosis and outcome prediction of gastric cancer (GC), machine learning methods based on whole slide pathological images (WSIs) have shown promising performance and reduced the cost of manual analysis. Nevertheless, accurate prediction of GC outcome may rely on multiple modalities with complementary information, particularly gene expression data. Thus, there is a need to develop multimodal learning methods to enhance prediction performance. In this paper, we collect a dataset from Ruijin Hospital and propose a multimodal learning method for GC diagnosis and outcome prediction, called GaCaMML, which is featured by a cross-modal attention mechanism and Per-Slide training scheme. Additionally, we perform feature attribution analysis via integrated gradient (IG) to identify important input features. The proposed method improves prediction accuracy over the single-modal learning method on three tasks, i.e., survival prediction (by 4.9% on C-index), pathological stage classification (by 11.6% on accuracy), and lymph node classification (by 12.0% on accuracy). Especially, the Per-Slide strategy addresses the issue of a high WSI-to-patient ratio and leads to much better results compared with the Per-Person training scheme. For the interpretable analysis, we find that although WSIs dominate the prediction for most samples, there is still a substantial portion of samples whose prediction highly relies on gene expression information. This study demonstrates the great potential of multimodal learning in GC-related prediction tasks and investigates the contribution of WSIs and gene expression, respectively, which not only shows how the model makes a decision but also provides insights into the association between macroscopic pathological phenotypes and microscopic molecular features. [Display omitted] • GaCaMML outperforms the single-modal learning method across three tasks on accuracy. • Per-Slide strategy enhances prediction results by optimizing WSI-to-patient ratio. • WSI features prevail in gastric cancer outcome prediction for most samples. • Certain genes like MMPs and TP53 significantly impact prediction results. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. MICIL: Multiple-Instance Class-Incremental Learning for skin cancer whole slide images.

Author: Meseguer, Pablo, del Amor, Rocío, and Naranjo, Valery
Abstract: Artificial intelligence (AI) agents encounter the problem of catastrophic forgetting when they are trained in sequentially with new data batches. This issue poses a barrier to the implementation of AI-based models in tasks that involve ongoing evolution, such as cancer prediction. Moreover, whole slide images (WSI) play a crucial role in cancer management, and their automated analysis has become increasingly popular in assisting pathologists during the diagnosis process. Incremental learning (IL) techniques aim to develop algorithms capable of retaining previously acquired information while also acquiring new insights to predict future data. Deep IL techniques need to address the challenges posed by the gigapixel scale of WSIs, which often necessitates the use of multiple instance learning (MIL) frameworks. In this paper, we introduce an IL algorithm tailored for analyzing WSIs within a MIL paradigm. The proposed M ultiple I nstance C lass- I ncremental L earning (MICIL) algorithm combines MIL with class-IL for the first time, allowing for the incremental prediction of multiple skin cancer subtypes from WSIs within a class-IL scenario. Our framework incorporates knowledge distillation and data rehearsal, along with a novel embedding-level distillation, aiming to preserve the latent space at the aggregated WSI level. Results demonstrate the algorithm's effectiveness in addressing the challenge of balancing IL-specific metrics, such as intransigence and forgetting, and solving the plasticity-stability dilemma. • An incremental learning algorithm is applied under the MIL paradigm to analyze WSI. • A public dataset featuring patch-level embeddings of multiple spindle-cell neoplasms. • A WSI embedding-level distillation to prevent shift in the aggregated latent space. • Comprehensive experiments address the plasticity-stability dilemma in MIL frameworks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. Learning the cellular activity representation based on gene regulatory networks for prediction of tumor response to drugs.

Author: Xie, Xinping, Wang, Fengting, Wang, Guanfu, Zhu, Weiwei, Du, Xiaodong, and Wang, Hongqiang
Abstract: Predicting the response of tumor cells to anti-tumor drugs is critical to realizing cancer precision medicine. Currently, most existing methods ignore the regulatory relationships between genes and thus have unsatisfactory predictive performance. In this paper, we propose to predict anti-tumor drug efficacy via learning the activity representation of tumor cells based on a priori knowledge of gene regulation networks (GRNs). Specifically, the method simulates the cellular biosystem by synthesizing a cell-gene activity network and then infers a new low-dimensional activity representation for tumor cells from the raw high-dimensional expression profile. The simulated cell-gene network mainly comprises known gene regulatory networks collected from multiple resources and fuses tumor cells by linking them to hotspot genes that are over- or under-expressed in them. The resulting activity representation could not only reflect the shallow expression profile (hotspot genes) but also mines in-depth information of gene regulation activity in tumor cells before treatment. Finally, we build deep learning models on the activity representation for predicting drug efficacy in tumor cells. Experimental results on the benchmark GDSC dataset demonstrate the superior performance of the proposed method over SOTA methods with the highest AUC of 0.954 in the efficacy label prediction and the best R2 of 0.834 in the regression of half maximal inhibitory concentration (IC50) values, suggesting the potential value of the proposed method in practice. • Propose a drug efficacy prediction method based on the cellular activity representation • The activity representation learned from a cell-gene network combining GRNs and expression profiles • The predictive ability of the activity representation depends on simulating the real biosystem. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

50. Cellular data extraction from multiplexed brain imaging data using self-supervised Dual-loss Adaptive Masked Autoencoder.

Author: Ly, Son T., Lin, Bai, Vo, Hung Q., Maric, Dragan, Roysam, Badrinath, and Nguyen, Hien V.
Abstract: Reliable large-scale cell detection and segmentation is the fundamental first step to understanding biological processes in the brain. The ability to phenotype cells at scale can accelerate preclinical drug evaluation and system-level brain histology studies. The impressive advances in deep learning offer a practical solution to cell image detection and segmentation. Unfortunately, categorizing cells and delineating their boundaries for training deep networks is an expensive process that requires skilled biologists. This paper presents a novel self-supervised Dual-Loss Adaptive Masked Autoencoder (DAMA) for learning rich features from multiplexed immunofluorescence brain images. DAMA's objective function minimizes the conditional entropy in pixel-level reconstruction and feature-level regression. Unlike existing self-supervised learning methods based on a random image masking strategy, DAMA employs a novel adaptive mask sampling strategy to maximize mutual information and effectively learn brain cell data. To the best of our knowledge, this is the first effort to develop a self-supervised learning method for multiplexed immunofluorescence brain images. Our extensive experiments demonstrate that DAMA features enable superior cell detection, segmentation, and classification performance without requiring many annotations. In addition, to examine the generalizability of DAMA, we also experimented on TissueNet, a multiplexed imaging dataset comprised of two-channel fluorescence images from six distinct tissue types, captured using six different imaging platforms. Our code is publicly available at https://github.com/hula-ai/DAMA. • We present a novel information-theoretic self-supervised method for multiplexed brain image analysis. Our theoretical analysis shows that the proposed objective function maximizes the mutual information between the input image and self-supervised labels. • We design the first adaptive mask sampling strategy for self-supervised learning models. We explain how this adaptive sampling helps the model focus on underrepresented features and improve the representation. • Extensive experiments on cell detection and classification are provided to validate the effectiveness of DAMA. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Region

Database

Publisher

1,253 results

Search Results

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources