Back to Search Start Over

Cancer hallmark analysis using semantic classification with enhanced topic modelling on biomedical literature.

Authors :
Gupta, Supriya
Sharaff, Aakanksha
Nagwani, Naresh Kumar
Source :
Multimedia Tools & Applications; Sep2024, Vol. 83 Issue 31, p76429-76446, 18p
Publication Year :
2024

Abstract

The ever-growing 1.15 million new cases of cancer on a yearly basis alone in India is a major cause of concern for the experts and researchers working in various biomedical organizations. The advent of modern text engineering strategies and NLP techniques can play a crucial role in the discovery and analysis of pre-existing knowledge present in the cancer related biomedical archives. The available 10 Cancer hallmarks can provide key insights and make significant impact in the ongoing cancer research. It is extremely important to identify and classify required information due to time and resource crunch which needs to be quickly accessed. This article introduces a novel machine learning framework called Cancer Hallmark Classification and Topic Modeling (CHCTM), designed for supervised learning. The CHCTM framework is capable of semantically learning, categorizing, and extracting significant topics and their combinations related to the hallmarks of cancer (HoC) from a dataset comprising 1499 PubMed documents. The key contributions of this research include the creation of an innovative ensemble classification model using a meta-classifier based on Random Forest (RF). Additionally, it introduces an Enhanced Latent Dirichlet Allocation (ELDA) topic modeling strategy to generate relevant mixtures of topics. The performance of the CHCTM framework is evaluated using precision, recall, accuracy, and F-score parameters. Comparative analysis with other biomedical baseline methods reveals an 8% improvement in F-score. The coherence values acquired for ELDA are tallied and weighted against PLSA and LDA models to demonstrate the effectiveness of this approach. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13807501
Volume :
83
Issue :
31
Database :
Complementary Index
Journal :
Multimedia Tools & Applications
Publication Type :
Academic Journal
Accession number :
179414544
Full Text :
https://doi.org/10.1007/s11042-024-18533-0