Back to Search
Start Over
Cancer hallmark analysis using semantic classification with enhanced topic modelling on biomedical literature.
- Source :
- Multimedia Tools & Applications; Sep2024, Vol. 83 Issue 31, p76429-76446, 18p
- Publication Year :
- 2024
-
Abstract
- The ever-growing 1.15 million new cases of cancer on a yearly basis alone in India is a major cause of concern for the experts and researchers working in various biomedical organizations. The advent of modern text engineering strategies and NLP techniques can play a crucial role in the discovery and analysis of pre-existing knowledge present in the cancer related biomedical archives. The available 10 Cancer hallmarks can provide key insights and make significant impact in the ongoing cancer research. It is extremely important to identify and classify required information due to time and resource crunch which needs to be quickly accessed. This article introduces a novel machine learning framework called Cancer Hallmark Classification and Topic Modeling (CHCTM), designed for supervised learning. The CHCTM framework is capable of semantically learning, categorizing, and extracting significant topics and their combinations related to the hallmarks of cancer (HoC) from a dataset comprising 1499 PubMed documents. The key contributions of this research include the creation of an innovative ensemble classification model using a meta-classifier based on Random Forest (RF). Additionally, it introduces an Enhanced Latent Dirichlet Allocation (ELDA) topic modeling strategy to generate relevant mixtures of topics. The performance of the CHCTM framework is evaluated using precision, recall, accuracy, and F-score parameters. Comparative analysis with other biomedical baseline methods reveals an 8% improvement in F-score. The coherence values acquired for ELDA are tallied and weighted against PLSA and LDA models to demonstrate the effectiveness of this approach. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 13807501
- Volume :
- 83
- Issue :
- 31
- Database :
- Complementary Index
- Journal :
- Multimedia Tools & Applications
- Publication Type :
- Academic Journal
- Accession number :
- 179414544
- Full Text :
- https://doi.org/10.1007/s11042-024-18533-0