Back to Search
Start Over
Improving real-time detection of laryngeal lesions in endoscopic images using a decoupled super-resolution enhanced YOLO.
- Source :
-
Computer methods and programs in biomedicine [Comput Methods Programs Biomed] 2025 Mar; Vol. 260, pp. 108539. Date of Electronic Publication: 2024 Dec 13. - Publication Year :
- 2025
-
Abstract
- Background and Objective: Laryngeal Cancer (LC) constitutes approximately one third of head and neck cancers. Detecting early-stage lesions in this anatomical region is crucial for achieving a high survival rate. However, it poses significant diagnostic challenges owing to the varied appearance of lesions and the need for precise characterization for appropriate clinical management. Conventional diagnostic approaches rely heavily on endoscopic examination, which often requires expert interpretation and may be limited by subjective assessment. Deep learning (DL) approaches offer promising opportunities for automating lesion detection, but their efficacy in handling multi-modal imaging data and accurately localizing small lesions remains a subject of investigation. Furthermore, the clinical domain may largely benefit from the deployment of efficient DL methods that can ensure equitable access to advanced technologies, regardless of the availability of resources that can often be limited. In this study, a DL-based approach, named SRE-YOLO, was introduced to provide real-time assistance to less-experienced personnel during laryngeal assessment, by automatically detecting lesions at different scales from endoscopic White Light (WL) and Narrow-Band Imaging (NBI) images.<br />Methods: During the training, the SRE-YOLO integrates a YOLOv8 nano (YOLOv8n) baseline with a Super-Resolution (SR) branch to enhance lesion detection. This last component is decoupled during inference to preserve the low computational demand of the YOLOv8n baseline. The evaluation was conducted on a multi-center dataset, encompassing diverse laryngeal pathologies and acquisition modalities.<br />Results: The SRE-YOLO method improved the Average Precision (AP <subscript>@IoU=0.5</subscript> ) in lesion detection by 5% with respect to the YOLOv8n baseline, while maintaining the inference speed of 58.8 Frames Per Second (FPS). Comparative analyses against state-of-the-art DL methods highlighted the efficacy of the SRE-YOLO approach in balancing detection accuracy, computational efficiency, and real-time applicability.<br />Conclusions: This research underscores the potential of SRE-YOLO in developing efficient DL-driven decision support systems for real-time detection of laryngeal lesions at different scales from both WL and NBI endoscopic data.<br />Competing Interests: Declaration of competing interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Leonardo S. Mattos reports financial support was provided by RAISE, Robotics and AI for Socio-economic Empowerment (ECS00000035). If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.<br /> (Copyright © 2024. Published by Elsevier B.V.)
Details
- Language :
- English
- ISSN :
- 1872-7565
- Volume :
- 260
- Database :
- MEDLINE
- Journal :
- Computer methods and programs in biomedicine
- Publication Type :
- Academic Journal
- Accession number :
- 39689500
- Full Text :
- https://doi.org/10.1016/j.cmpb.2024.108539