Author: "Wan, Shaohua" / Journal: pattern recognition letters / Publication Year Range: Last 10 years / Topic: feature extraction - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wan, Shaohua"' showing total 2 results

Start Over Author "Wan, Shaohua" Topic feature extraction Publication Year Range Last 10 years Journal pattern recognition letters

2 results on '"Wan, Shaohua"'

1. CDText: Scene text detector based on context-aware deformable transformer.

Author: Wu, Yirui, Kong, Qiran, Yong, Lai, Narducci, Fabio, and Wan, Shaohua
Subjects: *TEXT recognition, *DETECTORS, *FEATURE extraction, *COMPARATIVE method
Abstract: • CDText detect texts of arbitrary shapes by encoding context information. • Feature extractor refines feature map with dilated context encoding blocks. • Transformer aggregates text features of detection boxes for instance segmentation. Scene text detection task aims to precisely locate text regions in natural scenes. However, the existing methods still face challenges in detecting arbitrary-shaped text, due to their limited feature representation capability. To alleviate this problem, we propose a scene text detector, i.e., CDText, based on structure of context-aware deformable transformer. Specifically, CDText firstly adopts different convolution kernel designs for feature extraction, which designs receptive fields with different size for multi-scale feature perception and fusion. Meanwhile, multi-head self-attention mechanism is used to strengthen the reasoning ability of CDText in a global sense, thus enhancing feature maps with abundant context information by extracting implicit relationship between multi-scale text features. Moreover, CDText designs a segmentation head to segment text instances of arbitrary shapes from rectangular detection boxes. Experiments show that CDText is superior to comparative methods in detection accuracy, achieving F -scores of 92.7, 81.9, and 82.9 on ICDAR2013, Total Text, and CTW-1500 datasets, respectively. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

2. GDRL: An interpretable framework for thoracic pathologic prediction.

Author: Wu, Yirui, Li, Hao, Feng, Xi, Casanova, Andrea, Abate, Andrea F., and Wan, Shaohua
Subjects: *DECISION making, *DEEP learning, *FEATURE extraction, *LATENT infection, *IMAGE analysis, *X-ray imaging
Abstract: • Propose a Group-Disentangled Representation Learning framework (GDRL). • Introduce an implicit group-swap structure. • Extract linking relationship between semantical concepts of pathology and visual features. • Demonstrate that GDRL can significantly improve classification accuracy. Deep learning methods have shown significant performance in medical image analysis tasks. However, they generally act like "black box" without explanations in both feature extraction and decision processes, leading to lack of clinical insights and high risk assessments. To aid deep learning in envisioning diseases with visual clues, we propose a novel Group-Disentangled Representation Learning framework (GDRL). The key contribution is that GDRL completely disentangles latent space into disease concepts with abundant and non-overlapping feature related explanations, thus enhancing interpretability in feature extraction and decision processes. Furthermore, we introduce an implicit group-swap structure by emphasizing the linking relationship between semantical concepts of disease and low-level visual features, other than explicit explanations on general objects and their attributes. We demonstrate our framework on predicting four categories of diseases from chest X-ray images. The AUROC of GDRL on ChestX-ray14 for thoracic pathologic prediction are 0.8630, 0.8980, 0.9269 and 0.8653 respectively, and we showcase the potential of our framework in enhancing interpretability of the factors contributing to different diseases. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Wan, Shaohua"'

1. CDText: Scene text detector based on context-aware deformable transformer.

2. GDRL: An interpretable framework for thoracic pathologic prediction.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

2 results on '"Wan, Shaohua"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources