A Two-Stage Framework for Compound Figure Separation

Authors :
Eric Schwenker
Maria K. Y. Chan
Weixin Jiang
Nicola J. Ferrier
Oliver Cossairt
Trevor Spreadbury
Source :
2021 IEEE International Conference on Image Processing (ICIP).
Publication Year :
2021
Publisher :
IEEE, 2021.

Abstract

Scientific literature contains large volumes of complex, unstructured figures that are compound in nature (i.e., composed of multiple images, graphs, and drawings). Separating these compound figures is critical for retrieving information from them. In this paper, we propose a new strategy for compound figure separation that decomposes compound figures into their constituent subfigures while preserving the association between the subfigures and their respective caption components. We address this problem with a two-stage framework. In the first stage, the subfigure label detection module detects all subfigure labels. In the second stage, the subfigure detection module uses the detected labels to help detect the subfigures by optimizing the feature selection process and providing global layout information as extra features. Extensive experiments validate the effectiveness and superiority of the proposed framework, which improves detection precision by 9%.

Details

Database :
OpenAIRE
Journal :
2021 IEEE International Conference on Image Processing (ICIP)
Accession number :
edsair.doi.dedup.....6136ab10966968a50962b16a16f64e83