Back to Search Start Over

DeepTabStR: Deep Learning based Table Structure Recognition

Authors :
Shoaib Ahmed Siddiqui
Sheraz Ahmed
Andreas Dengel
Imran Ali Fateh
Syed Tahseen Raza Rizvi
Source :
ICDAR
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

This paper presents a novel method for the analysis of tabular structures in document images using the potential of deformable convolutional networks. In order to assess the suitability of the model to the task of table structure recognition, most of the prior methods have been tested on the smaller ICDAR-13 table structure recognition dataset comprising of just 156 tables. We curated a new image-based table structure recognition dataset, TabStructDB2, comprising of 1081 tables densely labeled with row and column information. Instead of collecting new images for this purpose, we leveraged the famous Page-Object Detection dataset from ICDAR-17, and added structural information for all the tabular regions present in the dataset. This new publicly available dataset will enable the development of more sophisticated table structure recognition techniques in the future. We performed extensive evaluation on the two datasets (ICDAR-13 and TabStructDB) including crossdataset testing in order to evaluate the efficacy of the proposed approach. We achieved state-of-the-art results with deformable models on ICDAR-13 with an average F-Measure of 92.98% (89.42% for rows and 96.55% for columns) and report baseline results on TabStructDB for guiding future research efforts with an F-Measure of 93.72% (91.26% for rows and 95.59% for columns). Despite promising results, structural analysis of tables with arbitrary layouts is still far from achievable at this point.

Details

Database :
OpenAIRE
Journal :
2019 International Conference on Document Analysis and Recognition (ICDAR)
Accession number :
edsair.doi...........c752722c5684508e843c213a269815f5
Full Text :
https://doi.org/10.1109/icdar.2019.00226