Back to Search
Start Over
DeepTabStR: Deep Learning based Table Structure Recognition
- Source :
- ICDAR
- Publication Year :
- 2019
- Publisher :
- IEEE, 2019.
-
Abstract
- This paper presents a novel method for the analysis of tabular structures in document images using the potential of deformable convolutional networks. In order to assess the suitability of the model to the task of table structure recognition, most of the prior methods have been tested on the smaller ICDAR-13 table structure recognition dataset comprising of just 156 tables. We curated a new image-based table structure recognition dataset, TabStructDB2, comprising of 1081 tables densely labeled with row and column information. Instead of collecting new images for this purpose, we leveraged the famous Page-Object Detection dataset from ICDAR-17, and added structural information for all the tabular regions present in the dataset. This new publicly available dataset will enable the development of more sophisticated table structure recognition techniques in the future. We performed extensive evaluation on the two datasets (ICDAR-13 and TabStructDB) including crossdataset testing in order to evaluate the efficacy of the proposed approach. We achieved state-of-the-art results with deformable models on ICDAR-13 with an average F-Measure of 92.98% (89.42% for rows and 96.55% for columns) and report baseline results on TabStructDB for guiding future research efforts with an F-Measure of 93.72% (91.26% for rows and 95.59% for columns). Despite promising results, structural analysis of tables with arbitrary layouts is still far from achievable at this point.
- Subjects :
- Point (typography)
business.industry
Computer science
Deep learning
02 engineering and technology
010501 environmental sciences
computer.software_genre
01 natural sciences
Column (database)
Convolutional neural network
Image (mathematics)
Task (computing)
0202 electrical engineering, electronic engineering, information engineering
Table (database)
020201 artificial intelligence & image processing
Artificial intelligence
Data mining
business
Row
computer
0105 earth and related environmental sciences
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2019 International Conference on Document Analysis and Recognition (ICDAR)
- Accession number :
- edsair.doi...........c752722c5684508e843c213a269815f5
- Full Text :
- https://doi.org/10.1109/icdar.2019.00226