Back to Search Start Over

Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks.

Authors :
Ye, Jun-Yu
Zhang, Yan-Ming
Yang, Qing
Liu, Cheng-Lin
Source :
Pattern Recognition. Jun2021, Vol. 114, pN.PAG-N.PAG. 1p.
Publication Year :
2021

Abstract

• A framework for joint text/non-text stroke classification and text line grouping in online handwritten documents is proposed. • Stroke classification and text line grouping problems are formulated as node classification/clustering problems in graph. • An edge pooling attention network (EPAT) is proposed to efficiently aggregate information of nodes and edges. • Superior performance of both stroke classification and text line grouping is achieved on public datasets. Stroke classification and text line grouping are important tasks in online handwritten document segmentation. In the past, the two tasks were usually performed using different models which are trained independently and perform sequentially. This cannot optimize the integration of contextual information and the system may suffer from error accumulation in stroke classification. In this paper, we propose a method for joint text/non-text stroke classification and text line grouping in online handwritten documents using attention based graph neural network. In our framework, the stroke classification and text line grouping problems are formulated as node classification and node clustering problems in a relational graph, which is constructed based on the temporal and spatial relationship between strokes. We propose a new graph network architecture, called edge pooling attention network (EPAT) to efficiently aggregate information between the features of neighboring nodes and edges. The proposed model is trained by multi-task learning with cross entropy loss for node classification and distance metric loss for node clustering. In experiments on two online handwritten document datasets IAMOnDo and Kondate, the proposed method is demonstrated effective, yielding superior performance in both stroke classification and text line grouping. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00313203
Volume :
114
Database :
Academic Search Index
Journal :
Pattern Recognition
Publication Type :
Academic Journal
Accession number :
149015042
Full Text :
https://doi.org/10.1016/j.patcog.2021.107859