Back to Search
Start Over
Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks.
- Source :
-
Pattern Recognition . Jun2021, Vol. 114, pN.PAG-N.PAG. 1p. - Publication Year :
- 2021
-
Abstract
- • A framework for joint text/non-text stroke classification and text line grouping in online handwritten documents is proposed. • Stroke classification and text line grouping problems are formulated as node classification/clustering problems in graph. • An edge pooling attention network (EPAT) is proposed to efficiently aggregate information of nodes and edges. • Superior performance of both stroke classification and text line grouping is achieved on public datasets. Stroke classification and text line grouping are important tasks in online handwritten document segmentation. In the past, the two tasks were usually performed using different models which are trained independently and perform sequentially. This cannot optimize the integration of contextual information and the system may suffer from error accumulation in stroke classification. In this paper, we propose a method for joint text/non-text stroke classification and text line grouping in online handwritten documents using attention based graph neural network. In our framework, the stroke classification and text line grouping problems are formulated as node classification and node clustering problems in a relational graph, which is constructed based on the temporal and spatial relationship between strokes. We propose a new graph network architecture, called edge pooling attention network (EPAT) to efficiently aggregate information between the features of neighboring nodes and edges. The proposed model is trained by multi-task learning with cross entropy loss for node classification and distance metric loss for node clustering. In experiments on two online handwritten document datasets IAMOnDo and Kondate, the proposed method is demonstrated effective, yielding superior performance in both stroke classification and text line grouping. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 00313203
- Volume :
- 114
- Database :
- Academic Search Index
- Journal :
- Pattern Recognition
- Publication Type :
- Academic Journal
- Accession number :
- 149015042
- Full Text :
- https://doi.org/10.1016/j.patcog.2021.107859