Back to Search Start Over

A Robust Framework for One-Shot Key Information Extraction via Deep Partial Graph Matching.

Authors :
Yao M
Liu Z
Zhuang L
Wang L
Li H
Source :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society [IEEE Trans Image Process] 2024; Vol. 33, pp. 1070-1079. Date of Electronic Publication: 2024 Feb 05.
Publication Year :
2024

Abstract

Text field labelling plays a key role in Key Information Extraction (KIE) from structured document images. However, existing methods ignore the field drift and outlier problems, which limit their performance and make them less robust. This paper casts the text field labelling problem into a partial graph matching problem and proposes an end-to-end trainable framework called Deep Partial Graph Matching (dPGM) for the one-shot KIE task. It represents each document as a graph and estimates the correspondence between text fields from different documents by maximizing the graph similarity of different documents. Our framework obtains a strict one-to-one correspondence by adopting a combinatorial solver module with an extra one-to-(at most)-one mapping constraint to do the exact graph matching, which leads to the robustness of the field drift problem and the outlier problem. Finally, a large one-shot KIE dataset named DKIE is collected and annotated to promote research of the KIE task. This dataset will be released to the research and industry communities. Extensive experiments on both the public and our new DKIE datasets show that our method can achieve state-of-the-art performance and is more robust than existing methods.

Details

Language :
English
ISSN :
1941-0042
Volume :
33
Database :
MEDLINE
Journal :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Publication Type :
Academic Journal
Accession number :
38285573
Full Text :
https://doi.org/10.1109/TIP.2024.3357251