1. Administrative Document Analysis and Structure
- Author
-
Hatem Hamza, Vincent Poulain d'Andecy, Abdel Belaïd, Yolande Belaïd, READ (READ), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS), Itesoft R&D, ITESOFT, and Marenglen Biba and Fatos Xhafa
- Subjects
Neural gas ,Artificial neural network ,Invoice ,business.industry ,Computer science ,020207 software engineering ,02 engineering and technology ,Document analysis ,computer.software_genre ,Local structure ,[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing ,Invoice processing ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Data mining ,business ,[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing ,computer ,Case base - Abstract
International audience; This chapter reports our knowledge about the analysis and recognition of scanned administrative documents. Regarding essentially the administrative paper flow with new and continuous arrivals, all the conventional techniques reserved to static databases modeling and recognition are doomed to failure. For this purpose, a new technique based on the experience was investigated giving very promising results. This technique is related to the case-based reasoning already used in data mining and various problems of machine learning. After the presentation of the context related to the administrative document flow and its requirements in a real time processing, we present a case based reasonning for invoice processing. The case corresponds to the co-existence of a problem and its solution. The problem in an invoice corresponds to a local structure such as the keywords of an address or the line patterns in the amounts table, while the solution is related to their content. This problem is then compared to a document case base using graph probing. For this purpose, we proposed an improvement of an already existing neural network called Incremental Growing Neural Gas
- Published
- 2011
- Full Text
- View/download PDF