Key information extraction from documents: Evaluation and generator?

Oliver Bensch, Mirela Popa, Constantin Spille

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Extracting information from documents usually relies on natural language processing methods working on one-dimensional sequences of text. In some cases, for example, for the extraction of key information from semi-structured documents, such as invoice-documents, spatial and formatting information of text are crucial to understand the contextual meaning. Convolutional neural networks are already common in computer vision models to process and extract relationships in multidimensional data. Therefore, natural language processing models have already been combined with computer vision models in the past, to bene efit from e.g. positional information and to improve performance of these key information extraction models. Existing models were either trained on unpublished data sets or on an annotated collection of receipts, which did not focus on PDF-like documents. Hence, in this research project a template-based document generator was created to compare state-of-theart models for information extraction. An existing information extraction model "Chargrid" (Katti et al., 2019) was reconstructed and the impact of a bounding box regression decoder, as well as the impact of an NLP pre-processing step was evaluated for information extraction from documents. The results have shown that NLP based pre-processing is beneficial for model performance. However, the use of a bounding box regression decoder increases the model performance only for fields that do not follow a rectangular shape.
Original languageEnglish
Title of host publicationAdvances in Semantics and Explainability for NLP: Joint proceedings of the DeepOntoNLP and X-SENTIMENT Workshops
Pages47-53
Number of pages7
Volume2918
Publication statusPublished - 1 Jan 2021
EventJoint 2nd International Workshop on Deep Learning Meets Ontologies and Natural Language Processing and 6th International Workshop on Explainable Sentiment Mining and Emotion Detection - Online, Hersonissos, Greece
Duration: 6 Jun 20217 Jun 2021
Conference number: 2

Publication series

SeriesCEUR Workshop Proceedings
ISSN1613-0073

Workshop

WorkshopJoint 2nd International Workshop on Deep Learning Meets Ontologies and Natural Language Processing and 6th International Workshop on Explainable Sentiment Mining and Emotion Detection
Abbreviated titleDeepOntoNLP and X-SENTIMENT 2021
Country/TerritoryGreece
CityHersonissos
Period6/06/217/06/21

Keywords

  • Bounding Box Regression Decoder
  • Document Generator
  • Key Information Extraction

Cite this