KVP10k: A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

Oshri Naparstek, Ophir Azulai, Inbar Shapira, Elad Amrani, Yevgeny Yaroker, Yevgeny Burshtein, Roi Pony, Nadav Rubinstein, Foad Abo Dahood, Orit Prince, Idan Friedman, Christoph Auer, Nikolaos Livathinos, Maksym Lysak, Ahmed Nassar, Peter Staar, Udi Barzelay

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academy, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where the extraction process revolves around extracting information using a specific, predefined set of keys. Unlike most existing datasets and benchmarks, our focus is on discovering key-value pairs (KVPs) without relying on predefined keys, navigating through an array of diverse templates and complex layouts. This task presents unique challenges, primarily due to the absence of comprehensive datasets and benchmarks tailored for non-predetermined KVP extraction. To address this gap, we introduce KVP10k , a new dataset and benchmark specifically designed for KVP extraction. The dataset contains 10707richly annotated images. In our benchmark, we also introduce a new challenging task that combines elements of KIE as well as KVP in a single task. KVP10k sets itself apart with its extensive diversity in data and richly detailed annotations, paving the way for advancements in the field of information extraction from complex business documents.

Original languageEnglish
Title of host publicationDocument Analysis and Recognition - ICDAR 2024 - 18th International Conference, Proceedings
EditorsElisa H. Barney Smith, Marcus Liwicki, Liangrui Peng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages97-116
Number of pages20
ISBN (Print)9783031705328
DOIs
StatePublished - 2024
Externally publishedYes
Event18th International Conference on Document Analysis and Recognition, ICDAR 2024 - Athens, Greece
Duration: 30 Aug 20244 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14804 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference18th International Conference on Document Analysis and Recognition, ICDAR 2024
Country/TerritoryGreece
CityAthens
Period30/08/244/09/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

Fingerprint

Dive into the research topics of 'KVP10k: A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents'. Together they form a unique fingerprint.

Cite this