2022, Volume 2022, Issue FIN-029, pp. 32-38
This paper proposes the Understanding of non-Financial Objects in Financial Reports (UFO) task. The UFO task aims to develop techniques for extracting structured information from tabular data and documents, with a focus on annual securities reports. We will provide a dataset based on annual securities reports and organize an evaluation-based workshop for participants. The UFO task consists of two subtasks: table data extraction (TDE) and text-to-table relationship extraction (TTRE). The TDE subtask aims to correctly extract the entries and values in the tables of annual securities reports. The TTRE subtask aims to link the values contained in those tables to the relevant statements in the text. This paper gives an overview of the UFO task.
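To make the two subtasks concrete, the following is a minimal Python sketch of what their inputs and outputs might look like. It is not the task's official data format or a baseline method: the `Cell` schema, the function names `extract_cells` and `link_sentences`, the exact-string matching rule, and the sample table and sentences are all assumptions invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cell:
    """One table cell: its entry (row/column label) and value. Hypothetical schema."""
    cell_id: str
    entry: str
    value: str


def extract_cells(rows):
    """TDE-style step: turn a header row plus data rows into entry/value cells."""
    header, *body = rows
    cells = []
    for r, row in enumerate(body):
        label, *values = row
        for c, value in enumerate(values):
            # The entry combines the row label with the matching column header.
            cells.append(Cell(f"r{r}c{c}", f"{label} / {header[c + 1]}", value))
    return cells


def link_sentences(cells, sentences):
    """TTRE-style step: link each cell to sentences that contain its value verbatim.

    Exact substring matching is a deliberately naive stand-in for a real linker.
    """
    return {
        cell.cell_id: [i for i, s in enumerate(sentences) if cell.value in s]
        for cell in cells
    }


# Invented sample data, not taken from any actual securities report.
rows = [
    ["Item", "FY2020", "FY2021"],
    ["Number of employees", "1,200", "1,350"],
]
sentences = [
    "The number of employees rose to 1,350 at the end of FY2021.",
    "The company operates three business segments.",
]

cells = extract_cells(rows)
links = link_sentences(cells, sentences)
print(links)
```

In this sketch, the TDE step produces one `Cell` per table value, and the TTRE step maps each cell identifier to the indices of the sentences that mention its value. A realistic system would also need to handle merged cells, units, and paraphrased numbers, which exact matching cannot capture.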