Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
34th (2020)
Session ID : 4Q3-GS-9-04
Conference information

Information Extraction from Financial Documents by Syntactic Parsing and Table Analysis
*Yasuhiro SOGAWAMisa SATOKohsuke YANAI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In the financial domain, when an investor makes an investment decision, he/she reads the necessary information from documents disclosed in accordance with the issuance of share certificates and corporate bonds. The financial disclosure documents are published in XBRL format and consists of a plurality of text blocks and tables, where the necessary information are scattered in the form of natural language. Extracting the information from disclosure documents and managing it continuously with DB is desirable.However, the cost is expensive to extract by hand because of the large number of the documents consisting of about 40 to 60 items to be required. In this manuscript, we apply the natural language processing techniques to the disclosure document and report the result of the extraction of the necessary information by pattern matching of syntax tree and table analysis.

Content from these authors
© 2020 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top