Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Original Paper
Extraction of causal information from PDF files of the summary of financial statements of companies
Hiroyuki SakaiHiroko NishizawaShogo MatsunamiHiroki Sakaji
Author information
JOURNAL FREE ACCESS

2015 Volume 30 Issue 1 Pages 172-182

Details
Abstract
In this paper, we propose a method of extracting causal information from PDF files of the summary of financial statements of companies, e.g., ''The sales of smart phones was expanded continually''. Cause information is useful for investors in selecting companies to invest. We downloaded 106,885 PDF files of the summary of financial statements of companies from Web pages of the companies automatically. Our method extracts causal information from the PDF files by using clue expressions (e.g., ''was expanded'') and keywords relevant to a company. The clue expressions are extracted from the PDF files of the summary of financial statements of companies and articles concerning business performance of companies automatically. We developed the search system which is able to retrieve causal informations extracted by our method. The search system shows causal information containing a keyword inputted by users, and the summary of financial statements containing the retrieved causal information. We evaluated our method and it attained 83.91% precision and 55.04% recall, respectively. Moreover, we compared our method with Sakai et al's method originally proposed for extracting causal information from financial articles concerning business performance of companies and experimental results showed that our method outperforms Sakai et al's method.
Content from these authors
© The Japanese Society for Artificial Intelligence 2015
Previous article Next article
feedback
Top