Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 4Xin1-03
Conference information

Named entity recognition for corporate names in news text data and identification of the name by characteristics of neighboring words
*Koutarou TAMURAAkira KITAUCHIAtsushi TAKAYAMA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

We used huge news data distributed by SPEEDA service, which include not only news on general interest but also business and industry-specific topics, and built a model to extract corporate information appearing in the text as named entity. In this study, we proposed a method to extract the corporate names in the text after segmented by tokenizers and the extracted were matched with a corporate name dictionary added with their automatically-generated abbreviations and so on. Thereby, we succeed in extracting a named entity which is identified as a corporate name and the method improved the accuracy of the task of extracting corporate names and identification of the company.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top