Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 4G1-GS-4-02

Extracting Named Entities from Press Releases Using Anomaly Detection Techniques
*Reiji KIFUNE, Ayahiko NIIMI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

This paper introduces a novel approach that extracts named entities from press-release texts as outliers using anomaly detection techniques, and validates its effectiveness and potential applicability to company research. The study uses the local outlier factor (LOF), a density-based anomaly detection technique known for its robust performance even in high-dimensional spaces. Specifically, the approach first applies pretrained FastText to the entire press-release text to convert nouns into vectors, leveraging FastText’s adaptability to unknown words. These vectors are then fed into LOF to detect outliers. In the experiments, the proposed method successfully extracted the eight types of named entities defined by IREX as outliers. However, the identified outliers also included several words that do not meet the definition of a named entity, so noise was present in the output.
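
To make the outlier-scoring step concrete, the following is a minimal sketch of LOF in plain Python. It is not the authors' implementation. The words, the 2-D vectors, the neighbour count k=2, and the cutoff of 1.5 are all made-up assumptions for illustration; in the pipeline described in the abstract, the vectors would instead come from a pretrained FastText model, and the paper's actual parameters are not stated here.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def lof_scores(vectors, k=2):
    """Return one LOF score per vector (values well above 1 suggest an outlier).

    Simplification: each point gets exactly k neighbours, with ties broken
    by index, rather than every point within the k-distance.
    """
    n = len(vectors)
    if not 1 <= k < n:
        raise ValueError("k must satisfy 1 <= k < number of vectors")

    neighbors, k_distance = [], []
    for i in range(n):
        dists = sorted(
            (euclidean(vectors[i], vectors[j]), j) for j in range(n) if j != i
        )
        nearest = dists[:k]
        neighbors.append([j for _, j in nearest])
        k_distance.append(nearest[-1][0])  # distance to the k-th neighbour

    # Local reachability density: inverse of the mean reachability distance.
    lrd = []
    for i in range(n):
        reach = [
            max(k_distance[j], euclidean(vectors[i], vectors[j]))
            for j in neighbors[i]
        ]
        # Small constant guards against division by zero for duplicate vectors.
        lrd.append(1.0 / (sum(reach) / k + 1e-10))

    # LOF: mean ratio of the neighbours' density to the point's own density.
    return [sum(lrd[j] for j in neighbors[i]) / (k * lrd[i]) for i in range(n)]

# Hypothetical nouns with made-up 2-D vectors (stand-ins for FastText output).
words = ["company", "product", "service", "market", "sales", "AcmeCorp"]
vectors = [
    (0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (0.5, 0.5),  # dense cluster
    (5.0, 5.0),                                                   # isolated point
]

scores = lof_scores(vectors, k=2)
threshold = 1.5  # assumed cutoff, chosen only for this toy example
flagged = [w for w, s in zip(words, scores) if s > threshold]
print(flagged)
```

In this toy data the common nouns sit in a dense cluster and receive scores close to 1, while the isolated vector receives a much larger score and is the only word flagged. Real FastText vectors have hundreds of dimensions, and both k and the cutoff would need tuning, which is consistent with the noise the abstract reports among the detected outliers.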

© 2024 The Japanese Society for Artificial Intelligence