Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
35th (2021)
Session ID : 4J2-GS-6e-01
Conference information

Temporal Expression Classification Based on Data Labelling with Word Alignment in Japanese-English Parallel Corpus
*Kazutaka KINUGAWAHitoshi ITOHideya MINOIsao GOTOIchiro YAMADA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Temporal expression recognition is a long-standing problem in natural language processing (NLP). One difficulty of this task is to disambiguate specific temporal expressions which change the meanings depending on their contexts. Especially in Japanese news domain, this is an essential issue since these temporal expressions frequently occur and consequently mislead NLP systems. One of the effective approaches to tackle this problem is to build a supervised classification model, but a huge cost is required to prepare an enough amount of labeled training data. In this paper, we present an automatic data labelling method for such a Japanese specific temporal term. We leverage word alignment in Japanse-English parallel corpus and resolve their ambiguities based on both Japanese and English side information. We efficiently build a dataset and conduct a manual inspection against this dataset to confirm the efficacy of our technique. We train several baseline models on this dataset and obtain consistent performance.

Content from these authors
© 2021 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top