Host: The Japanese Society for Artificial Intelligence
Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 35
Location : [in Japanese]
Date : June 08, 2021 - June 11, 2021
Temporal expression recognition is a long-standing problem in natural language processing (NLP). One difficulty of this task is to disambiguate specific temporal expressions which change the meanings depending on their contexts. Especially in Japanese news domain, this is an essential issue since these temporal expressions frequently occur and consequently mislead NLP systems. One of the effective approaches to tackle this problem is to build a supervised classification model, but a huge cost is required to prepare an enough amount of labeled training data. In this paper, we present an automatic data labelling method for such a Japanese specific temporal term. We leverage word alignment in Japanse-English parallel corpus and resolve their ambiguities based on both Japanese and English side information. We efficiently build a dataset and conduct a manual inspection against this dataset to confirm the efficacy of our technique. We train several baseline models on this dataset and obtain consistent performance.