Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
General Paper (Peer-Reviewed)
Building a Commonsense Inference Dataset based on Basic Events and its Application
Kazumasa OmuraDaisuke KawaharaSadao Kurohashi
Author information
JOURNAL FREE ACCESS

2023 Volume 30 Issue 4 Pages 1206-1239

Details
Abstract

We propose a method for building a commonsense inference dataset based on basic events. Specifically, we automatically extract contingent pairs of basic event expressions such as “I'm hungry, so I have a meal” from text, verify by crowdsourcing, and automatically generate commonsense inference problems regarding the contingent relation between basic events. We built a commonsense inference dataset of 100k problems by the proposed method and conducted experiments to investigate the model performance. The results showed that there is a performance gap between high-performance language models and humans. In addition, we automatically generated large-scale pseudo problems by utilizing the scalability of the proposed method and investigated the effects by the data augmentation on the commonsense inference task and the related tasks. The results demonstrated the effectiveness of learning extensive contingent knowledge for both the commonsense inference task and the related tasks, which suggests the importance of contingent reasoning.

Content from these authors
© 2023 The Association for Natural Language Processing
Previous article Next article
feedback
Top