Host: The Japanese Society for Artificial Intelligence
Name : 74th SIG-SLUD
Number : 74
Location : [in Japanese]
Date : July 22, 2015
Pages 08-
In this paper, we report on the activities of a preparatory project for building a large-scale corpus of conversational Japanese (NINJAL collaborative research project, 2014/7/1-2015/8/31). The aims of this project are to establish i) a corpus design for collecting various kinds of everyday conversations in a balanced manner, ii) a methodology for recording naturally occurring conversations, and iii) a transcription system suitable for efficiently transcribing natural conversations. We first describe the survey of everyday conversational behavior, which was conducted last year to inform the corpus design. We then discuss how to record naturally occurring conversations, focusing on technological and ethical issues.