Host: The Japanese Society for Artificial Intelligence
Name : The 103rd SIG-SLUD
Number : 103
Location : [in Japanese]
Date : March 20, 2025 - March 22, 2025
Pages 68-73
This paper presents the spoken dialogue system we developed for the 7th Dialogue System Live Competition. To accomplish the competition's travel-planning tasks while providing a better user experience, we designed the system to present evidence that matches user preferences, using retrieval-augmented generation together with a variety of gestures and facial expressions. The system consists of three main blocks. The first block determines the next dialogue state and tasks based on the evaluation results, so that the system can respond flexibly to a wide range of topics. The second block incorporates candidate responses into prompts and then generates convincing proposals supported by images and maps. The third block generates expressive turn-taking and behavior for the virtual agent, such as facial expressions, gestures, and backchannels. As a result, the system placed second in the preliminary round with high evaluation scores.
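The three-block structure described above can be illustrated with a minimal sketch. This is not the authors' implementation: the toy knowledge base `SPOTS`, the state names, the keyword-based preference extraction, and every function name below are assumptions made for illustration. A real system would call a language model with the prompt that block 2 builds; here the prompt is only constructed and returned.

```python
from dataclasses import dataclass, field

# Hypothetical toy knowledge base: spot name -> descriptive tags.
SPOTS = {
    "Spot A": {"garden", "history"},
    "Spot B": {"food", "shopping"},
    "Spot C": {"nature", "garden"},
}
KNOWN_TAGS = set().union(*SPOTS.values())


@dataclass
class DialogueContext:
    state: str = "ask_preference"
    preferences: set = field(default_factory=set)


def decide_next_state(ctx, utterance):
    """Block 1 (assumed logic): evaluate the utterance and pick the next state."""
    words = {w.strip(".,!?").lower() for w in utterance.split()}
    ctx.preferences |= words & KNOWN_TAGS
    ctx.state = "propose" if ctx.preferences else "ask_preference"
    return ctx.state


def retrieve(preferences, k=2):
    """Block 2a (assumed logic): rank spots by tag overlap with the preferences."""
    scored = [(len(tags & preferences), name) for name, tags in SPOTS.items()]
    ranked = sorted(scored, key=lambda item: (-item[0], item[1]))
    return [name for score, name in ranked if score > 0][:k]


def build_prompt(ctx, candidates):
    """Block 2b: place the retrieved candidates into the generation prompt."""
    return (
        f"User preferences: {', '.join(sorted(ctx.preferences))}\n"
        f"Candidate spots: {', '.join(candidates)}\n"
        "Propose one spot and explain why it matches the preferences."
    )


# Block 3 (assumed mapping): agent behavior chosen per dialogue state.
BEHAVIORS = {
    "ask_preference": {"face": "neutral", "gesture": "head_tilt"},
    "propose": {"face": "smile", "gesture": "point_at_map"},
}


def respond(ctx, utterance):
    """Run the three blocks in order for one user turn."""
    state = decide_next_state(ctx, utterance)
    candidates = retrieve(ctx.preferences) if state == "propose" else []
    prompt = build_prompt(ctx, candidates) if candidates else None
    return {
        "state": state,
        "candidates": candidates,
        "prompt": prompt,
        "behavior": BEHAVIORS[state],
    }
```

Calling `respond(DialogueContext(), "I like garden and history.")` moves the dialogue into the hypothetical `propose` state, retrieves the spots whose tags overlap with the stated preferences, and pairs the resulting prompt with a matching agent behavior.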