Response Timing Estimation with a Generative Model and Parameter Control of Response Substantialness Using Dynamic Prompt-Tune

室町 俊貴; 狩野 芳伸

doi:10.1527/tjsai.39-3_IDS6-C

Abstract

A spoken dialogue system must listen to its human user continuously to sustain a smooth conversation. We propose a method that performs response generation and response timing estimation simultaneously. The method estimates response timing by adding pseudo-samples at points where a response would be inappropriate, which allows the use of text-only conversation datasets without audio information. It also controls the substantialness of responses through a user-specified parameter integrated with the Dynamic-Prompt-Tune method, which uses prompt token embeddings generated dynamically from that parameter. Automatic and manual evaluations showed that, compared with the baseline model, the proposed method generates responses with more natural timing that follow the response substantialness parameter more closely.
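The abstract does not say how the pseudo-samples are built, so the following is only an illustrative sketch of one possible reading, not the authors' procedure. It assumes that each proper prefix of a user utterance is treated as a point where the system should not yet respond, and pairs it with a hypothetical `<wait>` target; the marker name, the function `build_samples`, and the whitespace tokenization are all assumptions made for this example.

```python
from typing import List, Tuple

# Hypothetical marker meaning "do not respond yet"; not taken from the paper.
NO_RESPONSE = "<wait>"


def build_samples(utterance: str, response: str) -> List[Tuple[str, str]]:
    """Turn one text-only dialogue turn into (input, target) training pairs.

    Assumption: every proper prefix of the user's utterance is a point
    where responding would be inappropriate, so it becomes a pseudo-sample
    whose target is the NO_RESPONSE marker. Only the complete utterance is
    paired with the real response.
    """
    words = utterance.split()
    samples: List[Tuple[str, str]] = []
    # Pseudo-samples: incomplete utterances map to the wait marker.
    for end in range(1, len(words)):
        samples.append((" ".join(words[:end]), NO_RESPONSE))
    # Real sample: the full utterance maps to the actual response.
    samples.append((" ".join(words), response))
    return samples


if __name__ == "__main__":
    for model_input, target in build_samples("how are you", "fine, thanks"):
        print(repr(model_input), "->", repr(target))
```

Under these assumptions, a single generative model trained on such pairs would learn both when to stay silent and what to say, using only text transcripts.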
