JSAI Technical Report, SIG-SLUD (Special Interest Group on Spoken Language Understanding and Dialogue Processing)
Online ISSN : 2436-4576
Print ISSN : 0918-5682
105th Meeting (2025/11)
Conference information

Emotional Dialogue Breakdown Detection in Spoken Dialogue Using FiLM
中畔 彪雅, 吉野 幸一郎
Conference proceedings and abstracts (access requires authentication)

p. 31-36

Abstract

When the same linguistic content is spoken with different acoustic nuances, particularly different expressed emotions, a dialogue system's response must match the nuance that was actually conveyed. However, existing SLMs such as Qwen2-Audio are not necessarily robust to such differences. In this work, we define a task of detecting whether a system's response is consistent or inconsistent with the emotion label of the utterance, and we build a model to make this prediction. We hypothesize that emotion labels act as a control signal that modulates how the text is interpreted, and we construct a prediction model based on Feature-wise Linear Modulation (FiLM).
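
The abstract does not describe the model architecture, so the following is only a minimal sketch of the general FiLM idea it refers to: a conditioning vector (here, an emotion embedding) produces a per-feature scale `gamma` and shift `beta` that are applied to text features as `gamma * x + beta`. The class name `FiLMLayer`, the dimensions, the number of emotion classes, and the random initialization are all assumptions made for illustration, not details taken from the paper.

```python
import numpy as np


class FiLMLayer:
    """Generic Feature-wise Linear Modulation: out = gamma(c) * x + beta(c).

    Illustrative only; not the authors' model.
    """

    def __init__(self, cond_dim, feat_dim, rng):
        # Linear maps from the conditioning vector to a per-feature scale and shift.
        self.w_gamma = rng.normal(0.0, 0.02, size=(cond_dim, feat_dim))
        self.b_gamma = np.ones(feat_dim)   # bias of 1 so an all-zero condition leaves x unscaled
        self.w_beta = rng.normal(0.0, 0.02, size=(cond_dim, feat_dim))
        self.b_beta = np.zeros(feat_dim)   # bias of 0 so an all-zero condition adds no shift

    def __call__(self, x, cond):
        gamma = cond @ self.w_gamma + self.b_gamma  # shape: (batch, feat_dim)
        beta = cond @ self.w_beta + self.b_beta     # shape: (batch, feat_dim)
        return gamma * x + beta


# Hypothetical sizes, chosen only for this example.
NUM_EMOTIONS, EMO_DIM, FEAT_DIM = 4, 16, 8

rng = np.random.default_rng(0)
emotion_table = rng.normal(size=(NUM_EMOTIONS, EMO_DIM))  # one embedding per emotion label
film = FiLMLayer(cond_dim=EMO_DIM, feat_dim=FEAT_DIM, rng=rng)

# Stand-in for encoded text features of two utterance/response pairs.
text_features = rng.normal(size=(2, FEAT_DIM))
emotion_ids = np.array([0, 3])

modulated = film(text_features, emotion_table[emotion_ids])
```

In a consistency detector of the kind the abstract describes, the modulated features would presumably feed a binary classifier (consistent vs. inconsistent); that classifier is omitted here because the abstract gives no details about it.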

© 2025 The Japanese Society for Artificial Intelligence