主催: 人工知能学会
会議名: 第105回言語・音声理解と対話処理研究会
回次: 105
開催地: 東京科学大学大岡山キャンパス 蔵前記念会館 くらまえホール
開催日: 2025/11/10 - 2025/11/11
p. 150-151
In collaborative problem-solving, particularly in technical domains like mathematics, discussions often combine spoken dialogue with a shared visual space, such as a whiteboard. A critical challenge for comprehending these interactions is resolving the reference between ambiguous expressions in dialogue (e.g., pronouns) and the specific symbols or equations written on the board.To address this, and drawing inspiration from research in Visually-Grounded Dialogue, we propose a new annotation schema for capturing the discourse structure of these multimodal discussions by explicitly linking dialogue utterances to their corresponding element on the whiteboard.