主催: The Japanese Society for Artificial Intelligence
会議名: 2019年度人工知能学会全国大会(第33回)
回次: 33
開催地: 新潟県新潟市 朱鷺メッセ
開催日: 2019/06/04 - 2019/06/07
In this paper, we address the fetching task from ambiguous instructions. A typical fetching task consists of picking up a target object specified by ambiguous instructions. We specifically propose a multimodal target-source classifier model (MTCM) that grounds the instructions in the scene. More explicitly, MCTM can predict the likelihood of a target object in addition to the source of this target using linguistic and visual features. Our approach improves the accuracy of the previous state-of-the-art method for target object prediction in fetching task.