人工知能学会全国大会論文集
Online ISSN : 2758-7347
33rd (2019)
セッションID: 2D3-E-4-03
会議情報

A Multimodal Target-Source Classifier Model for Object Fetching from Natural Language Instructions
*Aly MAGASSOUBAKomei SUGIURAHisashi KAWAI
著者情報
会議録・要旨集 フリー

詳細
抄録

In this paper, we address the fetching task from ambiguous instructions. A typical fetching task consists of picking up a target object specified by ambiguous instructions. We specifically propose a multimodal target-source classifier model (MTCM) that grounds the instructions in the scene. More explicitly, MCTM can predict the likelihood of a target object in addition to the source of this target using linguistic and visual features. Our approach improves the accuracy of the previous state-of-the-art method for target object prediction in fetching task.

著者関連情報
© 2019 The Japanese Society for Artificial Intelligence
前の記事 次の記事
feedback
Top