Proceedings of the 35th Annual Conference of JSAI (2021)
Online ISSN: 2758-7347
Session ID: 4G2-GS-2k-02

Semi-supervised Global Representation Learning of Trajectory for Matching Vision and Language in Navigation Task
*Kei AKUZAWA, Yusuke IWASAWA, Yutaka MATSUO
Abstract

Constructing agents that can understand natural language instructions is useful, for example, for developing robots that perform household chores. However, training agents that adapt to diverse language instructions and environments via imitation learning requires a huge amount of paired data consisting of <trajectory, language instruction> pairs. To tackle this problem, existing research has proposed training a speaker model that generates language instructions from trajectories, and annotating unlabeled trajectories with the artificially generated instructions. In this paper, to facilitate learning of the speaker model, we propose extracting a latent representation from a trajectory with semi-supervised representation learning, using both the paired data and additional unpaired trajectory data. Specifically, we constrain the latent representation to capture only language-relevant information by exploiting the structure in which a language instruction corresponds to a global representation of the whole trajectory. In experiments, we evaluate the proposed method in the BabyAI environment and show that the representation it extracts from a trajectory captures information about the language instruction.
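The semi-supervised objective sketched in the abstract combines a supervised speaker loss on <trajectory, instruction> pairs with an auxiliary loss on unpaired trajectories, both flowing through a shared global latent representation. The toy code below is a minimal illustration of that structure only: the linear maps, the bag-of-words speaker head, the reconstruction term for the unpaired branch, and the weight `lam` are all hypothetical choices, not the paper's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (hypothetical; the paper does not specify sizes here).
TRAJ_DIM, LATENT_DIM, VOCAB = 8, 4, 10

# Linear "encoder": trajectory features -> global latent representation z.
W_enc = rng.normal(size=(TRAJ_DIM, LATENT_DIM)) * 0.1
# Linear "speaker" head: z -> logits over instruction tokens (bag-of-words toy).
W_spk = rng.normal(size=(LATENT_DIM, VOCAB)) * 0.1
# Linear decoder for the unsupervised branch: z -> trajectory reconstruction.
W_dec = rng.normal(size=(LATENT_DIM, TRAJ_DIM)) * 0.1

def encode(traj):
    """Map a whole trajectory to one global latent vector z."""
    return traj @ W_enc

def speaker_loss(traj, instr_token):
    """Supervised term on a <trajectory, instruction> pair:
    cross-entropy of the speaker's token prediction from z."""
    logits = encode(traj) @ W_spk
    log_probs = logits - np.log(np.exp(logits).sum())
    return -log_probs[instr_token]

def recon_loss(traj):
    """Unsupervised term on an unpaired trajectory: reconstruct the
    trajectory from z, so extra trajectory data shapes the latent space."""
    z = encode(traj)
    return float(((z @ W_dec - traj) ** 2).mean())

# Semi-supervised objective: paired term + weighted unpaired term.
paired_traj, paired_token = rng.normal(size=TRAJ_DIM), 3
unpaired_traj = rng.normal(size=TRAJ_DIM)
lam = 0.5  # trade-off weight between the two terms (hypothetical)
total = float(speaker_loss(paired_traj, paired_token)
              + lam * recon_loss(unpaired_traj))
```

In a real implementation the encoder would be a sequence model over observation-action steps and the speaker an autoregressive decoder; the point of the sketch is only the two-term loss sharing one latent `z`.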

© 2021 The Japanese Society for Artificial Intelligence