Target-Driven Navigation Based on Transformer

Rui FUKUSHIMA; Yusuke YOSHIYASU

doi:10.1299/jsmermd.2021.1A1-G17

Abstract

This paper presents a target-driven visual navigation technique that can exploit long-term history for navigating an agent to a given target image. In particular we use Transformer architecture that has been developed in the natural language field and can handle long-term temporal dependencies. Experimental results showed that the use of Transformer improves the navigation performance to new target images by utilizing long-term history and also improves the data efficiency, especially in large-scale environments. We also conducted an ablation study to show how the number of training frames affects the navigation performance. This results in the accuracy of the proposed method improving while the baseline decreases as the number of training frames increases.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!