Improving Learning Efficiency in Compositional Robot Tasks Using Prior Knowledge from Large Language Models

髙城 頌太; 松嶋 達也; 岩澤 有祐; 松尾 豊

doi:10.11517/pjsai.JSAI2024.0_3O1OS16b05

Abstract

Large language models have shown strong general performance across a wide range of tasks, and their applications are expanding beyond natural language processing into many other fields. Many existing studies use large language models for robot control, but most use them only for action planning in compositional tasks, and they fail when the selected action is one the robot has not been prepared for in advance. In other words, the prior knowledge in large language models can be used for policy selection during inference, but it cannot be used during actual policy learning. In this paper, we aim to decompose a task using prior knowledge from a large language model and to apply reinforcement learning intensively only to the steps that fail, so that the robot can acquire a new policy with minimal interaction with the environment.
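
The idea in the abstract (decompose a task, find the steps that fail, and train only those) can be illustrated with a minimal sketch. This is not the authors' implementation: `decompose_task`, `find_failed_steps`, `train_failed_steps`, the toy planner, and the skill dictionary are all hypothetical names introduced here, the planner is a plain function standing in for a large language model, and `train_fn` is a placeholder for a reinforcement learning routine.

```python
from typing import Callable, Dict, List

Skill = Callable[[], bool]  # a skill returns True on success (assumption)


def decompose_task(task: str, planner: Callable[[str], List[str]]) -> List[str]:
    """Ask a planner (standing in for an LLM) for an ordered list of sub-steps."""
    return planner(task)


def find_failed_steps(steps: List[str], skills: Dict[str, Skill]) -> List[str]:
    """Try each step with the existing skills; a missing or failing skill marks the step as failed."""
    failed = []
    for step in steps:
        skill = skills.get(step)
        if skill is None or not skill():
            failed.append(step)
    return failed


def train_failed_steps(
    failed: List[str],
    skills: Dict[str, Skill],
    train_fn: Callable[[str], Skill],
) -> None:
    """Spend training effort only on the failed steps (train_fn is a stand-in for RL)."""
    for step in failed:
        skills[step] = train_fn(step)


# Toy usage: the planner and skills below are made up for illustration.
def toy_planner(task: str) -> List[str]:
    return ["reach", "grasp", "place"]


skills: Dict[str, Skill] = {"reach": lambda: True, "place": lambda: True}
steps = decompose_task("put the block in the box", toy_planner)
failed = find_failed_steps(steps, skills)  # only "grasp" has no skill yet
train_failed_steps(failed, skills, lambda step: (lambda: True))
remaining = find_failed_steps(steps, skills)
```

In this sketch only the step without a working skill is passed to the training placeholder, which mirrors the stated goal of limiting interaction with the environment to the parts of the task the robot cannot yet perform.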
