世界モデルを利用したプレイデータの学習に基づく実ロボット行動生成とデータ拡張

野村 優太; 村田 真悟

doi:10.11517/pjsai.JSAI2023.0_2G6OS21f02

Abstract

Robots are expected to perform various tasks in complex environments with a high generalization ability, similar to that of humans. Generally, imitation learning with expert demonstrations has high efficiency but low generalization ability. In contrast, reinforcement learning with explorations has high generalization ability but low efficiency. To combine their strengths, we focus on ``play data'' collected by humans teleoperating a robot with curiosity. Specifically, we propose a framework for real-world robot control and data augmentation based on world model learning from play data. Robot experiments demonstrated that the robot with the framework can perform goal-conditioned object manipulation tasks. Furthermore, we also found that simulation in the world model can create novel combinations that are not included in the original play data. These findings suggest that further learning the augmented data has the potential to enable the robot to acquire higher generalization ability.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!