商品整列タスクにおける複数のResidual Reinforcement Learningモデルの選択

嘉藤 佑亮; 中村 友昭; 長井 隆行; 山野辺 夏樹; 永田 和之; 小澤 順

doi:10.11517/pjsai.JSAI2020.0_1Q4GS1101

34th (2020)

Session ID : 1Q4-GS-11-01

DOI https://doi.org/10.11517/pjsai.JSAI2020.0_1Q4GS1101

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : 34th Annual Conference, 2020

Number : 34

Location : Online

Date : June 09, 2020 - June 12, 2020

Residual Reinforcement Learning Models Selection Considering Initial State for Item Alignment Task

*Yusuke KATO, Tomoaki NAKAMURA, Takayuki NAGAI, Natsuki YAMANOBE, Nagata KAZUYUKI, Jun OZAWA

Author information

Keywords: AI, Robotics, Reinforcement Learning

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

In this research, the robot learned skillful behaviors performed by humans to perform the product alignment task in retail stores. Humans can perform more optimal actions by using different strategies for the same task in different initial environments. Therefore, we proposed a system in which the robot can autonomously select a strategy according to the initial state. We created multiple alignment behavior models obtained by simulation-based reinforcement learning and a selector to use them properly. As a result of performing the alignment task using our system, the alignment was more accurate than when only one model was used. In addition, using the model learned on the simulation, we confirmed that the alignment behavior was possible in the real environment.

Corresponding author

Conference information

Register with J-STAGE for free!