Host: The Japanese Society for Artificial Intelligence
Name : 34th Annual Conference, 2020
Number : 34
Location : Online
Date : June 09, 2020 - June 12, 2020
In this research, the robot learned skillful behaviors performed by humans to perform the product alignment task in retail stores. Humans can perform more optimal actions by using different strategies for the same task in different initial environments. Therefore, we proposed a system in which the robot can autonomously select a strategy according to the initial state. We created multiple alignment behavior models obtained by simulation-based reinforcement learning and a selector to use them properly. As a result of performing the alignment task using our system, the alignment was more accurate than when only one model was used. In addition, using the model learned on the simulation, we confirmed that the alignment behavior was possible in the real environment.