軌道学習における試行回数削減のための強化学習手法

嘉藤 佑亮; 中村 友昭; 長井 隆行; 山野辺 夏樹; 永田 和之; 小澤 順

doi:10.11517/pjsai.JSAI2018.0_2A302

32nd (2018)

Session ID : 2A3-02

DOI https://doi.org/10.11517/pjsai.JSAI2018.0_2A302

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 32nd Annual Conference of the Japanese Society for Artificial Intelligence, 2018

Number : 32

Location : [in Japanese]

Date : June 05, 2018 - June 08, 2018

Trials Reduction Method for Reinforcement Learning in Trajectory Discovery

*Yusuke KATO, Tomoaki NAKAMURA, Takayuki NAGAI, Natsuki YAMANOBE, Kazuyuki NAGATA, Jun OZAWA

Author information

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

In recent years, there are many researches of deep reinforcement learning to realize autonomous motion of robots. In deep reinforcement learning, a large number of trials such as thousands of times or more are required to realize sufficient performance as a learning result. However, learning in a real environment often requires assistance by people, so it is difficult to do thousands of trials. In this research, we create a learning database from efficient reinforcement learning that utilizes knowledge about tasks given by people in advance, and realize learning with a relatively small number of trials by performing mini batch learning using that database. We apply our proposed method to learning of picking task in the logistics warehouse and show the usefulness of our proposed method by comparing the results with other methods.

Corresponding author

Conference information

Register with J-STAGE for free!