Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
32nd (2018)
Session ID : 2A3-02
Conference information

Trials Reduction Method for Reinforcement Learning in Trajectory Discovery
*Yusuke KATOTomoaki NAKAMURATakayuki NAGAINatsuki YAMANOBEKazuyuki NAGATAJun OZAWA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In recent years, there are many researches of deep reinforcement learning to realize autonomous motion of robots. In deep reinforcement learning, a large number of trials such as thousands of times or more are required to realize sufficient performance as a learning result. However, learning in a real environment often requires assistance by people, so it is difficult to do thousands of trials. In this research, we create a learning database from efficient reinforcement learning that utilizes knowledge about tasks given by people in advance, and realize learning with a relatively small number of trials by performing mini batch learning using that database. We apply our proposed method to learning of picking task in the logistics warehouse and show the usefulness of our proposed method by comparing the results with other methods.

Content from these authors
© 2018 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top