The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
2019
Session ID : 1P2-A09
Conference information

Application of Soft Q-learning to Maze Solving Probrem and Verification of Policy Compositionality
*Junki MATSUOKAYoshihisa TSURUMINETakamitsu MATSUBARA
Author information
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details
Abstract

Learning of the robot movement with reinforcement learning (RL) has attracted attention, and improvements of various RL methods have been carried out intensively. With conventional RL methods, however, a complicated task takes a long learning process, which is problematic in the robotics domain. In this paper, we focused on the compositionality of policies of Soft Q-learning (SoftQL). With SoftQL, it is possible to compose multiple already-learned policies and execute compound tasks efficiently. However, in the SoftQL, the action-sampling procedure and learning algorithm are complex due to the continuous action space. In this paper, we applied the SoftQL to a maze-solving problem which has discrete space and investigated its performance and computational tractability for discrete-space problems.

Content from these authors
© 2019 The Japan Society of Mechanical Engineers
Previous article Next article
feedback
Top