主催: 一般社団法人 日本機械学会
会議名: ロボティクス・メカトロニクス 講演会2021
開催日: 2021/06/06 - 2021/06/08
In this paper, we propose a policy search reinforcement learning method with a non-parametric policy model and self-triggered control. We formulate a self-triggered policy search that employs a control policy and an execution length policy to reduce the number of action decisions in a trial. Our method employs sparse Gaussian process as a policy model with a self-triggered control framework, and its update law for maximizing return is derived based on variational Bayesian learning. We conducted simulations for a reaching task in a two-dimensional environment and confirmed the effectiveness of our proposed method.