計測自動制御学会論文集
Online ISSN : 1883-8189
Print ISSN : 0453-4654
ISSN-L : 0453-4654
論文
自然勾配を用いたポートハミルトン系のための強化学習の高速化
福永 修一岩本 有生
著者情報
ジャーナル フリー

2023 年 59 巻 2 号 p. 70-76

詳細
抄録

In this paper, we accelerated a reinforcement learning algorithm for port-Hamiltonian systems using a natural gradient method. The proposed algorithm consists of an actor-critic structure wherein the actor generates a control input according to a policy and learns the policy using a temporal difference (TD) error, and the critic computes the TD error and learns a state-value function. Furthermore, the reinforcement learning algorithm for port-Hamiltonian systems has two types of the policy parameters which the proposed algorithm learns using the natural gradient method. Additionally, the proposed method was applied to the problem of swing-up control for an inverted pendulum through numerical simulation. The simulation result showed that the learning speed of the proposed method was higher than that of the existing method.

著者関連情報
© 2023 公益社団法人 計測自動制御学会
前の記事 次の記事
feedback
Top