Abstract
Actor-critic approaches and natural-gradient-based methods have recently drawn significant interest in the area of reinforcement learning, and several algorithms have been studied along the lines of the natural actor-critic strategy. This paper considers the problem of improving a previously reported recursive-least-squares (RLS)-based natural actor-critic algorithm toward a version that employs learning rate adaptation. In the actor part of the studied algorithm, the policy parameters are updated using the natural gradient together with learning rate adaptation, while in the critic part, the recursive least-squares method is used to estimate the advantage function and the state value function. The applicability of the studied algorithm is illustrated via the locomotion of a two-link robot arm.
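To make the structure described above concrete, the following is a minimal sketch of one generic natural actor-critic loop with an RLS critic and a simple learning-rate adaptation step. The feature maps, dimensions, the `rls_update` helper, and the sign-agreement adaptation rule are all illustrative assumptions, not the specific algorithm studied in the paper.

```python
import numpy as np

def rls_update(w, P, x, target, lam=0.99):
    """One recursive least-squares step fitting target ~ w @ x."""
    Px = P @ x
    k = Px / (lam + x @ Px)           # RLS gain vector
    w = w + k * (target - w @ x)      # correct by the a-priori error
    P = (P - np.outer(k, Px)) / lam   # covariance update with forgetting
    return w, P

# Dimensions and features are stand-ins for the sketch.
n_v, n_th = 4, 3
v, P_v = np.zeros(n_v), 100.0 * np.eye(n_v)     # critic: V(s) ~= v @ phi(s)
w, P_w = np.zeros(n_th), 100.0 * np.eye(n_th)   # critic: A(s,a) ~= w @ psi(s,a)
w_prev = np.zeros(n_th)
theta = np.zeros(n_th)                          # actor (policy) parameters
alpha, gamma = 0.1, 0.95                        # step size, discount factor

rng = np.random.default_rng(0)
for t in range(1000):
    # Stand-ins for one observed transition (s, a, r, s').
    phi, phi_next = rng.normal(size=n_v), rng.normal(size=n_v)  # state features
    psi = rng.normal(size=n_th)   # compatible features: grad_theta log pi(a|s)
    r = rng.normal()              # reward

    # Critic: RLS fit of the state-value function to the TD target,
    # then RLS fit of the advantage function to the TD error.
    td_target = r + gamma * (v @ phi_next)
    v, P_v = rls_update(v, P_v, phi, td_target)
    delta = td_target - v @ phi                 # TD error as an advantage sample
    w, P_w = rls_update(w, P_w, psi, delta)

    # Actor: with compatible features, w itself serves as the natural
    # gradient of the expected return, so the update is theta += alpha * w.
    theta += alpha * w

    # Learning-rate adaptation (sign-agreement heuristic; an assumed
    # stand-in for the paper's rule): grow alpha while successive
    # natural-gradient estimates agree, shrink it when they conflict.
    alpha = min(alpha * 1.05, 1.0) if w_prev @ w > 0 else max(alpha * 0.7, 1e-4)
    w_prev = w.copy()
```

The separation shown here mirrors the abstract: the critic maintains least-squares estimates of the value and advantage functions, while the actor follows the natural gradient with an adaptively scaled step size.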