Policy Search with a Self-Triggered Policy Based on Gaussian Processes

佐々木 光; 松原 崇充

doi:10.1299/jsmermd.2021.1P1-I17

Abstract

In this paper, we propose a policy search reinforcement learning method that combines a non-parametric policy model with self-triggered control. We formulate a self-triggered policy search that uses a control policy and an execution-length policy to reduce the number of action decisions in a trial. The method uses a sparse Gaussian process as the policy model within a self-triggered control framework, and its update law for maximizing the return is derived using variational Bayesian learning. We ran simulations of a reaching task in a two-dimensional environment and confirmed the effectiveness of the proposed method.
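
As a rough illustration of the self-triggered idea described above, the following sketch rolls out a trial in which a control policy chooses an action and an execution-length policy chooses how many steps to hold it, so a new decision is made only when the previous action expires. Everything in it is an assumption made for illustration: the hand-written policies, the toy two-dimensional reaching dynamics, the goal position, the step size, and all function names are hypothetical. It does not use the sparse Gaussian process policy model or the variational Bayesian update from the paper.

```python
import numpy as np

GOAL = np.array([1.0, 1.0])   # hypothetical target position
MAX_STEP = 0.05               # hypothetical per-step displacement limit


def control_policy(state):
    """Hand-written stand-in for a learned control policy: move toward GOAL."""
    direction = GOAL - state
    dist = np.linalg.norm(direction)
    if dist < 1e-9:
        return np.zeros(2)
    return direction / dist * min(MAX_STEP, dist)


def length_policy(state):
    """Hand-written stand-in for an execution-length policy.

    Holds the action longer when the goal is far away (between 1 and 10 steps).
    """
    dist = np.linalg.norm(GOAL - state)
    return int(np.clip(dist / MAX_STEP, 1, 10))


def always_one(state):
    """Baseline length policy: decide a new action at every step."""
    return 1


def step(state, action):
    """Toy dynamics: the state is a 2-D position moved by the action."""
    next_state = state + action
    reward = -np.linalg.norm(GOAL - next_state)
    return next_state, reward


def self_triggered_rollout(control, length, state, horizon):
    """Run one trial, choosing a new action only when the held action expires."""
    t, decisions, total_reward = 0, 0, 0.0
    while t < horizon:
        action = control(state)
        # Never hold an action past the end of the trial.
        hold = max(1, min(int(length(state)), horizon - t))
        decisions += 1
        for _ in range(hold):
            state, reward = step(state, action)
            total_reward += reward
        t += hold
    return state, total_reward, decisions


start = np.zeros(2)
final_state, total_reward, decisions = self_triggered_rollout(
    control_policy, length_policy, start, horizon=50)
_, _, baseline_decisions = self_triggered_rollout(
    control_policy, always_one, start, horizon=50)
print("self-triggered decisions:", decisions)
print("per-step decisions:", baseline_decisions)
```

In this toy setting the self-triggered rollout still reaches the goal while making fewer action decisions than the per-step baseline, which is the quantity the abstract says the method aims to reduce.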


