変分学習によるスパース擬似入力ガウス過程方策探索

佐々木 光; 小澤 裕斗; 松原 崇充

doi:10.1299/jsmermd.2018.1A1-C16

抄録

In this paper, we introduce a policy search reinforcement learning method with a sparse non-parametric policy model. We formulate policy search as a variational learning problem. A sparse pseudo-input Gaussian processes (SPGP) is placed as a prior distribution of the control policy, then a variational lower bound of the expected reward is derived, which is optimized w.r.t. the hyper parameters and the pseudo-input variables. We conducted numerical simulations and real robot experiments, and confirmed the effectiveness of our proposed method.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

The Effects of Anemia Artificially Induced by Bleeding on Blood Constituents, Concentration of ATP-Related Compounds and Na⁺-K⁺ ATPase Activity in Gill and Kidney of Coho Salmon
Recovery from pregnancy-induced down-regulation of cytochrome P450 isozymes (CYPs) protein levels in postpartum rat liver
Evaluation of bivalve shell growth based on combined sclerochronological and geochemical analysis: Application of geochemical technique to fisheries science.
Empirical Study on Fair Allocation of Joint Project by Cooperative Game Theory—A Case of Local Bus Transportation Service in Japan—
Bone Mineralization and Physical Development of Children

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）