多重学習器を用いる強化学習 ―報酬付与に遅れがある学習環境への適用―

西澤 智恵子; 松井 博和

doi:10.1299/jsmermd.2016.1P1-04b3

抄録

In this report, we apply the proposed reinforcement learning with multiplex learning spaces to an environment with a delay time to get a reward and compare it with an ordinary reinforcement learning without considering the delay in experimental simulations. We adapt the proposed method by multiplexing some learning spaces corresponding to a different delay time each other. As the result of an experiment, the ordinary method couldn't get the best policy, but the proposed method could get it effectively.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

[title in Japanese]
三宅島火山すおう穴—風早噴火におけるマグマ噴火からマグマ水蒸気噴火への推移とそのメカニズム
2257 横波散乱波を用いた材料内部界面の画像化(S26-3 非破壊評価とモニタリング(3),S26 非破壊評価とモニタリング)
Corrosion Issues from a Bird's-eye View of Fukushima Daiichi NPP Decommissioning
Alcohol Consumption and Breast Cancer Risk According to Hormone Receptor Status in Japanese Women: A Case-Control Study

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）