人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
速報論文
複利型強化学習
松井 藤五郎
著者情報
ジャーナル フリー

2011 年 26 巻 2 号 p. 330-334

詳細
抄録
This paper describes a reinforcement learning framework based on compound returns, which is called compound reinforcement learning. Compound reinforcement learning maximizes the compound return in returns-based MDPs. We also describe compound Q-learning algorithm. We present experimental results using an ilustrative example, 2-armed bandit.
著者関連情報
© 2011 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事
feedback
Top