Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Short Paper
Compound Reinforcement Learning
Tohgoroh Matsui

2011 Volume 26 Issue 2 Pages 330-334

Abstract
This paper describes compound reinforcement learning, a reinforcement learning framework based on compound returns. Compound reinforcement learning maximizes the compound return in returns-based MDPs. We also describe the compound Q-learning algorithm and present experimental results on an illustrative example, a two-armed bandit.
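The abstract does not define the compound return, so the following is only a minimal sketch under stated assumptions. It takes "compound return" in the usual finance sense, the product of per-step growth factors (1 + f * r), and contrasts it with the additive sum of the same per-step rates. The function names, the investment fraction `f`, and the payoff sequences of the two arms are hypothetical and are not taken from the paper.

```python
import math


def simple_return(rates):
    """Additive return: the plain sum of per-step rates of return."""
    return sum(rates)


def compound_return(rates, f=1.0):
    """Compound return (assumed definition): product of growth factors.

    `f` is a hypothetical fraction of wealth invested at every step.
    """
    growth = 1.0
    for r in rates:
        growth *= 1.0 + f * r
    return growth


def log_compound_return(rates, f=1.0):
    """Logarithm of the compound return, which turns the product into a sum."""
    return sum(math.log(1.0 + f * r) for r in rates)


# Two hypothetical arms with (almost exactly) the same additive return.
safe_arm = [0.05, 0.05, 0.05, 0.05]    # small, steady gains
risky_arm = [1.0, -0.9, 1.0, -0.9]     # large swings

if __name__ == "__main__":
    for name, rates in (("safe", safe_arm), ("risky", risky_arm)):
        print(name, simple_return(rates), compound_return(rates))
```

Under these assumptions the two arms are indistinguishable by additive return, yet the steady arm grows the initial wealth while the volatile arm shrinks it, which is the kind of difference a compound-return objective is meant to capture.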
© 2011 JSAI (The Japanese Society for Artificial Intelligence)