Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Short Paper
Optimizing Betting Fraction in Compound Reinforcement Learning
Tohgoroh MatsuiTakashi GotoKiyoshi IzumiYu Chen
Author information
JOURNAL FREE ACCESS

2013 Volume 28 Issue 3 Pages 267-272

Details
Abstract
This paper describes optimization of the betting fraction parameter in compound reinforcement learning. Compound reinforcement learning maximizes the expected logarithm of compound returns in return-based MDPs. However, a new betting fraction parameter is introduced in order not to diverge values to negative infinity and it causes a problem of choosing the parameter. In this paper, we proposed a method to optimize the betting fraction with on-line gradient ascent in compound reinforcement learning.
Content from these authors
© 2013 JSAI (The Japanese Society for Artificial Intelligence)
Previous article Next article
feedback
Top