Compound Reinforcement Learning (複利型強化学習)

Tohgoroh Matsui (松井 藤五郎)

doi:10.1527/tjsai.26.330

Transactions of the Japanese Society for Artificial Intelligence

Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714

Short Paper

Keywords: reinforcement learning, value functions, compound returns, Q-learning

2011 Volume 26 Issue 2 Pages 330-334

DOI https://doi.org/10.1527/tjsai.26.330

Abstract

This paper describes compound reinforcement learning, a reinforcement learning framework based on compound returns. Compound reinforcement learning maximizes the compound return in returns-based MDPs. We also describe the compound Q-learning algorithm and present experimental results on an illustrative example, a two-armed bandit.
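
To illustrate why maximizing a compound return can differ from maximizing an expected per-step return, here is a minimal sketch in Python. It is not the paper's algorithm or its exact definitions: it assumes that the full amount is reinvested at every step, so a sequence of rates of return r compounds as the product of (1 + r), and the two arms, their probabilities, and their rates are hypothetical values chosen only for illustration.

```python
import math


def compound_return(rates):
    """Compound a sequence of per-step rates of return: product of (1 + r)."""
    total = 1.0
    for r in rates:
        total *= 1.0 + r
    return total


def expected_rate(outcomes):
    """Expected per-step rate of return for (probability, rate) pairs."""
    return sum(p * r for p, r in outcomes)


def expected_log_growth(outcomes):
    """Expected log(1 + r) per step; its sign decides long-run compound growth."""
    return sum(p * math.log(1.0 + r) for p, r in outcomes)


# Hypothetical two-armed bandit (values are assumptions, not from the paper).
risky_arm = [(0.5, 1.0), (0.5, -0.9)]  # doubles or loses 90%, equally likely
safe_arm = [(1.0, 0.02)]               # always gains 2%

for name, arm in [("risky", risky_arm), ("safe", safe_arm)]:
    print(name, expected_rate(arm), expected_log_growth(arm))
```

Under these assumed numbers the risky arm has the higher expected rate of return, yet its expected log growth is negative, so repeatedly reinvesting in it shrinks the compound return, while the safe arm grows steadily. An agent that ranks arms by compound return would therefore prefer the safe arm.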
