Abstract
Typical methods for solving reinforcement learning problems alternate between two steps: policy evaluation and policy improvement. This study proposes policy evaluation algorithms that improve learning efficiency. The proposed algorithms, based on the Krylov Subspace Method (KSM), are tens to hundreds of times more efficient than existing algorithms based on Stationary Iterative Methods (SIM). Algorithms based on KSM are far more efficient than has generally been expected. This study clarifies, through numerical examples and theoretical discussion, what makes KSM-based algorithms so efficient.