Transactions of the Society of Instrument and Control Engineers
Online ISSN : 1883-8189
Print ISSN : 0453-4654
ISSN-L : 0453-4654
Reinforcement Learning in Multi-dimensional State-action Space Using Random Tiling and Gibbs Sampling
Hajime KIMURA
Author information
JOURNAL FREE ACCESS

2006 Volume 42 Issue 12 Pages 1336-1343

Details
Abstract
In real-robot applications, learning controllers are often required to obtain control rules over high-dimensional continuous state-action space. Random tile-coding is a promising method to deal with high-dimensional state space for representing the state value function. However, there is no standard reinforcement learning scheme to deal with action selection in high-dimensional action space, especially the probability of action variables are mutually dependent. This paper introduces a new action selection scheme using random tile-coding and Gibbs sampling, and shows the Q-learning algorithm applying the proposed scheme. We demonstrate it through a Rod in maze problem and a redundant arm reaching task.
Content from these authors
© The Society of Instrument and Control Engineers (SICE)
Previous article Next article
feedback
Top