Reinforcement Learning in Multi-dimensional State-action Space Using Random Tiling and Gibbs Sampling

Hajime KIMURA

doi:10.9746/sicetr1965.42.1336

抄録

In real-robot applications, learning controllers are often required to obtain control rules over high-dimensional continuous state-action space. Random tile-coding is a promising method to deal with high-dimensional state space for representing the state value function. However, there is no standard reinforcement learning scheme to deal with action selection in high-dimensional action space, especially the probability of action variables are mutually dependent. This paper introduces a new action selection scheme using random tile-coding and Gibbs sampling, and shows the Q-learning algorithm applying the proposed scheme. We demonstrate it through a Rod in maze problem and a redundant arm reaching task.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

予測評価探索をともなう分枝限定法による出荷バースの操業スケジューリング
S-I-3 新生児横隔膜ヘルニアの治療と問題点
直腸肛門機能 (2)(III-B-9&acd;13)(応募演題(3), 座長まとめ, 第 14 回日本小児外科学会総会)
Wear of Various Textile Guide Materials
「想いをかたちに未来へつなぐ」

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）