日本ロボット学会誌
Online ISSN : 1884-7145
Print ISSN : 0289-1824
ISSN-L : 0289-1824
階層型学習機構における状態行動空間の構成
高橋 泰岳浅田 稔
著者情報
ジャーナル フリー

2003 年 21 巻 2 号 p. 164-171

詳細
抄録

This paper proposes a multi-layered reinforcement learning system that integrates lower learning modules and generates one of higher purposive behaviors based on which an autonomous robot learns from lower level behaviors to higher level ones through its life time. We decompose a large state space at the bottom level into several subspaces and merge those subspaces at the higher level. This allows the system to reuse the policies already learned and to learn the policy against the new features. As a result, curse of dimension is avoided. To show its validity, we apply the proposed method to a simple soccer situation in the context of RoboCup, and show the experimental results.

著者関連情報
© 社団法人 日本ロボット学会
前の記事 次の記事
feedback
Top