日本神経回路学会誌
Online ISSN : 1883-0455
Print ISSN : 1340-766X
ISSN-L : 1340-766X
研究論文
強化学習における適応的状態空間構成法
鮫島 和行大森 隆司
著者情報
ジャーナル フリー

1999 年 6 巻 3 号 p. 144-154

詳細
抄録
For the application of reinforcement learning to real-world problems, an internal state space has to be constructed from a high dimensional observation space. The algorithm presented here constructs the internal state space during the course of learning desirable actions, and assigns local basis functions adaptively depending on the task requirement. The internal state space initially has only one basis function over the entire observation space, and that basis is eventually divided into smaller ones due to the statistical property of locally weighted temporal difference error. The algorithm was applied to an autonomous robot collision avoidance problem, and the validity of the algorithm was evaluated to show, for instance, the need of a smaller number of basis functions in comparison to other method.
著者関連情報
© 1999 日本神経回路学会
前の記事 次の記事
feedback
Top