強化学習における適応的状態空間構成法

鮫島 和行; 大森 隆司

doi:10.3902/jnns.6.144

抄録

For the application of reinforcement learning to real-world problems, an internal state space has to be constructed from a high dimensional observation space. The algorithm presented here constructs the internal state space during the course of learning desirable actions, and assigns local basis functions adaptively depending on the task requirement. The internal state space initially has only one basis function over the entire observation space, and that basis is eventually divided into smaller ones due to the statistical property of locally weighted temporal difference error. The algorithm was applied to an autonomous robot collision avoidance problem, and the validity of the algorithm was evaluated to show, for instance, the need of a smaller number of basis functions in comparison to other method.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

無補剛箱形断面鋼製橋脚の延性破壊解析における損傷進展エネルギーの決定方法の一検討
Static Characteristics and Nonlinear Seismic Response of Concrete-Filled Tubular Arch Bridge with Half-Through Deck
Enhancing the neural differentiation capabilities of genetically asymmetric mouse F1 hybrid embryonic stem cell lines
超音波振動ドラムによる摩擦軽減と記録再生実験
放射線による細胞死の過程とその促進

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）