Abstract
We propose a new method for defining the constants of Complex-Valued Q-learning, a Reinforcement Learning algorithm that can deal with incomplete perception problems. By applying complex numbers to value functions, it enables agents without sufficient perception to recognize the context of their actions to some degree. We improve this method in two ways. First, we predict contexts not from adjacent situations, as before, but from the number of actions an agent has taken since a starting situation. Second, we use memory efficiently, in proportion to the number of steps required to obtain rewards. With these methods, our agents successfully solved more complex incomplete perception problems. We also consider a context-based design method for Complex-Valued Reinforcement Learning.
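To illustrate the general idea of attaching phase (context) information to value functions, the following is a minimal, hypothetical sketch, not the authors' exact formulation. It assumes a tabular agent whose Q-values are complex numbers and whose internal reference phasor rotates by an assumed hyperparameter `beta` on each step, so the same observation can yield different action preferences depending on how many steps have elapsed since the starting situation.

```python
import cmath
import random

class ComplexQAgent:
    """Illustrative sketch of a complex-valued Q-learning agent.

    Q[obs][a] is a complex number; a reference phasor that rotates
    with the step count lets phase encode action context.
    """

    def __init__(self, n_obs, n_actions, alpha=0.1, gamma=0.9,
                 beta=cmath.pi / 6):
        # beta is the per-step phase rotation (a hypothetical
        # hyperparameter chosen for this sketch).
        self.Q = [[0j] * n_actions for _ in range(n_obs)]
        self.alpha, self.gamma, self.beta = alpha, gamma, beta
        self.n_actions = n_actions

    def act(self, obs, step, eps=0.1):
        # Reference phasor encodes the number of steps taken so far.
        ref = cmath.exp(-1j * self.beta * step)
        if random.random() < eps:
            return random.randrange(self.n_actions)
        # Prefer the action whose Q-value best aligns with the reference.
        return max(range(self.n_actions),
                   key=lambda a: (self.Q[obs][a] * ref).real)

    def update(self, obs, a, r, next_obs, step):
        ref_next = cmath.exp(-1j * self.beta * (step + 1))
        best = max((self.Q[next_obs][b] * ref_next).real
                   for b in range(self.n_actions))
        # Rotate the TD target so its phase stays consistent with the
        # step count at which (obs, a) was visited.
        target = (r + self.gamma * best) * cmath.exp(1j * self.beta * step)
        self.Q[obs][a] += self.alpha * (target - self.Q[obs][a])
```

In this sketch, two visits to the same observation at different step counts see different rotated references, which is one simple way a perceptually aliased state can be disambiguated by context.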