Abstract
Most reinforcement learning algorithms have difficulty achieving optimal performance when the state of the environment is only partially observable. The authors have previously proposed a method for overcoming this problem by incorporating recurrent neural networks into a learning agent. In this paper, we discuss the implementation of the proposed method using several network architectures and supervised learning algorithms. Furthermore, the internal representation of the environment acquired by the learning agent is examined using cluster analysis. The results show that, despite incomplete perception of the state of the environment, the learning agent achieves optimal performance in reinforcement learning tasks by constructing an accurate internal model.
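The core idea summarized above, an agent that compensates for partial observability by maintaining an internal state through a recurrent network, can be illustrated with a minimal sketch. This is not the authors' actual architecture; all sizes, weights, and names below are illustrative assumptions, and training (e.g., by backpropagation through time) is omitted.

```python
import numpy as np

rng = np.random.default_rng(0)

OBS_DIM, HID_DIM, N_ACTIONS = 4, 8, 2  # toy sizes, chosen arbitrarily

# Randomly initialized weights of an Elman-style recurrent network
# (a stand-in for whatever architecture the paper actually uses).
W_in = rng.normal(scale=0.1, size=(HID_DIM, OBS_DIM))    # observation -> hidden
W_rec = rng.normal(scale=0.1, size=(HID_DIM, HID_DIM))   # hidden -> hidden (recurrence)
W_out = rng.normal(scale=0.1, size=(N_ACTIONS, HID_DIM)) # hidden -> action values

def step(obs, h):
    """One time step: update the agent's internal state and produce action values."""
    h_new = np.tanh(W_in @ obs + W_rec @ h)  # internal state summarizes history
    q = W_out @ h_new                        # action values from internal state
    return q, h_new

# Because the hidden state carries history, two identical observations reached
# via different histories can yield different internal states, which is what
# lets the agent disambiguate perceptually aliased states of the environment.
h = np.zeros(HID_DIM)
for obs in [np.array([1.0, 0, 0, 0]), np.array([0, 1.0, 0, 0])]:
    q, h = step(obs, h)
greedy_action = int(np.argmax(q))
```

The recurrence `W_rec @ h` is the crucial element: it is what allows the agent to build the kind of internal model of the environment whose cluster structure the paper then analyzes.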