状態予測型強化学習システム

小林 邦和; 中野 浩二; 呉本 尭; 大林 正直

doi:10.1541/ieejeiss.128.1303

抄録

The present paper proposes a new reinforcement learning (RL) system called a state predictor based RL system in order to solve the explosion of state space and create cooperative behaviors in multi-agent systems. The proposed system realizes a predictive function by representing both the present and the next state-action groups with ITPM which is one of incremental topology maps. The proposed system is applied to pursuit problem, and its performance is evaluated by comparing with conventional RL method through computer simulations. The experimental result shows that the proposed system can appropriately learn in a complex environment which is hardly solved by conventional RL. Furthermore, it is confirmed that the proposed system can acquire cooperative strategies.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

【電気学会会員の方】購読している論文誌を無料でご覧いただけます（会員ご本人のみの個人としての利用に限ります）。購読者番号欄にMyページへのログインIDを，パスワード欄に生年月日8ケタ（西暦，半角数字。例：19800303）を入力して下さい。

ダウンロード

論文(PDF)の閲覧方法はこちら
閲覧方法 (389.7K)

前身誌

電気学会論文誌. C

電氣學會雜誌

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）