電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<ソフトコンピューティング>
状態予測型強化学習システム
小林 邦和中野 浩二呉本 尭大林 正直
著者情報
ジャーナル フリー

2008 年 128 巻 8 号 p. 1303-1311

詳細
抄録

The present paper proposes a new reinforcement learning (RL) system called a state predictor based RL system in order to solve the explosion of state space and create cooperative behaviors in multi-agent systems. The proposed system realizes a predictive function by representing both the present and the next state-action groups with ITPM which is one of incremental topology maps. The proposed system is applied to pursuit problem, and its performance is evaluated by comparing with conventional RL method through computer simulations. The experimental result shows that the proposed system can appropriately learn in a complex environment which is hardly solved by conventional RL. Furthermore, it is confirmed that the proposed system can acquire cooperative strategies.

著者関連情報
© 電気学会 2008
前の記事 次の記事
feedback
Top