人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
論文
環境の変化に適応するマルチエージェントの学習手法
今福 啓
著者情報
ジャーナル フリー

2006 年 21 巻 2 号 p. 153-166

詳細
抄録
In this paper, several agents construct the multi-agent system, and each agent defines its action to maximize the reward that can obtain from the environment without communicating each other. However, if the environment around the multi-agent system changes, the action which can obtain the high reward also changes, so each agent should adapt to the change of the environment to obtain the high reward. Therefore, each agent is required to recognize the change of the environment through the acquired reward and should learn which action will obtain a high reward.
To adapt to the change of the environment, we propose a new learning method for multi-agent system. In the proposed method, each agent has a matrix named ``transition probability matrix'' that expresses which action will obtain the high reward in the future time. Each agent updates the element of the matrix by using not only the acquired reward but also the entropy of the matrix. The update procedure of the matrix is classified into three cases according to the increase or the decrease of the acquired reward and the entropy of the matrix in the past time.
Some simulations were done by using the proposed method. The results show that each agent can adapt to the change of the reward and obtain the high reward.
著者関連情報
© 2006 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事
feedback
Top