電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<ソフトコンピューティング・学習>
集中型マルチエージェント強化学習法の高速化
赤羽根 拓真飯間 等
著者情報
ジャーナル 認証あり

2020 年 140 巻 2 号 p. 242-248

詳細
抄録

For multiagent environments, a centralized reinforcement learner can find optimal policies, but it is time-consuming. A method is proposed for finding the optimal policies acceleratingly. The method basically uses the centralized learner and supplementarily uses independent learners in the former phase. The independent learners transfer their learning results to the centralized learner, but excessive transfers cause the failure of learning. Therefore the independent learners should stop according to an appropriate condition. However, it is difficult for this method to find optimal policies for environments in which initial states are far from termination states. In order to find the optimal policies acceleratingly for such environments, this paper proposes multiagent reinforcement learning methods introducing new stop conditions.

著者関連情報
© 2020 電気学会
前の記事 次の記事
feedback
Top