集中型マルチエージェント強化学習法の高速化

赤羽根 拓真; 飯間 等

doi:10.1541/ieejeiss.140.242

Abstract

For multiagent environments, a centralized reinforcement learner can find optimal policies, but it is time-consuming. A method is proposed for finding the optimal policies acceleratingly. The method basically uses the centralized learner and supplementarily uses independent learners in the former phase. The independent learners transfer their learning results to the centralized learner, but excessive transfers cause the failure of learning. Therefore the independent learners should stop according to an appropriate condition. However, it is difficult for this method to find optimal policies for environments in which initial states are far from termination states. In order to find the optimal policies acceleratingly for such environments, this paper proposes multiagent reinforcement learning methods introducing new stop conditions.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!