Host: The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
Research on multi-agent path planning using reinforcement learning methods has recently been developed. However, a common problem in this field is the difficulty of agents learning to cooperate with each other, since each agent is motivated by its own reward. In this study, we examined the impact of considering not only self-reward but also those of others. A world model is introduced to predict the future states of the environment. Considering agents' fairness is expected to be an effective solution to address reward bias among agents and ultimately achieve satisfactory performance in real-world applications, such as operating in crowded environments.