Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
36th (2022)
Session ID : 3Yin2-10

Efficient Deep Reinforcement Learning in Large-Scale Environments Using Exploration Criteria by Critic-Attention
*Takuya MURASE, Tsubasa HIRAKAWA, Takayoshi YAMASHITA, Hironobu FUJIYOSHI
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract

Deep reinforcement learning is a method in which an agent learns optimal behavior in an unknown environment by trial and error, relying on the rewards it obtains. It has outperformed humans in various game tasks, such as Atari 2600 games and board games. However, until the agent first obtains a reward, it acts randomly, with no criterion to guide exploration. Therefore, in large and complex environments where opportunities to obtain rewards are scarce, a large number of trials is required before appropriate actions are learned. In this paper, we pre-train a Critic model with a Mask-Attention mechanism and use the resulting attention map as an exploration criterion for the Policy model to enable efficient learning. Experiments using Minecraft show that the proposed method can learn actions efficiently.
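
The abstract does not state how the attention map is converted into an exploration criterion, so the following is only a minimal sketch of one possible reading, not the authors' implementation. It assumes that the pre-trained Critic's attention map has already been reduced to one score per action (the hypothetical `attention_scores`), and that exploratory actions are drawn from a softmax over those scores instead of uniformly at random. The function names, the `epsilon` mixing scheme, and the `temperature` parameter are all assumptions introduced for illustration.

```python
import math
import random


def exploration_probs(attention_scores, temperature=1.0):
    """Turn per-action attention scores into a sampling distribution.

    Assumption: attention_scores[i] summarizes how much the Critic's
    attention map favors the outcome of action i. The abstract does not
    define this reduction; it is hypothetical.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(attention_scores)
    exps = [math.exp((s - top) / temperature) for s in attention_scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(policy_probs, attention_scores, epsilon, rng):
    """Sample an action index.

    With probability epsilon, explore using the attention-derived
    distribution; otherwise sample from the Policy model's own
    action probabilities.
    """
    if rng.random() < epsilon:
        weights = exploration_probs(attention_scores)
    else:
        weights = policy_probs
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


if __name__ == "__main__":
    rng = random.Random(0)
    policy_probs = [0.25, 0.25, 0.25, 0.25]   # untrained policy: uniform
    attention_scores = [0.1, 0.7, 0.1, 0.1]   # hypothetical Critic attention
    print(exploration_probs(attention_scores))
    print(select_action(policy_probs, attention_scores, epsilon=0.5, rng=rng))
```

In this sketch, setting `epsilon` to 0 recovers plain sampling from the policy, while a larger `epsilon` biases early exploration toward actions the Critic attends to, which is one way to replace purely random behavior before the first reward is obtained.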

© 2022 The Japanese Society for Artificial Intelligence