Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 1B4-GS-2-04
Conference information

Natural reinforcement learning based on Stochastic policy
*Takumi SUZUKIShunpei KOSHIKAWATatsuji TAKAHASHIYu KONO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In recent years, much attention has been given to deep reinforcement learning, which is one of the artificial intelligence technologies that combines reinforcement learning and deep learning. Deep reinforcement learning, for example, has already shown better performance than humans in games such as Go and Atari video games. Whereas, the progress of its application to real-world tasks beyond artificially limited environments has been slow, and this fact may mean the necessity of other approaches. We focused, in this study, on natural reinforcement learning, which sets an aspiration level and finds quality in rewards. Risk-sensitive Satisficing (RS), an algorithm for natural reinforcement learning, has already demonstrated certain target-oriented exploration and its efficiency in table-based reinforcement learning. However, the current RS employs a Deterministic policy, meaning the difficulty of its application to using probability distributions which deep reinforcement learning draws on. In this study, we extended the Deterministic policy to a Stochastic policy, and verified whether its performances are as good as those of existing table-based reinforcement learning tasks.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top