Host: The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
In recent years, much attention has been given to deep reinforcement learning, which is one of the artificial intelligence technologies that combines reinforcement learning and deep learning. Deep reinforcement learning, for example, has already shown better performance than humans in games such as Go and Atari video games. Whereas, the progress of its application to real-world tasks beyond artificially limited environments has been slow, and this fact may mean the necessity of other approaches. We focused, in this study, on natural reinforcement learning, which sets an aspiration level and finds quality in rewards. Risk-sensitive Satisficing (RS), an algorithm for natural reinforcement learning, has already demonstrated certain target-oriented exploration and its efficiency in table-based reinforcement learning. However, the current RS employs a Deterministic policy, meaning the difficulty of its application to using probability distributions which deep reinforcement learning draws on. In this study, we extended the Deterministic policy to a Stochastic policy, and verified whether its performances are as good as those of existing table-based reinforcement learning tasks.