Host : The Japanese Society for Artificial Intelligence
Name : The 37th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 37
Location : [in Japanese]
Date : June 06, 2023 - June 09, 2023
When humans face an unfamiliar reinforcement learning task, they usually search quickly until they reach a certain level of performance and stop searching once that level is achieved. This tendency motivated the search method Risk-sensitive Satisficing (RS), proposed in previous studies. We have shown that RS is more efficient at trial-and-error search and performs as well as or better than conventional methods that aim for optimization. RS has been extended to learning with state transitions by combining it with Global Reference Conversion (RS+GRC), a method that converts the aspiration level for the task as a whole into an aspiration level for each state. However, while the current RS+GRC performs well when the optimal aspiration level is given, how to proactively adjust the aspiration level has not been discussed in depth. In this study, we propose a dynamic, stepwise goal modification algorithm for reinforcement learning based on goal attainment, aiming to deal with tasks in which the scale of the reward function and the level of task attainment are unknown.
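The satisficing behavior described above can be illustrated with a minimal sketch for a multi-armed bandit. Everything here is an assumption made for illustration and is not taken from this abstract: the value formula (trial ratio times the gap between an arm's mean reward and the target level), the names `rs_select` and `run_bandit`, and the Bernoulli reward setup. It does not implement GRC or the goal modification algorithm proposed in this study.

```python
import random


def rs_select(counts, means, aspiration):
    """Pick an arm by a satisficing value (illustrative assumption).

    value_i = (n_i / N) * (mean_i - aspiration)
    Arms below the target get negative values, so rarely tried arms are
    preferred (exploration); an arm above the target gets a positive value
    that grows with its trial ratio (exploitation).
    """
    total = sum(counts)
    if total == 0:
        return 0  # no data yet: start with the first arm
    values = [(n / total) * (m - aspiration) for n, m in zip(counts, means)]
    return max(range(len(values)), key=values.__getitem__)


def run_bandit(probs, aspiration, steps, seed=0):
    """Run the selection rule on hypothetical Bernoulli arms."""
    rng = random.Random(seed)
    counts = [0] * len(probs)
    means = [0.0] * len(probs)
    for _ in range(steps):
        arm = rs_select(counts, means, aspiration)
        reward = 1.0 if rng.random() < probs[arm] else 0.0
        counts[arm] += 1
        # incremental update of the sample mean
        means[arm] += (reward - means[arm]) / counts[arm]
    return counts, means


if __name__ == "__main__":
    counts, means = run_bandit([0.2, 0.8], aspiration=0.5, steps=500)
    print(counts, means)
```

In this sketch the target level is fixed in advance, which corresponds to the setting the abstract identifies as a limitation: the rule needs a suitable target before learning starts.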