Host: The Japanese Society for Artificial Intelligence
Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 35
Location : [in Japanese]
Date : June 08, 2021 - June 11, 2021
We humans tend to search for a satisfiable action above an acceptability threshold (satisficing). A value function that implements satisficing together with the prospect theory-like risk attitudes called “risk-sensitive satisficing” (RS) model shows superior results in the bandit problems. However, wider application and analysis of the behavior of the model is intractable in some ways, because of the deterministic nature of the policy. In this study, we introduce the stochastic version of RS (SRS). Through comparison of RS and SRS in stationary and non-stationary environments, we show the merits of SRS.