Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
34th (2020)
Session ID : 2I5-GS-2-02
Conference information

Function approximation of Cognitive Satisficing Value Function
*Yuki YOSHIIYu KONOTatsuji TAKAHASHI
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Humans have a tendency in decision-making called satisficing: they stop exploring more when they find an option above a criterion (aspiration level). Risk-sensitive Satisficing (RS) model is a value function that enables efficient non-random exploration and realizes satisficing in reinforcement learning (Tamatsukuri & Takahashi, 2019). To apply RS to continuous state spaces, we extended RS to Linear RS (LinRS) for function approximation and test its performance in the contextual bandit problems. As a result, it was found that the algorithm had better performance in probabilistic environments than the existing algorithms. Also, it was found that the aspiration level needed to be corrected because of the approximation error.

Content from these authors
© 2020 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top