深層満足化強化学習に向けて

佐鳥 玖仁朗; 吉田 豊; 神谷 匠; 高橋 達二

doi:10.11517/pjsai.JSAI2019.0_2P1J201

33rd (2019)

Session ID : 2P1-J-2-01

DOI https://doi.org/10.11517/pjsai.JSAI2019.0_2P1J201

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 33rd Annual Conference of the Japanese Society for Artificial Intelligence, 2019

Number : 33

Location : [in Japanese]

Date : June 04, 2019 - June 07, 2019

Toward Deep Satisficing Reinforcement Learning

*Kuniaki SATORI, Yutaka YOSHIDA, Takumi KAMIYA, Tatsuji TAKAHASHI

Author information

Keywords: reinforcement learning, Trade-off between exploration and knowledge use, intrinsic motivation

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

For dealing with continuous state spaces, DQN and other algorithms have been proposed in reinforcement learning (RL). However, it is hard for DQN to explore efficiently, as it depends on random search strategies such as epsilon-greedy. Humans are known to effectively search and learn through "satisficing" instead of optimizing. Although the risk-sensitive satisificing (RS) algorithm enables satisficing in RL, it depends on the count of visiting each state, which poses a problem for continuous spaces. We propose a method for solving this problem by pseudocount and hash+auto encoder methods that enables intrinsically motivated exploration. Through two experiments, we show that RS combined with the two methods enables deep satisficing RL that searches and learns efficiently in continuous spaces.

Corresponding author

Conference information

Register with J-STAGE for free!