Host: The Japanese Society for Artificial Intelligence
Name : The 32nd Annual Conference of the Japanese Society for Artificial Intelligence, 2018
Number : 32
Date : June 05, 2018 - June 08, 2018
Obtaining human-level control through reinforcement learning (RL) requires massive amounts of training. Furthermore, it is difficult for deep learning-based RL methods such as the deep Q-network (DQN) to obtain stable control. In this paper, we propose a novel deep reinforcement learning method that learns stable controls efficiently. Our approach leverages the technique of experience replay and the replay buffer architecture: we manually create a desirable transition sequence and store it in the replay buffer at the beginning of training. This hand-crafted transition sequence allows the agent to avoid random action selection and locally optimal policies. Experimental results on a lane-changing task in autonomous driving show that the proposed method can efficiently acquire stable control.
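The seeding idea can be sketched as a replay buffer that is pre-filled with a hand-crafted transition sequence before any environment interaction. The buffer class and the demonstration transitions below are illustrative assumptions for this sketch, not the paper's actual state representation or implementation.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size FIFO buffer of (state, action, reward, next_state, done) transitions."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def add(self, transition):
        self.buffer.append(transition)

    def seed(self, demonstration):
        # Store a hand-crafted transition sequence before training begins,
        # so early minibatches contain desirable behavior instead of only
        # transitions from random exploration.
        for transition in demonstration:
            self.add(transition)

    def sample(self, batch_size):
        # Uniform sampling over stored transitions, as in standard DQN.
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)


# Hypothetical lane-change demonstration: states and actions here are
# placeholders, not the representation used in the paper.
demo = [((0, 0), 1, 0.1, (0, 1), False),
        ((0, 1), 1, 0.1, (1, 1), False),
        ((1, 1), 0, 1.0, (1, 2), True)]

buffer = ReplayBuffer(capacity=10000)
buffer.seed(demo)          # demonstration stored at the start of training
batch = buffer.sample(2)   # minibatch for a subsequent DQN update
```

Because the buffer has a fixed capacity, the seeded transitions are eventually overwritten by the agent's own experience as training proceeds.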