ルールベースの判定を利用した安全な強化学習による自動運転

皆本 岳; 金子 敏充; 平山 紀之

doi:10.1299/jsmermd.2022.2A2-K03

Abstract

Reinforcement learning (RL) in safety critical domains like autonomous driving requires safe exploration, but conventional safe reinforcement learning methods have low performance at the initial stage of learning. In this paper, we present a RL method that selects action using independent Q-functions on rule-based policy and RL policy to address this issue. In our method, Q-function on rule-based policy is pre-trained using offline data and Q-function on RL policy is initialized randomly. Our method selects action by comparing both Q-functions and increases probability of selecting rule-based action at initial stage of learning. We conduct experiments on driving lane selection task, and find that our approach can improve performance at the initial stage of learning.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!