The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
2022
Session ID : 2A2-K03
Conference information

Autonomous driving with safe reinforcement learning using rule-based judgment
*Gaku MINAMOTOToshimitsu KANEKONoriyuki HIRAYAMA
Author information
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details
Abstract

Reinforcement learning (RL) in safety critical domains like autonomous driving requires safe exploration, but conventional safe reinforcement learning methods have low performance at the initial stage of learning. In this paper, we present a RL method that selects action using independent Q-functions on rule-based policy and RL policy to address this issue. In our method, Q-function on rule-based policy is pre-trained using offline data and Q-function on RL policy is initialized randomly. Our method selects action by comparing both Q-functions and increases probability of selecting rule-based action at initial stage of learning. We conduct experiments on driving lane selection task, and find that our approach can improve performance at the initial stage of learning.

Content from these authors
© 2022 The Japan Society of Mechanical Engineers
Previous article Next article
feedback
Top