The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
2023
Session ID : 2P1-D11
Conference information

Investigation of Effective Exploration Methods in Reinforcement Learning Control of a Flapping UAV
*Kentaro HIRAIMiku SAITOZhi LILifeng XIEShunto SASAZAKITakanobu WATANABE
Author information
CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details
Abstract

We conducted investigation into an effective scheduling method of the exploration in a reinforcement learning algorithm, aiming at the control of a flapping unmanned aerial vehicle (UAV) we have developed. Deep Q Network (DQN) algorithm was employed to determine optimal gain parameters of PID control of the Yaw angle of the airframe. Although the Yaw angle can be stabilized by this PID-DQN hybrid method, we noticed that the gain parameters tend to be biased toward highly rated values in the early stages of the learning. In this study, we solved this problem by modifiying the scheduling of epsilon-greedy method in DQN.

Content from these authors
© 2023 The Japan Society of Mechanical Engineers
Previous article Next article
feedback
Top