羽ばたき型UAVの強化学習制御における効果的な探索法の検討

平井 健太郎; 齋藤 未来; 李 直; 謝 砺鋒; 笹崎 舜翔; 渡邉 孝信

doi:10.1299/jsmermd.2023.2P1-D11

2023

Session ID : 2P1-D11

DOI https://doi.org/10.1299/jsmermd.2023.2P1-D11

Conference information

Host: The Japan Society of Mechanical Engineers

Name : [in Japanese]

Date : June 28, 2023 - July 01, 2023

Investigation of Effective Exploration Methods in Reinforcement Learning Control of a Flapping UAV

*Kentaro HIRAI, Miku SAITO, Zhi LI, Lifeng XIE, Shunto SASAZAKI, Takanobu WATANABE

Author information

Keywords: Flapping UAV, Flight posture control, PID control, Reinforcement learning

CONFERENCE PROCEEDINGS RESTRICTED ACCESS

Details

Abstract

We conducted investigation into an effective scheduling method of the exploration in a reinforcement learning algorithm, aiming at the control of a flapping unmanned aerial vehicle (UAV) we have developed. Deep Q Network (DQN) algorithm was employed to determine optimal gain parameters of PID control of the Yaw angle of the airframe. Although the Yaw angle can be stabilized by this PID-DQN hybrid method, we noticed that the gain parameters tend to be biased toward highly rated values in the early stages of the learning. In this study, we solved this problem by modifiying the scheduling of epsilon-greedy method in DQN.

Corresponding author

Register with J-STAGE for free!