深層強化学習のハイパーパラメータと報酬関数のベイズ最適化～移動ロボットへの適用～

曽田 涼介; 西村 拓人; 堀内 匡

doi:10.1541/ieejeiss.145.190

抄録

Deep reinforcement learning is a machine learning method that combines deep learning and reinforcement learning. Deep Q-network (DQN) is one of the typical methods of deep reinforcement learning. DQN uses Convolutional Neural Network (CNN) which can extract features from the input images. We have applied DQN method to the mobile robot navigation problem. The values of hyper-parameters, including the network structure of DQN, and the reward function used in the DQN algorithm, have been determined empirically. In this study, we attempt to optimize both of the values of hyper-parameters and reward function of deep reinforcement learning by using Bayesian optimization. We realized to optimize the values of hyper-parameters including the network structure of DQN, and the reward function by using Optuna, a framework of Bayesian optimization. We confirmed that the values of hyper-parameters and reward function obtained by Optuna have higher learning performance than that by empirical method.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

【電気学会会員の方】購読している論文誌を無料でご覧いただけます（会員ご本人のみの個人としての利用に限ります）。購読者番号欄にMyページへのログインIDを，パスワード欄に生年月日8ケタ（西暦，半角数字。例：19800303）を入力して下さい。

ダウンロード

論文(PDF)の閲覧方法はこちら
閲覧方法 (327.9K)

前身誌

電気学会論文誌. C

電氣學會雜誌

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）