The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
Session ID: 1A1-C15

Adjusting Exploration and Exploitation in Deep Reinforcement Learning with Parameter Noise That Accounts for Reward Contribution Rate
*狩野 泉実, 田中 一敏, 新山 龍馬, 國吉 康夫
Conference proceedings / abstract collection, free access

Abstract

In recent years, reinforcement learning has advanced rapidly alongside deep learning and has achieved strong performance not only in game playing but also in the continuous control of robots. Reinforcement learning requires exploratory behavior, and action noise is widely used to produce it. Recent studies have tackled exploration problems in deep reinforcement learning by using parameter noise, and it has been shown experimentally that parameter noise explores better than the commonly used action noise. However, existing methods either take a long time to update the noise distribution or explore uniformly in a huge parameter space because they use an isotropic noise distribution. This paper proposes a method that improves the update of the noise distribution to achieve faster learning.
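The distinction between action noise and isotropic parameter noise that the abstract relies on can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy linear policy, the function names, and the noise scale `sigma` are hypothetical and do not come from the paper, and the sketch does not implement the proposed update of the noise distribution, which the abstract does not specify.

```python
import random


def linear_policy(weights, state):
    """Deterministic toy policy: one action value per weight row (hypothetical)."""
    return [sum(w * s for w, s in zip(row, state)) for row in weights]


def act_with_action_noise(weights, state, sigma, rng):
    """Action noise: add Gaussian noise to the policy output at every step."""
    return [a + rng.gauss(0.0, sigma) for a in linear_policy(weights, state)]


def perturb_parameters(weights, sigma, rng):
    """Parameter noise: add isotropic Gaussian noise (same sigma for every weight)."""
    return [[w + rng.gauss(0.0, sigma) for w in row] for row in weights]


rng = random.Random(0)
weights = [[0.5, -0.2], [0.1, 0.3]]  # illustrative values only
state = [1.0, 2.0]

# Action noise: the same state yields a different action on each call.
noisy_action_1 = act_with_action_noise(weights, state, 0.1, rng)
noisy_action_2 = act_with_action_noise(weights, state, 0.1, rng)

# Parameter noise: perturb the weights once, then act deterministically,
# so the same state always maps to the same action under one perturbation.
noisy_weights = perturb_parameters(weights, 0.1, rng)
action_a = linear_policy(noisy_weights, state)
action_b = linear_policy(noisy_weights, state)
```

Because `perturb_parameters` uses a single `sigma` for all weights, it perturbs every direction of the parameter space equally, which is the uniform exploration behavior the abstract identifies as a limitation of isotropic noise.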

© 2018 The Japan Society of Mechanical Engineers