2A1-M05 強化学習における報酬情報に基づいた学習率の制御法(進化・学習とロボティクス)

鎌田 徹平; 中澤 和夫

doi:10.1299/jsmermd.2007._2A1-M05_1

抄録

Recently, as robotics has been developed, it is expected that robots are workforce taking place of people. Conventionally, rules for controlling the robots are designed by a designer, but under unknown environment with various uncertainties, it is difficult to design the rules previously. Therefore, there are many researches on controlling autonomous robots with learning abilities, especially reinforcement learning attracts attention. Reinforcement learning largely depends on parameters. Accordingly, it is necessary to develop an algorithm which can set parameters autonomously according to the state of the environment and the advance condition of learning while learning. In this research, it aims at developing a new algorithm controlling the parameters based on only reward information while learning.

著者関連情報

お気に入り & アラート

閲覧履歴

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）