1A1-M03 高次元空間における行動生成のための大域的・局所的最適制御法(進化・学習とロボティクス)

関口 拓生; 小林 祐一

doi:10.1299/jsmermd.2011._1A1-M03_1

抄録

Reinforcement learning is effective in acquisition of optimal control policy. However, the calculation amount increases in high-dimensional space. In this paper, we propose a global and local optimal control method using dynamic programming(DP) and differential dynamic programming(DDP). In the global part, approximate the optimal trajectory in the state space by DP. In the local part, optimize the approximate trajectory in the neighborhood by DDP. The proposed method can reduce the calculation amount in optimal control.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

ハロゲンランプ用高色温度変換反射板の開発

発行機関からのお知らせ

会員向け購読者番号とパスワードは以下URLよりご確認下さい。
https://www.jsme.or.jp/publication/proceedings/

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）