未知評価関数を有する連続時間最適制御問題におけるベイズ的最適化手法

豊田 充

doi:10.9746/sicetr.55.100

抄録

This study presents an extension of Bayesian learning approach with Gaussian process regression focusing on continuous-time optimal control problem in which stage cost function is unknown. By applying control parametrization method, the optimal control problem can be approximately formulated as a nonlinear programming problem, and the statistics of the cost function estimated by Gaussian process regression is analyzed. To obtain a solution to Bayesian optimization problem, an effective gradient calculation based on variational method is developed. Furthermore, the analysis of optimality in the fashion of bandit problem provides the order of regret bound achieved by the proposed algorithm.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

シンポジウム:和算史研究の現状と課題 : 特に現代の数学教育に関連して(2000年度年会報告)
表紙
Annual Reproductive Cycle in the Scincid Lizard Chalcides viridanus from Tenerife, Canary Islands
Hereditary Leiomyomatosis and Renal Cell Cancer; HLRCC

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）