計測自動制御学会論文集
Online ISSN : 1883-8189
Print ISSN : 0453-4654
ISSN-L : 0453-4654
論文
無限プラント集合で表わされる変動に対するロバスト強化学習
泉田 啓大坪 立サミュエル谷 百合夏
著者情報
ジャーナル フリー

2016 年 52 巻 9 号 p. 474-480

詳細
抄録
In a general reinforcement learning problem, a learning policy for an estimated plant is applied to a real plant. However, if the difference between the two plants is large, the learning policy is not effective. Therefore, a learning policy for a variation plant set, including elements made by adding variations to the estimated plant, is obtained. However, the number of elements of the set is infinite. To solve this problem, we discretize the infinite plant set by using the relationships between the structures of the plants. The policy that is proper for all the elements of the finite set obtained by the discrete approximations is also proper for all the elements of the original infinite plant set. Using the relationships between the structures of plants and policy, the properness of policy, which is the solution of the relaxation problem for the finite plant set, is revealed. The effectiveness of the proposed method is demonstrated by numerical examples.
著者関連情報
© 2016 公益社団法人 計測自動制御学会
前の記事 次の記事
feedback
Top