Transactions of the Society of Instrument and Control Engineers
Online ISSN : 1883-8189
Print ISSN : 0453-4654
ISSN-L : 0453-4654
Paper
Robust Reinforcement Learning for Variations Represented by an Infinite Plant Set
Kei SENDA, RitsuSamuel OTSUBO, Yurika TANI
JOURNAL FREE ACCESS

2016 Volume 52 Issue 9 Pages 474-480

Abstract
In a typical reinforcement learning problem, a policy learned for an estimated plant is applied to the real plant. If the difference between the two plants is large, however, the learned policy may not be effective. We therefore seek a policy for a variation plant set, whose elements are obtained by adding variations to the estimated plant. The difficulty is that this set contains infinitely many elements. To address this, we discretize the infinite plant set by using the relationships between the structures of the plants. A policy that is proper for every element of the finite set obtained by this discrete approximation is also proper for every element of the original infinite plant set. Using the relationships between the structures of the plants and the policy, we also establish the properness of the policy obtained as the solution of a relaxation problem for the finite plant set. The effectiveness of the proposed method is demonstrated by numerical examples.
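The idea of reducing an infinite plant set to finitely many representatives can be illustrated with a small sketch. It is not the authors' algorithm: it assumes that the "structure" of a plant means the pattern of nonzero transition probabilities, and that a policy is "proper" when it reaches a goal state with probability 1 from every state. The chain plant `make_plant`, the helpers `structure` and `is_proper`, and the sampled parameter values are all hypothetical, chosen only for illustration. The sketch also does not cover the relaxation problem mentioned in the abstract.

```python
from collections import deque


def make_plant(p, n=4):
    """Hypothetical chain plant (an assumption, not from the paper).

    Action 0 advances one state with probability p and stays otherwise;
    action 1 always stays. P[a][s][t] is the transition probability.
    """
    advance = [[0.0] * n for _ in range(n)]
    stay = [[0.0] * n for _ in range(n)]
    for s in range(n):
        t = min(s + 1, n - 1)
        advance[s][t] += p
        advance[s][s] += 1.0 - p
        stay[s][s] = 1.0
    return [advance, stay]


def structure(P):
    """Pattern of nonzero transition probabilities (assumed notion of structure)."""
    return tuple(tuple(tuple(p > 0.0 for p in row) for row in A) for A in P)


def is_proper(P, policy, goal):
    """True if the goal is reachable from every state under the policy.

    For a finite plant and a stationary policy this is equivalent to reaching
    the goal with probability 1, and it depends only on the structure.
    """
    n = len(policy)
    # Reverse edges of the support graph induced by the policy.
    preds = [[] for _ in range(n)]
    for s in range(n):
        if s == goal:
            continue
        for t, p in enumerate(P[policy[s]][s]):
            if p > 0.0:
                preds[t].append(s)
    # Backward breadth-first search from the goal.
    seen = {goal}
    queue = deque([goal])
    while queue:
        t = queue.popleft()
        for s in preds[t]:
            if s not in seen:
                seen.add(s)
                queue.append(s)
    return len(seen) == n


# Keep one representative plant per distinct structure among the samples.
# The samples are assumed to cover every structure of this toy family.
samples = [0.1, 0.3, 0.5, 0.9, 1.0]
representatives = {}
for p in samples:
    plant = make_plant(p)
    representatives.setdefault(structure(plant), plant)

goal = 3
always_advance = [0, 0, 0, 0]
waits_at_1 = [0, 1, 0, 0]
robust_ok = all(is_proper(P, always_advance, goal) for P in representatives.values())
robust_bad = all(is_proper(P, waits_at_1, goal) for P in representatives.values())
```

In this toy family, every plant with 0 < p < 1 shares one structure and p = 1 has another, so checking two representatives is enough to decide properness for the whole interval. Grouping by sampled structures is a simplification; a faithful treatment would enumerate the structures analytically.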
© 2016 The Society of Instrument and Control Engineers