The Proceedings of Conference of Kanto Branch
Online ISSN : 2424-2691
ISSN-L : 2424-2691
2010.16
Session ID : 11912
Conference information
11912 Proposal and Evaluation of the Improved Penalty Avoiding Rational Policy Making algorithm with a Learning Mechanism of Threshold of the Penalty Basis Function
Ryohei KobayashiKazuteru MiyazakiHiroaki Kobayashi
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract
Penalty Avoiding Rational Policy Making algorithm (PARP) based on Profit Sharing method and was planed to learn a penalty avoiding policy. PARP is improved to save memories and to cope with uncertainties. The efficiency of the Improved Penalty Avoiding Rational Policy Making algorithm is influenced by threshold of the penalty basis function γ significantly. Up to now, it is necessary to set appropriate γ through a preliminary experiment. In this paper, we propose a technique for learning γ with the multi start method. The proposal technique is applied to a keepaway task that is a benchmark in a robotic soccer game, to confirm the effectiveness.
Content from these authors
© 2010 The Japan Society of Mechanical Engineers
Previous article Next article
feedback
Top