IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Soft Computing and Learning>
A Rational Forgetting-Type Profit Sharing Reinforcement Learning Method
幸若 完壮, 渡辺 浩太, 五十嵐 一
Journal, free access

2012, Vol. 132, No. 3, pp. 448-454

Abstract
This paper proposes the Rationally oriented Forgettable Profit Sharing method (RFPS) for reinforcement learning. Although Profit Sharing (PS) performs well in real environments, its learning is often slow in long-term tasks because it is difficult to determine an adequate discount rate that satisfies Miyazaki's rationality theorem. Several rationality-relaxed PS methods work well for such tasks; however, they may reinforce many irrational loops. The proposed method restores rationality by forgetting the reinforced irrational loops. It can be easily combined with ordinary PS methods and performs well in long-term tasks. Simulation results show that the proposed method learns more efficiently than conventional PS methods.
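
To make the discount-rate difficulty concrete, the following is a minimal Python sketch of ordinary Profit Sharing credit assignment with a geometrically decreasing reinforcement function. It is not the RFPS method, whose forgetting mechanism the abstract does not specify. The function names, the dictionary of rule weights, and the parameter `ratio` are illustrative assumptions. The check in `satisfies_rationality` assumes the commonly cited form of the rationality condition, L * (f_i + ... + f_{W-1}) < f_{i-1} for every i >= 1, where f_0 is the credit given to the rule fired just before the reward and L is the maximum number of conflicting rules in a state.

```python
def geometric_credits(reward, episode_length, ratio):
    """Return credits f_0, f_1, ...; f_0 belongs to the rule fired just before the reward."""
    return [reward * ratio ** i for i in range(episode_length)]


def satisfies_rationality(credits, num_conflicting):
    """Check the assumed condition L * sum(f_i .. f_last) < f_{i-1} for every i >= 1."""
    return all(
        num_conflicting * sum(credits[i:]) < credits[i - 1]
        for i in range(1, len(credits))
    )


def profit_sharing_update(weights, episode, reward, ratio):
    """Add credit to every (state, action) rule of the episode, newest rule first."""
    credits = geometric_credits(reward, len(episode), ratio)
    for credit, rule in zip(credits, reversed(episode)):
        weights[rule] = weights.get(rule, 0.0) + credit
    return weights


if __name__ == "__main__":
    # Hypothetical setting: 4 actions per state, so at most L = 3 conflicting rules.
    print(satisfies_rationality(geometric_credits(1.0, 10, 0.25), 3))
    print(satisfies_rationality(geometric_credits(1.0, 10, 0.5), 3))
    # Credit reaching a rule fired 50 steps before the reward under ratio 0.25.
    print(geometric_credits(1.0, 51, 0.25)[-1])
```

Under these assumptions, a ratio small enough to pass the check leaves almost no credit for rules fired many steps before the reward, which is one way to read the slow learning in long-term tasks described above. A larger ratio keeps more credit for early rules but fails the check, which corresponds to the rationality-relaxed setting in which irrational loops may be reinforced.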
© 2012 The Institute of Electrical Engineers of Japan