人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
論文
合理的政策形成アルゴリズムの連続値入力への拡張
宮崎 和光木村 元小林 重信
著者情報
ジャーナル フリー

2007 年 22 巻 3 号 p. 332-341

詳細
抄録

Reinforcement Learning is a kind of machine learning. We know Profit Sharing, the Rational Policy Making algorithm (RPM), the Penalty Avoiding Rational Policy Making algorithm and PS-r* to guarantee the rationality in a typical class of the Partially Observable Markov Decision Processes. However they cannot treat continuous state spaces. In this paper, we present a solution to adapt them in continuous state spaces. We give RPM a mechanism to treat continuous state spaces in the environment that has the same type of a reward. We show the effectiveness of the proposed method in numerical examples.

著者関連情報
© 2007 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事
feedback
Top