人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
論文
Profit Sharingの不完全知覚環境下への拡張 : PS-r*の提案と評価
宮崎 和光小林 重信
著者情報
ジャーナル フリー

2003 年 18 巻 5 号 p. 286-296

詳細
抄録
We know the rationality theorem of Profit Sharing(PS) [Miyazaki 94, Miyazaki 99b] and the Rational Policy Making algorithm(RPM) [Miyazaki 99a] to guarantee the rationality in a typical class of Partially Observable Markov Decision Processes (POMDPs). In this paper, we focus on the whole class of POMDPs and propose PS-r that is an algorithm connected PS and RPM with random selection. In the first, we have analyzed the behavior of PS-r. We have derived that the maximum value of the step to get a reward by PS-r divided by that of random selection is
$({\Large r\frac{(1+\frac{M-1}{r})^n}{M^n}})$ where $n$
is the maximum number of state that senses same state due to the agent's sensory limitation and M is the number of actions. Furthermore, we propose PS-r* that can improve the behavior of PS-r. Through numerical examples, we conform the effectiveness of PS-r*.
著者関連情報
© 2003 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事
feedback
Top