SCIS & ISIS
SCIS & ISIS 2006
セッションID: TH-H2-2
会議情報

TH-H2 Decision Making and Discrete Event System
How to Reach Pareto Optimal Equilibrium in Double-bind Prisoner's Dilemma Game
*Shihomi WadaKeiji Suzuki
著者情報
会議録・要旨集 フリー

詳細
抄録
In this paper, we examine how to reach Pareto optimal equilibrium in Double-bind Prisoner's Dilemma game (hereinafter referred to as DbPD game). DbPD game are made by adding dominated strategy to 2 by 2 prisoner's dilemma game (Wada and Suzuki (2006)), and the problem of playing DbPD game or not also becomes a dilemma. Though our agents are all set as rational, selfish, reinforcement-learning based model agents, we discover that they learn to avoid using Nash equilibrium in early round. After finishing to learn that, then they start to learn cooparating. Through these process, our agents are possible to contribute each other in DbPD game.
著者関連情報
© 2006 Japan Society for Fuzzy Theory and Intelligent Informatics
前の記事 次の記事
feedback
Top