Host: Japan SOciety for Fuzzy Theory and intelligent informatics
Co-host: The Korea Fuzzy Logic and Intelligent Systems Society, IEEE Computational Intelligence Society, The International Fuzzy Systems Association, 21th Century COE Program "Creation of Agent-Based Social Systems Sciences"
In this paper, we examine how to reach Pareto optimal equilibrium in Double-bind Prisoner's Dilemma game (hereinafter referred to as DbPD game). DbPD game are made by adding dominated strategy to 2 by 2 prisoner's dilemma game (Wada and Suzuki (2006)), and the problem of playing DbPD game or not also becomes a dilemma. Though our agents are all set as rational, selfish, reinforcement-learning based model agents, we discover that they learn to avoid using Nash equilibrium in early round. After finishing to learn that, then they start to learn cooparating. Through these process, our agents are possible to contribute each other in DbPD game.