主催: 人工知能学会
会議名: 第72回 先進的学習科学と工学研究会
回次: 72
開催地: 慶応義塾大学 日吉キャンパス 来往舎
開催日: 2014/11/22
p. 04-
In this study, we consider a nonzero-sum bargaining game with imperfect information in which two players exchange resources to achieve individual goals. One of the two players learns how to achieve the goal through repeated games using Q-learing. Experiment results show that a learning player can learn winning strategies against cooperative and non-cooperative players.