A Study on Multiplicative Weight Update with Mutation in Two-Player Zero-Sum Extensive-Form Games

坂本 充生; 阿部 拳之; 蟻生 開人; 岩崎 敦

doi:10.11517/pjsai.JSAI2023.0_2T4GS502

Abstract

We propose a multiplicative weight update algorithm with mutations for two-player zero-sum extensive-form games, an important class of models for decision-making under imperfect information. Although equilibria of these games can be computed by linear programming, that approach becomes impractical for large-scale games such as poker. Learning algorithms that find an (approximate) equilibrium have been proposed to address this issue. However, most existing algorithms converge to a Nash equilibrium only in their time-averaged strategies. In normal-form games, it has been shown that introducing mutations allows equilibrium strategies to be learned without taking time averages. Inspired by this result, we propose Dilated Mutant Multiplicative Weight Update, which introduces mutations into learning in extensive-form games. Experimental results show that the proposed method learns equilibrium strategies without computing time averages in multiple settings.
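As an illustration of the normal-form idea the abstract builds on, the following is a minimal sketch of a multiplicative weight update with a mutation term, run on rock-paper-scissors. The abstract does not state the update rule, so everything here is an assumption: the form of the mutation term (a pull toward a uniform reference strategy, scaled by `mu`), the function name `mutant_mwu`, and the parameter values. It is not the Dilated Mutant Multiplicative Weight Update itself, which operates on extensive-form games.

```python
import numpy as np

def mutant_mwu(A, x0, y0, eta=0.05, mu=0.1, steps=5000):
    """Return the last-iterate strategies of a mutation-perturbed MWU.

    A is the row player's payoff matrix; the column player receives -A.
    The mutation term mu * (c / x - 1) is an assumed, illustrative form.
    """
    x = np.asarray(x0, dtype=float)
    y = np.asarray(y0, dtype=float)
    cx = np.full_like(x, 1.0 / x.size)  # reference strategy (assumed uniform)
    cy = np.full_like(y, 1.0 / y.size)
    for _ in range(steps):
        # Expected payoff of each action plus the mutation perturbation,
        # both computed from the current (not yet updated) strategies.
        gx = A @ y + mu * (cx / x - 1.0)
        gy = -A.T @ x + mu * (cy / y - 1.0)
        # Multiplicative update followed by renormalization.
        x = x * np.exp(eta * gx)
        y = y * np.exp(eta * gy)
        x /= x.sum()
        y /= y.sum()
    return x, y

# Rock-paper-scissors payoffs for the row player.
rps = np.array([[0.0, -1.0, 1.0],
                [1.0, 0.0, -1.0],
                [-1.0, 1.0, 0.0]])

x_last, y_last = mutant_mwu(rps, [0.6, 0.3, 0.1], [0.2, 0.2, 0.6])
print(x_last, y_last)
```

No time average is taken: the function returns only the final iterate. In this symmetric example the uniform reference strategy coincides with the equilibrium; for other games or reference strategies, the point reached by such a perturbed update may only be near an equilibrium. Setting `mu=0` recovers plain multiplicative weight update.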
