ぷよぷよAIにおける並列Actorと優先度付き経験再生を用いた深層強化学習

森 春介; 越野 亮

doi:10.14864/fss.40.0_883

Abstract

“Puyo Puyo AI” includes rule-based methods and those using correlation matrices, but both are inferior to human players. Additionally, “Puyo Puyo AI” using deep reinforcement learning have shown insufficient performance. This study focuses on improving the performance of “Puyo Puyo AI” using deep reinforcement learning and proposes a method employing parallel actors and prioritized experience replay. Experimental results demonstrate that the proposed method achieved an average maximum chain length of 6.243 and an average score of 33,114, surpassing previous studies utilizing deep reinforcement learning.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Conference information

Register with J-STAGE for free!