Host: The Japanese Society for Artificial Intelligence
Name : 34th Annual Conference, 2020
Number : 34
Location : Online
Date : June 09, 2020 - June 12, 2020
In the field of board game AI, a technique that combines neural networks and tree search has attracted attention. In order to perform a tree search, the transition rules of the board need to be known. Researches on learning the transition rules of the state are also actively pursued as model-based reinforcement learning, and MuZero shows high performance in games such as Atari, Go, Shogi, and chess. In this study, we redefine MuZero's algorithm as supervised learning and examine a method to apply it to the more complicated game "Gyakuten Othellonia". When the MuZero algorithm was applied directly to "Gyakuten Othellonia", the performance is partially improved, but it is shown that errors in transition prediction could adversely affect the tree search. The analysis suggests that a tree search to deal with uncertainty could improve performance further.