Abstract
Estimation of Distribution Algorithms for Reinforcement Learning Problems, which we have proposed, are a novel approach to realizing autonomous learning agents. In this study, we propose a learning data correction mechanism to accelerate learning. The mechanism first detects redundant sequences of state-action pairs and then removes them. Several experimental results show the effectiveness of the proposed method.
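
As a rough illustration of the detect-then-remove idea, the sketch below treats a sequence as redundant when it leaves a state and later returns to the same state, so the steps in between contribute nothing to progress. This definition of redundancy, the function name `remove_loops`, and the `(state, action)` tuple encoding of an episode are assumptions made for the example; they are not taken from the method described here.

```python
from typing import Dict, Hashable, List, Tuple

Step = Tuple[Hashable, Hashable]  # (state, action); encoding assumed for illustration


def remove_loops(episode: List[Step]) -> List[Step]:
    """Drop every subsequence that leaves a state and later returns to it.

    Assumption: "redundant" means a loop in the state sequence. The
    mechanism in the paper may use a different criterion.
    """
    corrected: List[Step] = []
    first_index: Dict[Hashable, int] = {}  # state -> its position in `corrected`
    for state, action in episode:
        if state in first_index:
            # The state was visited before: discard it and everything after it,
            # since those steps only led back to the same state.
            cut = first_index[state]
            for old_state, _ in corrected[cut:]:
                del first_index[old_state]
            del corrected[cut:]
        first_index[state] = len(corrected)
        corrected.append((state, action))
    return corrected


episode = [("A", "right"), ("B", "up"), ("C", "down"), ("B", "right"), ("D", "stay")]
print(remove_loops(episode))
```

In this example the agent moves from B to C and back to B, so the pairs `("B", "up")` and `("C", "down")` are dropped and only the action finally taken from B is kept. Each state appears at most once in the corrected sequence, which shortens the data used for learning.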