強化学習問題のための分布推定アルゴリズムにおける学習データ補正の検討

半田 久志; 西村 徳栄

doi:10.11509/sci.SCI10.0.297.0

Abstract

Estimation of Distribution Algorithms for Reinforcement Learning Problems, proposed by us, are a novel approach for realizing autonomous learning agents. In this study, we proposed a learning data correction mechanism for acceralating learning speed. The mechanism firstly detect redundant sequences of state-action pairs, then remove them. Several experimental results show the effectiveness of the proposed method.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!