Adapting Multiplex-Learner-Based Reinforcement Learning to Learning Environments with Delayed Reward Delivery

西澤 智恵子; 松井 博和; 野村 由司彦

doi:10.1541/ieejeiss.139.847

Abstract

In this paper, we extend the previously proposed reinforcement learning with multiplex learning spaces to environments in which rewards are delivered only after a delay. Concretely, we prepare multiple learning spaces, one for each delay time taken at equal intervals within the predicted range of delays. We evaluated the method in simulation and compared it with ordinary reinforcement learning. The proposed method acquired the best policy, whereas the ordinary method did not.
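The idea of one learning space per candidate delay can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy two-action environment, the fixed action pattern, the learning rate, and the rule for picking the best learner (largest gap between value estimates) are all assumptions made here for illustration, and the names `candidate_delays`, `DelayedCreditLearner`, and `run_demo` are hypothetical.

```python
from collections import deque

# Fixed action pattern (assumption): shifting it by a nonzero amount decorrelates it.
PATTERN = [1, 1, 1, 0, 1, 0, 0]


def candidate_delays(d_min, d_max, step):
    """Equally spaced candidate delays within the predicted range [d_min, d_max]."""
    return list(range(d_min, d_max + 1, step))


class DelayedCreditLearner:
    """One learning space: credits each reward to the action taken `delay` steps earlier."""

    def __init__(self, n_actions, delay, alpha=0.1):
        self.delay = delay
        self.alpha = alpha
        self.q = [0.0] * n_actions
        # Keeps the current action and the `delay` actions before it.
        self.history = deque(maxlen=delay + 1)

    def observe(self, action, reward):
        self.history.append(action)
        if len(self.history) == self.delay + 1:
            credited = self.history[0]  # action taken `delay` steps ago
            self.q[credited] += self.alpha * (reward - self.q[credited])


def run_demo(true_delay=2, steps=210):
    """Toy environment (assumption): action 1 pays 1.0, but `true_delay` steps later."""
    learners = {d: DelayedCreditLearner(2, d) for d in candidate_delays(0, 4, 1)}
    actions = []
    for t in range(steps):
        action = PATTERN[t % len(PATTERN)]
        actions.append(action)
        # The reward observed now was caused by the action `true_delay` steps ago.
        reward = float(actions[t - true_delay]) if t >= true_delay else 0.0
        for learner in learners.values():
            learner.observe(action, reward)
    # Illustrative selection rule (assumption): the learner whose value
    # estimates separate the two actions most clearly.
    best = max(learners, key=lambda d: learners[d].q[1] - learners[d].q[0])
    return best, learners


if __name__ == "__main__":
    best, learners = run_demo(true_delay=2)
    print("best candidate delay:", best)
    for d, learner in learners.items():
        print(d, [round(v, 3) for v in learner.q])
```

In this toy setting, only the learner whose assumed delay matches the environment's delay credits every reward to the action that caused it, so its estimates separate cleanly. The other learners mix rewards across actions, which is roughly the situation an ordinary single learner would face.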
