Empirical Evaluation of an Average Reward Learning Method Handling Simultaneous Learning Episodes in a Dynamic Environment with Emerging Tasks

ヴァルディヴィエルソ アレックス; 宮本 俊幸

doi:10.11509/sci.SCI10.0.259.0

第54回システム制御情報学会研究発表講演会

セッションID: F234

DOI https://doi.org/10.11509/sci.SCI10.0.259.0

会議情報

主催: システム制御情報学会

Empirical Evaluation of an Average Reward Learning Method Handling Simultaneous Learning Episodes in a Dynamic Environment with Emerging Tasks

*ヴァルディヴィエルソ　アレックス, 宮本俊幸

著者情報

キーワード: 強化学習, 平均報酬, 学習エピソード, 動的環境

会議録・要旨集フリー

詳細

抄録

Average reward learning methods (ARLMs) show a poor performance in environments in which they must deal with several tasks simultaneously. In this paper we present the evaluation of an ARLM adapted to handle simultaneous learning episodes. We compare its performance against a conventional ARLM in a multicar elevator system.

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）