Adapting Multiple-Learner-Based Reinforcement Learning to Learning Environments with Delayed Reward Delivery

西澤 智恵子; 松井 博和; 野村 由司彦

doi:10.1541/ieejeiss.139.847

Abstract

In this paper, we extend the proposed reinforcement learning method with multiplex learning spaces to environments in which rewards are delivered only after a delay. Concretely, we prepare multiple learning spaces, one for each delay time sampled at equal intervals within the predicted range of delays. We compared the method with an ordinary reinforcement learning method in simulation. The proposed method acquired the best policy, whereas the ordinary method did not.
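The abstract does not give implementation details, so the following is only a minimal sketch of one way the idea could be read, not the authors' method. It assumes tabular Q-learning, integer delays measured in time steps, and that each learning space credits an incoming reward to the transition observed `delay` steps earlier. The names `candidate_delays`, `DelayedQLearner`, and `MultiplexLearner` are hypothetical, and how the learning spaces are combined to select an action is deliberately left out because the abstract does not state it.

```python
from collections import deque

import numpy as np


def candidate_delays(d_min, d_max, step):
    """Equally spaced candidate delays within the predicted range (inclusive)."""
    return list(range(d_min, d_max + 1, step))


class DelayedQLearner:
    """Tabular Q-learning that credits a reward to the transition `delay` steps earlier.

    Assumption: this credit-assignment rule is illustrative only.
    """

    def __init__(self, n_states, n_actions, delay, alpha=0.1, gamma=0.9):
        self.q = np.zeros((n_states, n_actions))
        self.delay = delay
        self.alpha = alpha
        self.gamma = gamma
        # Keep just enough past transitions to look back `delay` steps.
        self.history = deque(maxlen=delay + 1)

    def observe(self, state, action, reward, next_state):
        self.history.append((state, action, next_state))
        if len(self.history) <= self.delay:
            return  # not enough history yet to assign this reward
        # Oldest stored transition is the one taken `delay` steps ago.
        s, a, s_next = self.history[0]
        target = reward + self.gamma * self.q[s_next].max()
        self.q[s, a] += self.alpha * (target - self.q[s, a])


class MultiplexLearner:
    """One learning space (Q-table) per candidate delay, all fed the same experience."""

    def __init__(self, n_states, n_actions, delays):
        self.learners = [DelayedQLearner(n_states, n_actions, d) for d in delays]

    def observe(self, state, action, reward, next_state):
        for learner in self.learners:
            learner.observe(state, action, reward, next_state)


# Usage: a reward arriving at step 2 is credited differently by each learning space.
delays = candidate_delays(0, 4, 2)
multiplex = MultiplexLearner(n_states=5, n_actions=2, delays=delays)
for t in range(3):
    multiplex.observe(t, 0, 1.0 if t == 2 else 0.0, t + 1)
```

In this toy run, the learning space for delay 2 credits the reward to the action taken at step 0, the one for delay 0 credits it to the action at step 2, and the one for delay 4 has not yet seen enough history to update.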
