ロボティクス・メカトロニクス講演会講演概要集
Online ISSN : 2424-3124
セッションID: 1P1-04b3
会議情報

多重学習器を用いる強化学習
―報酬付与に遅れがある学習環境への適用―
西澤 智恵子松井 博和
著者情報
会議録・要旨集 フリー

詳細
抄録

In this report, we apply the proposed reinforcement learning with multiplex learning spaces to an environment with a delay time to get a reward and compare it with an ordinary reinforcement learning without considering the delay in experimental simulations. We adapt the proposed method by multiplexing some learning spaces corresponding to a different delay time each other. As the result of an experiment, the ordinary method couldn't get the best policy, but the proposed method could get it effectively.

著者関連情報
© 2016 一般社団法人 日本機械学会
前の記事 次の記事
feedback
Top