主催: 一般社団法人 日本機械学会
会議名: ロボティクス・メカトロニクス 講演会2016
開催日: 2016/06/08 - 2016/06/11
In this report, we apply the proposed reinforcement learning with multiplex learning spaces to an environment with a delay time to get a reward and compare it with an ordinary reinforcement learning without considering the delay in experimental simulations. We adapt the proposed method by multiplexing some learning spaces corresponding to a different delay time each other. As the result of an experiment, the ordinary method couldn't get the best policy, but the proposed method could get it effectively.