電気学会論文誌C(電子・情報・システム部門誌)
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<知能,ロボティクス>
準エキスパート集団からのアンサンブル逆強化学習
冨田 真司濱津 文哉濱上 知樹
著者情報
ジャーナル フリー

2017 年 137 巻 4 号 p. 667-673

詳細
抄録

Ensemble inverse reinforcement learning from semi-experts' behavior is proposed. In many inverse reinforcement learning (IRL) problem, the expert agent which has ideal rewards for achieving the goal is supposed to be existing. However, in real world problem, the expert is not always observed. Moreover, the estimated reward function includes the bias depending on its inherent behavior if the reward for achieving the goal task is estimated from one agent. In order to overcome the limitation of IRL, we apply Adaboost, one of ensemble and boosting approach, to IRL and integrate estimated reward functions from semi-expert agents. To confirm the effectiveness of the proposed method in the grid world including incomplete areas, we compared the results of reinforcement learning using estimated reward functions and integrated reward function by simulation. The simulation result shows the proposed method can estimate the reward adaptively.

著者関連情報
© 2017 電気学会
前の記事 次の記事
feedback
Top