The Proceedings of JSME annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
Session ID: 2A2-J07

Imitation Learning Enabling Safe Exploration Using a VAE-Based Anomaly Detector
*藤石 秀仁, 小林 泰介, 杉本 謙二

Abstract

Behavioral cloning, one of the imitation learning methods, enables a robot to imitate an expert's policy from demonstrations of the expert's states and actions. Because the robot does not need to interact with the environment, robot failures are avoided. In general, however, expert action information is difficult to obtain. Behavioral cloning from observation allows the robot to learn the policy without expert actions, but it requires a few interactions with the environment to infer them, which leaves a risk of robot failure. Detecting whether the situation the robot faces is safe or dangerous is an effective way to prevent such dangerous interactions. Assuming that the expert's demonstrations visit only safe states, this paper proposes a new outlier detector that uses a variational autoencoder trained on the expert's data. Because all the data used for training are mapped to a limited space, the detector can easily identify unexperienced and dangerous scenes. In simulations, the proposed method improved policy performance while keeping the number of robot failures limited.
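
The abstract does not state how the detector scores a state, so the following Python fragment is only a minimal sketch under stated assumptions: the anomaly score is taken to be the KL divergence between the encoder's Gaussian output and a standard normal prior, and the threshold is taken to be a quantile of the scores of the expert states. The names `kl_to_standard_normal`, `calibrate_threshold`, `is_outlier` and `toy_encoder` are hypothetical, and `toy_encoder` is a stand-in for a variational autoencoder trained on expert data, not the authors' model.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
# Hypothetical encoder interface: state -> (latent mean, latent log-variance).
Encoder = Callable[[Vector], Tuple[List[float], List[float]]]


def kl_to_standard_normal(mu: Vector, logvar: Vector) -> float:
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), the usual VAE regulariser."""
    return 0.5 * sum(m * m + math.exp(lv) - 1.0 - lv for m, lv in zip(mu, logvar))


def calibrate_threshold(encoder: Encoder, expert_states: Sequence[Vector],
                        quantile: float = 0.95) -> float:
    """Assumed rule: take the score found at the given quantile of expert scores."""
    scores = sorted(kl_to_standard_normal(*encoder(s)) for s in expert_states)
    index = min(len(scores) - 1, int(quantile * len(scores)))
    return scores[index]


def is_outlier(encoder: Encoder, state: Vector, threshold: float) -> bool:
    """Flag a state whose score exceeds the threshold calibrated on expert data."""
    return kl_to_standard_normal(*encoder(state)) > threshold


def toy_encoder(state: Vector) -> Tuple[List[float], List[float]]:
    """Stand-in for a trained VAE encoder: identity mean, unit variance."""
    return list(state), [0.0] * len(state)


# Toy "expert" states clustered around the origin (illustrative data only).
expert_states = [[0.1 * i, -0.05 * i] for i in range(-10, 11)]
threshold = calibrate_threshold(toy_encoder, expert_states)

print(is_outlier(toy_encoder, [0.2, -0.1], threshold))  # state close to the expert data
print(is_outlier(toy_encoder, [5.0, 5.0], threshold))   # state far from the expert data
```

In an exploration loop of the kind the abstract describes, such a check could be run before each interaction so that the robot stops or resets when a state is flagged. Whether the proposed method uses this score, a reconstruction error, or another quantity is not specified in the abstract.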

© 2020 The Japan Society of Mechanical Engineers