Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

Daiko KISHIKAWA; Sachiyo ARAI

doi:10.11517/pjsai.JSAI2021.0_3N3IS2e05

抄録

Recently, inverse reinforcement learning, which estimates the reward from an expert's trajectories, has been attracting attention for imitating complex behaviors and estimating intentions. This study proposes a novel deep inverse reinforcement learning method that combines LogReg-IRL, an IRL method based on linearly solvable Markov decision process, and ALOCC, an adversarial one-class classification method. The proposed method can quickly learn rewards and state values without reinforcement learning executions or trajectories to be compared. We show that the proposed method obtains a more expert-like gait than LogReg-IRL in the BipedalWalker task through computer experiments.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

Robot with KANSEI? or KANSEI with Robot?
Smooth gait transition in hardware-efficient CPG model based on asynchronous coupling of cellular automaton phase oscillators
難計測部をもつ空調設備as-built3次元モデル構築のための最適スキャナ配置計画（第1報）
Inclusion of hepatitis C virus testing in National Health Screening to accelerate HCV elimination in South Korea
魚類行動パターンの画像解析に基づく水質異常判定

責任著者(Corresponding author)

会議情報

J-STAGEへの登録はこちら（無料）