Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

Daiko KISHIKAWA; Sachiyo ARAI

doi:10.11517/pjsai.JSAI2021.0_3N3IS2e05

35th (2021)

Session ID : 3N3-IS-2e-05

DOI https://doi.org/10.11517/pjsai.JSAI2021.0_3N3IS2e05

Conference information

Host: The Japanese Society for Artificial Intelligence

Name : The 35th Annual Conference of the Japanese Society for Artificial Intelligence

Number : 35

Location : [in Japanese]

Date : June 08, 2021 - June 11, 2021

Deep Inverse Reinforcement Learning with Adversarial One-Class Classification

*Daiko KISHIKAWA, Sachiyo ARAI

Author information

Keywords: Deep Inverse Reinforcement Learning, One-Class Classification, Adversarial Learning

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

Recently, inverse reinforcement learning, which estimates the reward from an expert's trajectories, has been attracting attention for imitating complex behaviors and estimating intentions. This study proposes a novel deep inverse reinforcement learning method that combines LogReg-IRL, an IRL method based on linearly solvable Markov decision process, and ALOCC, an adversarial one-class classification method. The proposed method can quickly learn rewards and state values without reinforcement learning executions or trajectories to be compared. We show that the proposed method obtains a more expert-like gait than LogReg-IRL in the BipedalWalker task through computer experiments.

Corresponding author

Conference information

Register with J-STAGE for free!