Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 3D1-GS-2-02
Conference information

Learning Optimal Polices through Interactive Imitation Learning
*Yuki NAKAGUCHIDai KUBOTA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Imitation learning solves reinforcement learning problems with reference to some teacher information. While the typical method of behavioral cloning could not be applied to long-term tasks due to covariate shifts, interactive imitation learning solves this problem by obtaining online feedback from a teacher model. On the other hand, in the existing methods of interactive imitation learning, students could not learn the optimal policies when the teacher differed from the optimal for the student. In this study, we propose a novel method to solve this problem while providing an organized review of interactive imitation learning.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top