Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
37th (2023)
Session ID : 2F4-GS-5-02
Conference information

Policy Teaching by Agent-Based Intervention in Reinforcement Learning Processes
*Mikoto KUDOYouhei AKIMOTO
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

Autonomous learning agents using online reinforcement learning learn strategies sequentially from state observations obtained from interactions with the environment and internally defined rewards. However, if the state transition changes due to the intervention of other agents, the agent may not be able to learn the strategy it originally wanted to learn or may be induced to learn a specific strategy. In this study, we propose an intervention algorithm and investigate its properties for such an intervention attack on the reinforcement learning process. We formulate the intervention by the intervention agent to the protagonist agent as a 2-player Markov Game, and find that when the protagonist is induced to learn a strategy that maximizes the reward intended by the interventionist, the intervention can fail even in situations where the protagonist always obtains the optimal strategy for his reward. Another problem arises in situations where the protagonist is in the process of learning, for which we devised an improved algorithm.

Content from these authors
© 2023 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top