強化学習過程へのエージェント型介入による方策学習の誘導

工藤 ミコト; 秋本 洋平

doi:10.11517/pjsai.JSAI2023.0_2F4GS502

Abstract

Autonomous learning agents using online reinforcement learning learn strategies sequentially from state observations obtained from interactions with the environment and internally defined rewards. However, if the state transition changes due to the intervention of other agents, the agent may not be able to learn the strategy it originally wanted to learn or may be induced to learn a specific strategy. In this study, we propose an intervention algorithm and investigate its properties for such an intervention attack on the reinforcement learning process. We formulate the intervention by the intervention agent to the protagonist agent as a 2-player Markov Game, and find that when the protagonist is induced to learn a strategy that maximizes the reward intended by the interventionist, the intervention can fail even in situations where the protagonist always obtains the optimal strategy for his reward. Another problem arises in situations where the protagonist is in the process of learning, for which we devised an improved algorithm.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!