Improvement of the Monte Carlo Version of the RGoal Algorithm

一杉 裕志; 中田 秀基; 高橋 直人; 竹内 泉; 佐野 崇

doi:10.11517/jsaisigtwo.2023.AGI-026_50

Abstract

We previously proposed RGoal, a hierarchical reinforcement learning algorithm that allows recursive subroutine calls. In this paper, we revise the definition of the reference value used to compute relative values in the Monte Carlo version of RGoal, in order to stabilize learning when subroutines are shared between different tasks. We implemented the revised algorithm and confirmed that it works on several test tasks.
