強化学習における方策転移度合い決定法の開発

大津 亮二; 河野 仁

doi:10.1299/jsmermd.2019.1A1-P02

Abstract

This paper presents decision method of policy reusing ratio of transfer learning in reinforcement learning based on gradient descent method. In recent years, learning robot system has been discussed for the actual applications. To reduce the learning time, transfer learning framework is proposed and the method is knowledge reusing mechanism. In particular, effectiveness of transfer evaluation method such as a transfer surface is proposed in reinforcement learning and adjustment method of transferring ratio is also proposed. However, decision of value of transferring ratio is depends on human intuition and experience. In this paper, automatic transferring ratio estimation method is proposed based on gradient descent method with random initial value and statistics in multiple estimation trials, further evaluation of proposed method in two different transfer surface.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!