Value-Based Selection of Knowledge in Transfer Learning

小谷 直樹

doi:10.5687/iscie.28.275

Abstract

This paper aims to reduce the amount of knowledge an agent holds in transfer learning so that its learning performance does not degrade. In transfer or multitask reinforcement learning problems, the agent reuses policies learned in past tasks in order to solve unknown tasks efficiently. As a result, the agent accumulates a large number of state-action pairs as knowledge. This accumulation, however, causes the amount of knowledge to grow explosively and the learning speed to fall. This paper proposes a method for reducing the amount of knowledge on the basis of value. The effectiveness of the proposed method was verified in a simulation of the reaching problem for a multi-link robot arm. The proposed method reduces both the amount of knowledge and the learning time, and it also improves the learning performance of the agent.
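
The abstract does not spell out the selection rule, so the following is only an illustrative sketch of what reducing knowledge "on the basis of value" could look like. It assumes that knowledge is stored as a table mapping state-action pairs to learned values and that pairs below a fixed threshold are discarded. The function name `prune_by_value`, the threshold, and the sample values are all hypothetical and are not taken from the paper.

```python
from typing import Dict, Tuple

# Hypothetical representation: a state-action pair mapped to its learned value.
State = int
Action = int
Knowledge = Dict[Tuple[State, Action], float]


def prune_by_value(knowledge: Knowledge, threshold: float) -> Knowledge:
    """Keep only the state-action pairs whose value is at least `threshold`.

    This fixed-threshold rule is an assumption made for illustration;
    the paper's actual selection criterion may differ.
    """
    return {pair: value for pair, value in knowledge.items() if value >= threshold}


# Made-up values standing in for knowledge transferred from past tasks.
transferred: Knowledge = {
    (0, 0): 0.9,
    (0, 1): 0.05,
    (1, 0): 0.4,
    (1, 1): 0.01,
}

selected = prune_by_value(transferred, threshold=0.1)
print(f"kept {len(selected)} of {len(transferred)} state-action pairs")
```

Under these assumptions, the pruned table is what the agent would carry into a new task, so the amount of reused knowledge shrinks as the threshold rises.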