Multi-Task Deep Reinforcement Learning for 3D Control Tasks Using Evolutionary Computation and Policy Gradient Learning

今井 翔太; 清 雄一; 田原 康之; 大須賀 昭彦

doi:10.11517/pjsai.JSAI2019.0_4Rin104

Abstract

In deep reinforcement learning, training is difficult to converge when exploration is insufficient or rewards are sparse. Moreover, in some tasks the amount of exploration may be limited. It is therefore considered effective to learn in source tasks beforehand to promote learning in the target tasks. In this research, we propose a method that trains a model that works well on a variety of target tasks by combining an evolutionary algorithm with a policy gradient method. In this method, agents explore multiple environments with a diverse set of neural networks to train a general model using the evolutionary algorithm and the policy gradient method. In the experiments, we assume multiple 3D control source tasks. After training the model with our method on the source tasks, we show how effective the model is on the 3D control target tasks.
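The idea of combining a population of policies with policy gradient updates across several source tasks can be illustrated with a toy sketch. This is not the authors' implementation. It assumes, purely for illustration, that each "task" is a one-dimensional target-reaching problem rather than a 3D control environment, that each policy is a single Gaussian mean rather than a neural network, and that the policy gradient step is plain REINFORCE with a mean baseline. The names `TARGETS`, `SIGMA`, `policy_gradient_step`, `fitness`, and `train` are all hypothetical.

```python
import random

# Hypothetical source tasks: each task rewards actions close to its target.
TARGETS = [-1.0, 0.5, 2.0]
SIGMA = 0.5  # fixed exploration noise of the Gaussian policy (assumption)


def reward(action, target):
    """Negative squared distance to the task's target."""
    return -(action - target) ** 2


def policy_gradient_step(theta, rng, lr=0.02, episodes=32):
    """One REINFORCE update of the policy mean over randomly sampled tasks."""
    samples = []
    for _ in range(episodes):
        target = rng.choice(TARGETS)
        action = rng.gauss(theta, SIGMA)
        samples.append((action, reward(action, target)))
    baseline = sum(r for _, r in samples) / len(samples)
    # Gradient of log N(a; theta, SIGMA^2) w.r.t. theta is (a - theta) / SIGMA^2.
    grad = sum(
        (r - baseline) * (a - theta) / SIGMA ** 2 for a, r in samples
    ) / len(samples)
    return theta + lr * grad


def fitness(theta):
    """Reward of the mean action, averaged over all source tasks."""
    return sum(reward(theta, t) for t in TARGETS) / len(TARGETS)


def train(generations=100, pop_size=8, elite=2, seed=0):
    """Alternate policy gradient updates with evolutionary selection."""
    rng = random.Random(seed)
    population = [rng.uniform(-5.0, 5.0) for _ in range(pop_size)]
    for _ in range(generations):
        # Every member learns from its own sampled experience.
        population = [policy_gradient_step(th, rng) for th in population]
        # Keep the best members and refill with mutated copies of them.
        population.sort(key=fitness, reverse=True)
        elites = population[:elite]
        children = [
            rng.choice(elites) + rng.gauss(0.0, 0.1)
            for _ in range(pop_size - elite)
        ]
        population = elites + children
    return max(population, key=fitness)


if __name__ == "__main__":
    best = train()
    print(f"best policy mean: {best:.3f}, fitness: {fitness(best):.3f}")
```

In this toy setting the single policy that does best across all three tasks has its mean near the average of the targets (0.5), which is the sense in which the population converges toward one general model rather than a specialist for any one task.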
