Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
33rd (2019)
Session ID : 4Rin1-04
Conference information

Multi-task Deep Reinforcement Learning with Evolutionary Algorithm and Policy Gradient Method in 3D Control Tasks
*Shota IMAIYuichi SEIYasuyuki TAHARAAkihiko OHSUGA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In deep rerinforcement learning, it is difficult to converge when the exploration is insufficient or a reward is sparce. Besides, in a specific tasks, the number of exploration may be limited. Therefore, it is considered effective to learn in source tasks previously to promote learning in the target tasks. In this research, we propose a method to train a model that can work well on variety of target tasks with evolutionary algorithm and policy gradient method. In this method, agents explore multiple environments with diverce set of neural networks to train a general model with evolutionary algorithm and policy gradient methid. In the experiments, we assume multiple 3D control source tasks. After the model training with our method in the source tasks, we shows how effective the model is for the 3D Control tasks of the target tasks.

Content from these authors
© 2019 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top