JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
Hierarchical Multi Objective Deep Reinforcement Learning with Accumulator Based Arbitration Mode
Keisuke TAKIGUCHIMasahiko OSAWAMichita IMAI
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2018 Volume 2018 Issue AGI-009 Pages 02-

Details
Abstract

In reinforcement learning, it is difficult to solve high-dimensional multi-objective decision problem. In this paper, we propose a novel hierarchical architecture of deep reinforcement learning with Accumulator Based Arbitration Mode(ABAM). This architecture has some groups which have deep Q-network (DQN) modules respectively and layers containing these modules to solve high-dimensional multi-objective decision problem. And ABAM arbitrates the outputs of modules in one layer by using the outputs of modules in the above layer which has more abstractive objective. By using this architecture, it is able to solve difficult maze task which simple DQN cannot solve.

Content from these authors
© 2018 Authors
Previous article Next article
feedback
Top