人工知能学会第二種研究会資料
Online ISSN : 2436-5556
Hierarchical Multi Objective Deep Reinforcement Learning with Accumulator Based Arbitration Mode
滝口 啓介大澤 正彦今井 倫太
著者情報
研究報告書・技術報告書 フリー

2018 年 2018 巻 AGI-009 号 p. 02-

詳細
抄録

In reinforcement learning, it is difficult to solve high-dimensional multi-objective decision problem. In this paper, we propose a novel hierarchical architecture of deep reinforcement learning with Accumulator Based Arbitration Mode(ABAM). This architecture has some groups which have deep Q-network (DQN) modules respectively and layers containing these modules to solve high-dimensional multi-objective decision problem. And ABAM arbitrates the outputs of modules in one layer by using the outputs of modules in the above layer which has more abstractive objective. By using this architecture, it is able to solve difficult maze task which simple DQN cannot solve.

著者関連情報
© 2018 著作者
前の記事 次の記事
feedback
Top