Proceedings of the JSME Annual Conference on Robotics and Mechatronics (Robomec)
Online ISSN : 2424-3124
Session ID: 2P2-E04

Policy Selection Method for a Reinforcement Learning Agent Based on a Spreading Activation Model
高桑 優作, 河野 仁, 温 文, 神村 明哉, 富田 康治, 鈴木 剛
Conference proceedings and abstracts, free access

Abstract

This paper proposes a policy selection method that lets a reinforcement learning agent learn appropriately in unknown or dynamic environments, based on the spreading activation model from cognitive psychology. The agent stores policies learned in various environments and learns flexibly by partially reusing the policies that suit the current environment. In the proposed method, policies are linked by a directed graph, and a network is constructed by connecting the policies to one another. The agent updates this network according to the environment by repeating the processes of recall, activation, and filtering, learns based on the network, and uses the network for transfer learning. Simulation results show that the agent accomplishes its task by selecting the most suitable of multiple policies with the proposed method. A comparison of the learning efficiency of transfer learning using the proposed method with that of ordinary reinforcement learning demonstrates the usefulness of the proposed method.
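
The recall, activation, and filtering cycle over a directed policy graph can be illustrated with a generic spreading activation routine. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not give the update rule, so the decay factor, the threshold, the edge weights, the policy names, and the functions `spread_activation` and `select_policy` are all hypothetical.

```python
from collections import defaultdict


def spread_activation(edges, recalled, decay=0.5, threshold=0.1, max_steps=3):
    """Generic spreading activation over a directed, weighted graph.

    edges:    {policy: {neighbor_policy: weight}} (hypothetical policy network)
    recalled: {policy: initial_activation}, the policies recalled for the
              current environment (assumed to be given)
    Returns a dict mapping each reached policy to its accumulated activation.
    """
    activation = defaultdict(float, recalled)
    frontier = dict(recalled)
    for _ in range(max_steps):
        # Activation: each active policy passes a decayed, weighted share
        # of its activation along its outgoing edges.
        incoming = defaultdict(float)
        for node, value in frontier.items():
            for neighbor, weight in edges.get(node, {}).items():
                incoming[neighbor] += value * weight * decay
        # Filtering: contributions below the threshold are dropped.
        frontier = {n: v for n, v in incoming.items() if v >= threshold}
        if not frontier:
            break
        for node, value in frontier.items():
            activation[node] += value
    return dict(activation)


def select_policy(activation):
    """Pick the policy with the highest accumulated activation."""
    return max(activation, key=activation.get)


if __name__ == "__main__":
    # Hypothetical stored policies and link weights, for illustration only.
    policy_graph = {
        "corridor": {"open_room": 0.8, "maze": 0.4},
        "open_room": {"maze": 0.5},
    }
    levels = spread_activation(policy_graph, {"corridor": 1.0})
    print(levels)
    print(select_policy(levels))
```

In this sketch the selected policy would then be the one partially reused as the starting point for transfer learning. How the agent actually updates the link weights from the environment is not stated in the abstract and is left out here.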

© 2017 The Japan Society of Mechanical Engineers