Multi-Objective Reinforcement Learning That Updates the Actor Using the Maximum TD Error Among the Critics for Each Objective

長峰 大智; 山田 和明

doi:10.1299/jsmermd.2019.1P2-H09

Abstract

A multi-agent system (MAS) consists of many autonomous agents. Conflicts arise in a MAS because of complex interactions among the agents. Each agent must carry out its task and avoid conflicts at the same time; that is, it has to achieve contradictory objectives. This paper therefore proposes a new approach that uses multi-objective reinforcement learning as the agent's decision-making system. We investigate the efficiency of the proposed approach through a simulation experiment in which two agents pass each other in a narrow path.
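
The title's idea, one critic per objective with the actor updated from the maximum TD error, can be illustrated with a minimal tabular sketch. It is not the authors' implementation. The function names, the state-value tables, the learning rates, and the discount factor are all hypothetical. The abstract does not say whether "maximum" refers to the signed value or the magnitude of the TD error, so the sketch assumes magnitude.

```python
def td_errors(values, state, next_state, rewards, gamma=0.9):
    """Return one TD error per objective: r_k + gamma * V_k(s') - V_k(s)."""
    return [r + gamma * v[next_state] - v[state]
            for v, r in zip(values, rewards)]


def select_td_error(deltas):
    """Pick the TD error with the largest magnitude.

    Assumption of this sketch: "maximum" means largest absolute value.
    """
    return max(deltas, key=abs)


def update(values, prefs, state, action, next_state, rewards,
           gamma=0.9, alpha_critic=0.1, alpha_actor=0.1):
    """Run one actor-critic step with one critic per objective.

    values: list of per-objective state-value tables (hypothetical layout).
    prefs:  actor's action preferences, indexed as prefs[state][action].
    """
    deltas = td_errors(values, state, next_state, rewards, gamma)
    # Each critic learns from its own TD error.
    for v, delta in zip(values, deltas):
        v[state] += alpha_critic * delta
    # The actor is updated with the single selected TD error.
    prefs[state][action] += alpha_actor * select_td_error(deltas)
    return deltas


# Two objectives (e.g. task progress and conflict avoidance), three states.
values = [[0.0] * 3 for _ in range(2)]
prefs = [[0.0, 0.0] for _ in range(3)]
deltas = update(values, prefs, state=0, action=1, next_state=1,
                rewards=[1.0, -2.0])
```

In this usage example the second objective has the larger-magnitude TD error, so it alone drives the actor update for that step, while both critics are still updated from their own errors.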
