A Study on Applying Instruction-based Behavior Explanation to the Instruction of Heterogeneous Agents

福地 庸介; 大澤 正彦; 山川 宏; 今井 倫太

doi:10.11517/jsaisigtwo.2017.AGI-006_07

Abstract

When the state and action spaces are large, it is difficult for a reinforcement learning agent to learn its policy within a practical amount of time. Previous studies have proposed methods in which a trainer suggests better actions to a trainee to speed up learning. However, when the action spaces of the trainer and the trainee are not the same, the instruction does not work without a mapping from the instruction to the trainee's variable space. In this paper, we deal with three types of instruction: action-based expressions, abstract expressions from a human trainer, and expressions output by Instruction-based Behavior Explanation, a framework for announcing a reinforcement learning agent's future behavior. The three types of instruction were mapped to the agents' action spaces with deep reinforcement learning, and we compared the resulting mappings to consider what form information should take when instructing heterogeneous agents.

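Notice from the publisher

All articles of the Type 2 study groups (第二種研究会) are accessible without authentication. In principle, the copyright of each article belongs to its authors.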
