教示者による学習支援に基づくエージェントのオンライン行動獲得

廣川 暢一; 鈴木 健嗣

doi:10.1527/tjsai.25.694

抄録

This paper describes a novel methodology, namely ``Coaching'', which allows humans to give a subjective evaluation to an agent in an iterative manner. This is an interactive learning method to improve the reinforcement learning by modifying a reward function dynamically according to given evaluations by a trainer and the learning situation of the agent. We demonstrate that the agent can learn different reward functions by given instructions such as ``good or bad'' by human's observation, and can also obtain a set of behavior based on the learnt reward functions through several experiments.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

電力動揺の高速抑制を追求した発電機適応形LQGシステムの構築について
Giant Aneurysm of the Azygos Anterior Cerebral Artery
Recent Trends of CAE Applications in Forging Process
Regularizations and finite ladders in multiple trigonometry
[title in Japanese]

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）