人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
原著論文
教示者による学習支援に基づくエージェントのオンライン行動獲得
廣川 暢一鈴木 健嗣
著者情報
ジャーナル フリー

25 巻 (2010) 6 号 p. 694-702

詳細
PDFをダウンロード (1161K) 発行機関連絡先
抄録

This paper describes a novel methodology, namely ``Coaching'', which allows humans to give a subjective evaluation to an agent in an iterative manner. This is an interactive learning method to improve the reinforcement learning by modifying a reward function dynamically according to given evaluations by a trainer and the learning situation of the agent. We demonstrate that the agent can learn different reward functions by given instructions such as ``good or bad'' by human's observation, and can also obtain a set of behavior based on the learnt reward functions through several experiments.

著者関連情報
© 2010 JSAI (The Japanese Society for Artificial Intelligence)
前の記事 次の記事

オルトメトリクス
閲覧履歴
feedback
Top