2P1-G12 Improving the Efficiency of Reinforcement Learning by Value Function Composition Using Parallel Processing

仲間 祐貴; 當眞 嗣久; 山田 孝治; 遠藤 聡志

doi:10.1299/jsmermd.2010._2P1-G12_1

Abstract

This paper addresses improving the efficiency of reinforcement learning by composing value functions with parallel processing. We propose a method that periodically composes the Q tables of local learning clusters into a global Q table. We apply this method to two problems: a maze problem and a behavior rule detection problem for a modular robot. Q-learning and the Monte Carlo method are compared with the profit sharing method for learning robot behaviors. We present computer experiments with 40 PC clusters. The convergence time and the number of learning trials are evaluated and discussed.
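The periodic composition of local Q tables into a global Q table can be illustrated with a minimal sketch. The abstract does not state the composition rule, the environment, or the hyperparameters, so everything below is an assumption made for illustration: the composition is an element-wise mean, the "maze" is a six-cell corridor, the workers are simulated sequentially rather than on separate PCs, and the names `compose`, `train`, and `run_episode` are hypothetical.

```python
import random

N_STATES = 6          # corridor cells 0..5; cell 5 is the goal (toy stand-in for a maze)
ACTIONS = (-1, +1)    # move left / move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2   # assumed hyperparameters

def new_q_table():
    return {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}

def run_episode(q, rng, max_steps=100):
    """Run one epsilon-greedy Q-learning episode, updating q in place."""
    s = 0
    for _ in range(max_steps):
        if rng.random() < EPSILON:
            a = rng.choice(ACTIONS)
        else:
            best = max(q[(s, b)] for b in ACTIONS)
            a = rng.choice([b for b in ACTIONS if q[(s, b)] == best])
        s2 = min(max(s + a, 0), N_STATES - 1)   # walls at both ends
        done = s2 == N_STATES - 1
        r = 1.0 if done else 0.0
        target = r if done else r + GAMMA * max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += ALPHA * (target - q[(s, a)])
        s = s2
        if done:
            break

def compose(local_tables):
    """Assumed composition rule: element-wise mean of the local Q tables."""
    n = len(local_tables)
    return {k: sum(t[k] for t in local_tables) / n for k in local_tables[0]}

def train(n_workers=4, rounds=10, sync_period=5, seed=0):
    """Simulate n_workers local learners that synchronize every sync_period episodes."""
    rngs = [random.Random(seed + i) for i in range(n_workers)]
    global_q = new_q_table()
    for _ in range(rounds):
        # Each worker starts the round from its own copy of the global table.
        local_tables = [dict(global_q) for _ in range(n_workers)]
        for q, rng in zip(local_tables, rngs):
            for _ in range(sync_period):
                run_episode(q, rng)
        # Periodic composition of the local tables into the global table.
        global_q = compose(local_tables)
    return global_q

if __name__ == "__main__":
    q = train()
    for s in range(N_STATES - 1):
        print(s, {a: round(q[(s, a)], 3) for a in ACTIONS})
```

In a real cluster the inner loop over workers would run on separate machines, and only the Q tables would be exchanged at each synchronization point. Other composition rules, such as an element-wise maximum or a visit-count-weighted mean, fit the same structure by changing `compose` alone.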

