人工知能
Online ISSN : 2435-8614
Print ISSN : 2188-2266
人工知能学会誌(1986~2013, Print ISSN:0912-8085)
行為の同型性に基づく強化学習法
山口 智浩野村 勇治田中 康祐谷内田 正彦
著者情報
解説誌・一般情報誌 フリー

1997 年 12 巻 6 号 p. 870-880

詳細
抄録

The advantage of emergence is that various solutions are emerged. However, it takes large computation cost to emerge them since it requires the numbers of iterations of simulation. So we try to reduces the computation cost without losing variety of solutions by introducing the abstraction technique in Artificial Intelligence. This paper presents Isomorphism Based Reinforcement Learning by Isomorphism of Actions that reduces the learning cost without losing variety of solutions. Isomorphism is one of the concepts in Enumerative Combinatorics of mathematics. First we explain Isomorphism of Actions, then explain Isomorphism of Behaviors. Isomorphic behaviors those perform the same task can be obtained by transforming the learning result of the task by "the appropriate permutation". However, a priori knowledge that represents "the appropriate permutation" is not always given, so this paper uses the generate & test method that first generates the isomorphic learning results by transforming the learning result of reinforcement learning for a task by the combinatorial permutations, then tests to select two kinds of the behaviors performing the following tasks ; (1) isomorphic behaviors those perform the same task ; (2) discovery of the behaviors those are converged to the new task state. Since the acquired learning results are isomorphic each other, the merits of our method are those the time cost for generating various learning results is small and also the space cost is small too because it needs only the original learning result and the set of permutations for it. For these reasons, this method is significant for realizing the learning various behaviors for the dynamic environment or multiagent.

著者関連情報
© 1997 人工知能学会
前の記事 次の記事
feedback
Top