Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications
Online ISSN : 2188-4749
Print ISSN : 2188-4730
The 29th ISCIE International Symposium on Stochastic Systems Theory and Its Applications (Nov. 1997, Tokyo)
A Master Equation Formulation of the Reinforcement Scheme of Stochastic Learning Automata
Fei QianHironori Hirata
Author information
JOURNAL FREE ACCESS

1998 Volume 1998 Pages 273-278

Details
Abstract
For judging the convergence property of reinforcement learning algorithms, we formulate the learning scheme in terms of a discrete Markov process, and transform its equation into a continuous time master equation. By making a small perturbation for as mall learning parameter, we derive a small perturbation expansion of the master equation to get a Fokker-Planck equation approximation with the low-order of the learning parameters. In here, we show that the global features of reinforcement scheme of learning automata can be described within this approximation due to the fact that the deterministic term of the dynamics has a globally asymptotically stable fixed point.
Content from these authors
© 1998 ISCIE Symposium on Stochastic Systems Theory and Its Applications
Previous article Next article
feedback
Top