Abstract
A topic of learning autonomous robots has widely been discussed in these years. Reinforcement learning (RL) is one of the popular methods to this domain. Its performance is, however, quite sensitive to the segmentation of state and action spaces. To overcome this problem, we have been developing a new technique, Bayesian-discrimination-functionbased RL (BRL). Furthermore, BRL has proven to be more effective than other standard RL algorithms in dealing with multi-robot system (MRS) problems. However, as most learning systems do, BRL occasionally suffers from overfitting. This paper introduces an extended BRL for improving the robustness of MRS. Metalearning based on information entropy of firing rules is adopted for adaptively modifying its learning parameters. Computer simulations are conducted to verify the effectiveness of our proposed method.