Host: The Japanese Society for Artificial Intelligence
Name : The 33rd Annual Conference of the Japanese Society for Artificial Intelligence, 2019
Number : 33
Location : [in Japanese]
Date : June 04, 2019 - June 07, 2019
When humans learn, it is not just by individual trial-and-error, but the learning is accelerated by sharing information with others. There are social learning strategies such as imitating others’ actions and emulating the high achievement of someone. As a model of social learning, sharing of state- and/or action-values are often implemented in reinforcement learning algorithms. However, sharing information of such huge amount is not realistic for a model of social learning of humans or animals. We propose an algorithm in which a mere “record” (achieved accumulated reward per episode) leads to efficient social learning. The algorithm is based on the model of satisficing integrated with different risk attitudes around the reference (aspiration level), and the conversion of the global aspiration onto each state.