Host: The Japanese Society for Artificial Intelligence
Name : The 33rd Annual Conference of the Japanese Society for Artificial Intelligence, 2019
Number : 33
Location : [in Japanese]
Date : June 04, 2019 - June 07, 2019
Utility-based Q-learning, which uses subjective utilities as rewards of Q-learning, has been proposed and the utilities that derive mutual cooperation in a Prisoner's Dilemma game have been successfully evolved by real-coded genetic algorithm (RCGA). However, in that work, the genes were simply exchanged in the evolution process like a bit-string GA and the search space was not so wide as a result. This work investigates the evolution of the subjective utilities by RCGA with blend crossover (BLX-α) that has a powerful search ability by generating various chromosomes.