This paper proposes a synthesis method of an optimal supervisor in terms of a language measure by using a reinforcement learning.Recently, a concept of the language measure is introduced to the formal languages and a synthesis method of an optimal supervisor based on the language measure has been proposed.In this paper, we apply the reinforcement learning as a learning method of the language measure, and show that the optimal supervisor in terms of the language measure can be derived through learning.By computer simulation, we examine an opimality of the obtained supervisor.