Abstract
In this paper, a reinforcement learning method is proposed that optimizes passenger service in elevator group systems. Task-oriented reinforcement learning using multiple agents is applied in the control system in allocating immediate landing calls to the elevators and operating them intelligently in attaining better service in this stochastic dynamic domain. The proposed system shows better adaptive performance in different traffic profiles with faster convergence compared to the other learning elevator group control system.