Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 3Xin2-93
Conference information

Evaluation of Reward Functions in Reinforcement Learning of Protein Language Models
*Ryoichi TAKASEHiroya IJIMAYoshihiro OSAKABEAkinori ASAHARAHikaru KOYAMA
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract

In pharmaceutical development, protein language models (pLMs) and reinforcement learning (RL) have become essential techniques for designing desired protein sequences. In this paper, we investigate the effect of loss functions in reward model training, since reward models are central to obtaining protein sequences with better performance. Two types of typical loss functions, such as mean squared error and ranking loss, are used to train reward models.Numerical experiments have shown that there is no significant difference in the performance evaluation of the reward models alone. However, it turned out that the difference in the loss functions affect to the pLMs after performing RL. The ranking loss tends to provide better performance and to keep the distribution of pLMs during RL, resulting in obtaining desired protein sequences with better performance.

Content from these authors
© 2024 The Japanese Society for Artificial Intelligence
Previous article Next article
feedback
Top