Host: The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
Recent advancements in Large Language Models (LLMs) within the artificial intelligence domain have shown exceptional performance across various natural language processing tasks. Amidst these developments, aligning the values and objectives of LLMs with human perspectives has become increasingly important. Reinforcement Learning from Human Feedback (RLHF) has gained notable interest as a method for such alignment adjustments. This study explored a learning approach for LLMs using RLHF, employing scenarios from the romance simulation game 'Tokimeki Memorial 3' as the game scenario data. Specifically, the research involved an experiment where sentences were generated following five Japanese characters, tailored to align with the personalities of the game characters. While subjective, this evaluation demonstrated the capability of producing sentences that appropriately matched the distinct characters in the game.