Host : The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
In the development of Large Language Models for Code (Code LLM), instruction tuning has been found to be effective in enhancing model performance. Instruction tuning is a method that improves generalization by further training a model on instruction data. However, opinions vary on which instruction format is optimal, and the question has not been settled. The purpose of this study is to investigate how different instruction formats affect code generation performance, in order to strengthen the effect of instruction tuning for Code LLM. In particular, we focused on the output formats used to extract code from model responses and conducted experiments, visualizing the results. The experiments revealed differences in code generation performance across output formats and showed that the Markdown format was the most versatile. Moreover, specifying an output format yielded a higher accuracy rate than leaving the output format unspecified.
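To illustrate what an "output format used for code extraction" can look like in practice, the following is a minimal sketch (not taken from the paper): it assumes the model is instructed to wrap its answer in a Markdown code fence, and extracts the fenced code with a regular expression. The function name `extract_code` and the sample prompt wording are hypothetical.

```python
import re

# Hypothetical setup: the prompt asks the model to answer in Markdown, e.g.
# "Wrap your solution in a ```python ... ``` block."
FENCE_PATTERN = re.compile(r"```(?:python)?\s*\n(.*?)```", re.DOTALL)

def extract_code(response: str) -> str | None:
    """Return the body of the first Markdown code fence, or None if no fence is found."""
    match = FENCE_PATTERN.search(response)
    return match.group(1) if match else None

if __name__ == "__main__":
    sample = "Here is the solution:\n```python\ndef add(a, b):\n    return a + b\n```"
    print(extract_code(sample))  # prints the fenced function body only
```

Under this kind of scheme, whether the model reliably produces the requested fence (Markdown, plain text, or another convention) directly affects how much generated code can be recovered and evaluated, which is the aspect the experiments compare.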