Proceedings of the Annual Conference of JSAI
Online ISSN : 2758-7347
38th (2024)
Session ID : 2G5-GS-6-02
Data Augmentation with ChatGPT for Efficient Evaluation of Large Language Models in Data-Scarce Environments
*HANHUA ZHU
Abstract

In recent years, Large Language Models (LLMs) have developed rapidly and now play a significant role in Natural Language Processing (NLP). However, there is no established standard for efficiently evaluating these LLMs, which often generate complex sentences. Evaluation methods that use trained Language Models (LMs) are popular because of their low cost, but their accuracy suffers when training data is scarce. I propose a data augmentation method that uses ChatGPT to improve the accuracy of LM-based evaluators under data scarcity. Results on a Japanese Question Answering (QA) task show that an LM trained on questions and answers generated by the proposed method surpassed ChatGPT-3.5 and reached 92% of the evaluation performance of ChatGPT-4, even when only source documents were available.
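The abstract does not detail the augmentation pipeline. A minimal sketch of the general idea it describes — prompting an LLM such as ChatGPT to generate question-answer pairs from raw documents, then using those pairs as training data for an evaluator LM — might look like the following. The prompt wording, function names, and Q:/A: parsing format are illustrative assumptions, not the author's implementation.

```python
# Hypothetical sketch of document-to-QA augmentation (names and prompt
# format are assumptions; the paper's actual pipeline may differ).

PROMPT_TEMPLATE = (
    "Read the following document and write {n} question-answer pairs.\n"
    "Format each pair as:\nQ: <question>\nA: <answer>\n\n"
    "Document:\n{doc}"
)

def build_prompt(document: str, n: int = 3) -> str:
    """Build the augmentation prompt for one source document."""
    return PROMPT_TEMPLATE.format(n=n, doc=document)

def parse_qa_pairs(llm_output: str) -> list[tuple[str, str]]:
    """Parse 'Q:'/'A:' lines from an LLM response into (question, answer) pairs."""
    pairs, question = [], None
    for line in llm_output.splitlines():
        line = line.strip()
        if line.startswith("Q:"):
            question = line[2:].strip()
        elif line.startswith("A:") and question is not None:
            pairs.append((question, line[2:].strip()))
            question = None
    return pairs

# Example with a mocked LLM response (no API call is made here):
mock_response = (
    "Q: What is the capital of Japan?\nA: Tokyo.\n"
    "Q: Who wrote Kokoro?\nA: Natsume Soseki."
)
print(parse_qa_pairs(mock_response))
```

In a real pipeline, the parsed pairs would be collected across all available documents and used to fine-tune the evaluator LM.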

© 2024 The Japanese Society for Artificial Intelligence