大規模言語モデルを用いた質問応答文の自動評価とプロンプトインジェクションへの対処

近藤 拓未; 竹内 孝; 李 吉屹; 齊藤 秀; 鹿島 久嗣

doi:10.11517/pjsai.JSAI2024.0_2G4GS601

Abstract

In many question-answering tasks of Natural Language Processing, its evaluation is based on exact or partial matching between the candidate text and pre-prepared reference answers, regardless of the question domain. However, evaluating open-domain question-answering, which does not limit the range of topics, becomes problematic due to issues such as synonymous expressions and variations in notation, making accurate assessment challenging through lexical matching. Existing studies have proposed automated evaluation using Large Language Models (LLM) to address these challenges, but discussions on the vulnerability of automated evaluation are lacking. In this study, we propose a new framework for automated evaluation using LLM to address these issues and discuss its performance and robustness. Our experiments show that LLM's automated evaluation aligns with human evaluation in over 90% of cases and demonstrates resilience against attacks on the evaluation system.

Content from these authors

Favorites & Alerts

Corresponding author

Conference information

Register with J-STAGE for free!