Host: The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
Inaccurate responses, termed hallucinations, pose challenges in many Large Language Model (LLM) applications. SelfCheckGPT, a sampling-based method, detects hallucinations through the model's input-output interface alone, without external knowledge; however, it requires raising the temperature parameter, which cannot be controlled in some LLM services, including ChatGPT. In LLM services designed for accurate responses, the temperature is fixed at a low value, which can degrade the performance of SelfCheckGPT. We therefore propose a method that applies data augmentation to the input (appending random strings, or back-translation) during sampling, enabling hallucination detection in Japanese LLMs under this fixed-low-temperature constraint. Our experimental results show that the proposed method outperforms SelfCheckGPT under the fixed-low-temperature constraint.