Host : The Japanese Society for Artificial Intelligence
Name : The 38th Annual Conference of the Japanese Society for Artificial Intelligence
Number : 38
Location : [in Japanese]
Date : May 28, 2024 - May 31, 2024
The large amounts of pre-training data used to build large language models (LLMs) may contain text that is inappropriate for training, such as copyrighted material or personal information. To address this problem, a method was proposed for detecting whether a given text is included in an LLM's pre-training data. The existing method makes this determination using the tokens in a sequence that have low probabilities. It has been evaluated only on LLMs trained on English, and its effectiveness on LLMs trained on Japanese has not been investigated. In this study, we evaluate the effectiveness of the existing detection method on Japanese LLMs and compare it with its effectiveness on English LLMs. To this end, we construct JAWikiMIA, a benchmark for detecting Japanese pre-training data. We report that English LLMs achieve high AUC scores when the method uses the 20% of tokens in a sequence with the lowest probabilities, whereas Japanese LLMs achieve high AUC scores when the method uses all tokens in a sequence.
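The detection approach described above scores a sequence by the probabilities its tokens receive under the target LLM. Below is a minimal sketch of such a low-probability-token score (in the style of Min-K% Prob), not the authors' exact implementation: the helper name min_k_percent_score, the choice of k, and the example model rinna/japanese-gpt-neox-3.6b are illustrative assumptions.

```python
# Minimal sketch: score a text by the average log-probability of its
# lowest-probability tokens under a causal LM (higher score = more likely
# the text appeared in the pre-training data).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer


def min_k_percent_score(text: str, model, tokenizer, k: float = 0.2) -> float:
    """Average log-probability of the fraction k of tokens with the lowest
    probabilities; k = 1.0 uses all tokens in the sequence."""
    enc = tokenizer(text, return_tensors="pt")
    input_ids = enc.input_ids
    with torch.no_grad():
        logits = model(input_ids).logits
    # Log-probability of each token conditioned on its preceding context.
    log_probs = torch.log_softmax(logits[:, :-1, :], dim=-1)
    token_log_probs = log_probs.gather(
        2, input_ids[:, 1:].unsqueeze(-1)
    ).squeeze(-1).squeeze(0)
    # Keep only the k fraction of tokens with the lowest probabilities.
    n_keep = max(1, int(len(token_log_probs) * k))
    lowest = torch.sort(token_log_probs).values[:n_keep]
    return lowest.mean().item()


if __name__ == "__main__":
    # Hypothetical Japanese LLM used only for illustration.
    name = "rinna/japanese-gpt-neox-3.6b"
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(name).eval()
    print(min_k_percent_score("吾輩は猫である。名前はまだ無い。", model, tokenizer, k=0.2))
```

In an evaluation of this kind, such scores would be computed for both member and non-member texts in the benchmark, and the AUC would be taken over the two groups; setting k = 0.2 corresponds to the 20%-of-tokens configuration and k = 1.0 to using all tokens.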