Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Original Paper
Extracting Bigram Knowledge from GPT-2
Minoru Yoshida, Kazuyuki Matsumoto
Keywords: GPT, interpretability, bigram, embedding
Free access

2025, Volume 40, Issue 3, Pages A-O65_1-23

Abstract

We propose a method to extract bigram knowledge from GPT-2 models. Based on the observation that the first layer of GPT-2 is useful for predicting the tokens that follow given input tokens, we propose an algorithm that uses self-attention heads from the first layer alone to predict next tokens. We also propose an algorithm that finds context words highly related to a given bigram by applying backpropagation to GPT-2 parameters for the next-token prediction. Experimental results showed that our proposed algorithms for next-word prediction and context-word induction achieved higher average precision than the baseline methods.
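Since only the abstract is available here, the two sketches below are our own hedged illustrations of the ideas it describes, not the authors' published algorithms. Both assume the Hugging Face transformers GPT-2 checkpoint; every function and variable name is hypothetical. The first sketch approximates the first idea crudely: it runs the entire first transformer block rather than isolating individual self-attention heads as the abstract describes, then projects the result through the output head to guess the next token.

```python
# Minimal sketch (not the authors' implementation): predict next tokens using
# only GPT-2's embeddings and first transformer block.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def first_layer_next_tokens(text: str, top_k: int = 5):
    ids = tokenizer(text, return_tensors="pt").input_ids            # (1, seq)
    with torch.no_grad():
        h = model.transformer.wte(ids)                              # token embeddings
        h = h + model.transformer.wpe(torch.arange(ids.size(1)))    # add positions
        h = model.transformer.h[0](h)[0]                            # first block only
        h = model.transformer.ln_f(h)                               # final layer norm
        logits = model.lm_head(h[:, -1])                            # unembed last position
    top = torch.topk(logits, top_k, dim=-1).indices[0].tolist()
    return [tokenizer.decode(t) for t in top]

print(first_layer_next_tokens("New"))   # bigram continuations such as " York" may rank highly
```

For the second idea, the abstract mentions applying backpropagation to GPT-2 parameters; the sketch below instead swaps in a related, plainly named technique, input-gradient saliency: it takes the gradient of the bigram's target-token logit with respect to a context position's input embedding and ranks vocabulary embeddings by dot product with that gradient.

```python
# Rough sketch (our own illustration, not the paper's algorithm): score
# candidate context words for a bigram by gradient saliency.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def context_saliency(first: str, second: str, top_k: int = 5):
    # Hypothetical input "<context> <first>", predicting <second>.
    ctx_id = tokenizer.encode(" the")[0]              # placeholder context token
    first_id = tokenizer.encode(" " + first)[0]
    second_id = tokenizer.encode(" " + second)[0]
    ids = torch.tensor([[ctx_id, first_id]])
    embeds = model.transformer.wte(ids).detach().requires_grad_(True)
    logits = model(inputs_embeds=embeds).logits       # (1, 2, vocab)
    logits[0, -1, second_id].backward()               # target logit for the bigram
    grad = embeds.grad[0, 0]                          # gradient at the context slot
    scores = model.transformer.wte.weight @ grad      # score all vocabulary embeddings
    top = torch.topk(scores, top_k).indices.tolist()
    return [tokenizer.decode(t) for t in top]

print(context_saliency("York", "City"))
```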

© JSAI (The Japanese Society for Artificial Intelligence)