Value行列を手掛かりとした Transformerの分析

吉田 稔; 松本 和幸; 北 研二

doi:10.1527/tjsai.38-2_C-MB7

抄録

We propose a new method to analyze Transformer language models. In Transformer self-attention modules, attention weights are calculated from the query vectors and key vectors. Then, output vectors are obtained by taking the weighted sum of value vectors. While existing works on analysis of Transformer have focused on attention weights, this work focused on value and output matrices. We obtain joint matrices by multiplying both matrices, and show that the trace of the joint matrices are correlated with word co-occurences.

著者関連情報

お気に入り & アラート

お気に入りに追加
追加情報アラート
被引用アラート
認証解除アラート

閲覧履歴

The Analysis of Self-determined Behaviors among Laryngectomees for the Continuation of Esophageal Phonation Training
Efficient DFA on SPN-Based Block Ciphers and Its Application to the LED Block Cipher
シロイヌナズナCDK inhibitorの機能解析
ラット臼歯の発育過程における歯根の形態変化について
The 60th Annual Meeting of the Polarographic Society of Japan

責任著者(Corresponding author)

J-STAGEへの登録はこちら（無料）