Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Short Paper
Analysis of Transformers Using Value Matrices as Clues
Minoru Yoshida, Kazuyuki Matsumoto, Kenji Kita

2023, Volume 38, Issue 2, Article ID: 38-2_C-MB7

Abstract

We propose a new method for analyzing Transformer language models. In Transformer self-attention modules, attention weights are calculated from the query and key vectors, and output vectors are then obtained as a weighted sum of the value vectors. While existing work on analyzing Transformers has focused on the attention weights, this work focuses on the value and output matrices. We obtain joint matrices by multiplying these two matrices, and show that the traces of the joint matrices correlate with word co-occurrences.
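As a rough illustration of the quantity the abstract describes, the sketch below (not the authors' implementation) extracts the per-head value and output matrices from a pretrained model, multiplies them into a joint matrix, and takes its trace. The model choice (bert-base-uncased), the layer index, and the HuggingFace module paths are assumptions made for illustration.

```python
# Minimal sketch: per-head joint matrix W_V @ W_O and its trace.
# Assumptions: bert-base-uncased, layer 0, HuggingFace BertModel layout.
import torch
from transformers import AutoModel

model = AutoModel.from_pretrained("bert-base-uncased")
attn = model.encoder.layer[0].attention        # first layer (assumed)
num_heads = model.config.num_attention_heads
head_dim = model.config.hidden_size // num_heads

# nn.Linear stores weights as (out_features, in_features); transpose so
# rows index the hidden (residual-stream) dimension.
W_V = attn.self.value.weight.T                 # (hidden, hidden)
W_O = attn.output.dense.weight.T               # (hidden, hidden)

with torch.no_grad():
    for h in range(num_heads):
        # Per-head slices: columns of W_V and rows of W_O for head h.
        W_V_h = W_V[:, h * head_dim:(h + 1) * head_dim]  # (hidden, head_dim)
        W_O_h = W_O[h * head_dim:(h + 1) * head_dim, :]  # (head_dim, hidden)
        joint = W_V_h @ W_O_h                            # joint matrix
        print(f"layer 0, head {h}: trace = {torch.trace(joint).item():+.4f}")
```

The joint matrix of each head maps the residual stream through that head's value and output projections in one step, so its trace summarizes the head's contribution in a single scalar; the paper relates these scalars to word co-occurrence statistics.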

© 2023 The Japanese Society for Artificial Intelligence