Graph-Theoretic Analysis of Semantic Drift in Espresso-Style Bootstrapping: Evaluation on Word Sense Disambiguation

Mamoru Komachi; Taku Kudo; Masashi Shimbo; Yuji Matsumoto

doi:10.1527/tjsai.25.233

Abstract

Bootstrapping tends to select instances unrelated to the seed instances as the iteration proceeds, a tendency called semantic drift. Using a simplified graph-based reformulation of bootstrapping, we demonstrate that the semantic drift of Espresso-style bootstrapping has the same root cause as the topic drift of Kleinberg's HITS. We confirm that two graph-based algorithms, the von Neumann kernels and the regularized Laplacian, can reduce the effect of semantic drift in word sense disambiguation (WSD) on the Senseval-3 English Lexical Sample Task. The proposed algorithms outperform Espresso and previous graph-based WSD methods, even though they have fewer parameters and are easy to calibrate.
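As a rough illustration of the two graph kernels named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes the commonly used definitions K = A(I - βA)^{-1} for the von Neumann kernel and R = (I + βL)^{-1} with L = D - A for the regularized Laplacian, where A is an instance-instance similarity matrix built from an instance-pattern co-occurrence matrix M. The toy matrix `M`, the function names, and the choices of β are hypothetical.

```python
import numpy as np

def von_neumann_kernel(A, beta):
    """Assumed definition: K = A (I - beta*A)^{-1} = sum_{n>=0} beta^n A^(n+1)."""
    lam_max = np.max(np.abs(np.linalg.eigvalsh(A)))  # spectral radius of symmetric A
    if not 0.0 <= beta < 1.0 / lam_max:
        raise ValueError("beta must satisfy 0 <= beta < 1 / spectral radius of A")
    n = A.shape[0]
    return A @ np.linalg.inv(np.eye(n) - beta * A)

def regularized_laplacian(A, beta):
    """Assumed definition: R = (I + beta*L)^{-1}, with L = D - A the graph Laplacian."""
    if beta < 0.0:
        raise ValueError("beta must be non-negative")
    n = A.shape[0]
    D = np.diag(A.sum(axis=1))  # degree matrix (row sums of A)
    L = D - A
    return np.linalg.inv(np.eye(n) + beta * L)

# Hypothetical toy data: 4 instances (rows) x 3 patterns (columns).
M = np.array([[1, 1, 0],
              [1, 0, 1],
              [0, 1, 1],
              [0, 0, 1]], dtype=float)
A = M @ M.T  # instance-instance similarity via shared patterns

# Seed vector selecting instance 0; scores rank all instances against the seed.
seed = np.array([1.0, 0.0, 0.0, 0.0])
lam_max = np.max(np.abs(np.linalg.eigvalsh(A)))

vn_scores = von_neumann_kernel(A, 0.5 / lam_max) @ seed
rl_scores = regularized_laplacian(A, 0.1) @ seed

print("von Neumann kernel ranking:   ", np.argsort(-vn_scores))
print("regularized Laplacian ranking:", np.argsort(-rl_scores))
```

In this sketch β is the single tunable parameter of each kernel. Under the assumed definitions, a small β keeps the von Neumann kernel close to the plain co-occurrence similarity A, while β approaching the reciprocal of the spectral radius pushes the ranking toward the principal eigenvector of A, which is the HITS-like behavior that the abstract associates with drift.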
