属性語のWeb文書からの自動発見と人手評価のための基準

徳永 耕亮; 風間 淳一; 鳥澤 健太郎

doi:10.5715/jnlp.13.4_49

Abstract

We propose a method of acquiring attribute words for a wide range of object classes from Japanese Web documents.The method is a simple unsupervised method that ranks candidate words according to the score that uses the statistics of lexicosyntactic patterns, HTML tags, and word occurrences, as clues.To evaluate the attribute words, we also establish an evaluation procedure based on the idea of question-answerability. Using the proposed evaluation procedure, we conducted experiments on 22 word classes with four human evaluators.The results revealed that our method can obtain attribute words with a high degree of precision and the clues used in the ranking actually contribute to the performance.

Content from these authors

Favorites & Alerts

Add to favorites
Additional info alert
Citation alert
Authentication alert

Corresponding author

Register with J-STAGE for free!