Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Automatic Discovery of Attribute Words from Web Documents and Criteria for Human Evaluation
KOSUKE TOKUNAGA, JUN'ICHI KAZAMA, KENTARO TORISAWA

2006 Volume 13 Issue 4 Pages 49-67

Abstract
We propose a method for acquiring attribute words for a wide range of object classes from Japanese Web documents. The method is simple and unsupervised: it ranks candidate words by a score computed from three kinds of clues, namely the statistics of lexicosyntactic patterns, HTML tags, and word occurrences. To evaluate the acquired attribute words, we also establish an evaluation procedure based on the idea of question-answerability. Using this procedure, we conducted experiments on 22 word classes with four human evaluators. The results show that our method obtains attribute words with high precision and that the clues used in the ranking do contribute to performance.
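
The ranking idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so everything below is an assumption made for illustration: the function name `rank_candidates`, the three count dictionaries, and the choice to sum log-dampened counts are hypothetical and are not the authors' actual score.

```python
import math


def rank_candidates(pattern_counts, tag_counts, occurrence_counts, top_k=5):
    """Rank candidate attribute words by a combined clue score.

    Illustrative only: the real score in the paper is not reproduced here.
    Each argument maps a candidate word to a raw count for one clue type
    (lexicosyntactic patterns, HTML tags, plain word occurrences).
    """
    candidates = set(pattern_counts) | set(tag_counts) | set(occurrence_counts)
    scores = {}
    for word in candidates:
        # Assumed combination: sum of log-dampened counts; a missing clue adds 0.
        scores[word] = (
            math.log1p(pattern_counts.get(word, 0))
            + math.log1p(tag_counts.get(word, 0))
            + math.log1p(occurrence_counts.get(word, 0))
        )
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]


# Toy counts for a hypothetical object class such as "car" (made-up numbers).
ranked = rank_candidates(
    pattern_counts={"price": 10, "color": 3},
    tag_counts={"price": 5},
    occurrence_counts={"price": 20, "color": 8, "the": 1},
)
```

In this toy setup, a word supported by all three clue types ranks above a word that only occurs frequently, which is the general behavior the abstract attributes to combining clues.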
© The Association for Natural Language Processing