Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Automatic acquisition of hyponymy relations from HTML documents
KEIJI SHINZATO, KENTARO TORISAWA

2005 Volume 12 Issue 1 Pages 125-150

Abstract
This paper describes an automatic acquisition method for hyponymy relations. Hyponymy relations play a crucial role in various natural language processing systems, and there have been many attempts to acquire them automatically from large-scale corpora. Most existing acquisition methods rely on particular linguistic patterns, such as juxtapositions, which specify hyponymy relations. Our method, however, does not use such linguistic patterns. Instead, we try to acquire hyponymy relations from four different types of clues. The first is repetitions of HTML tags found in ordinary HTML documents on the WWW. The second is statistical measures such as df and idf, which are popular in the IR literature. The third is verb-noun co-occurrences found in ordinary corpora. The fourth is heuristic rules obtained through our experiments on a development set.
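As a rough illustration of the second clue, the sketch below computes document frequency (df) and inverse document frequency (idf) over a toy corpus. It is not the authors' scoring procedure: the abstract does not say which idf variant is used or how the scores are combined, so the plain form idf(t) = log(N / df(t)) is an assumption, and the corpus and the function names `document_frequency` and `inverse_document_frequency` are hypothetical.

```python
import math
from collections import Counter


def document_frequency(docs):
    """Count, for each term, how many documents contain it at least once."""
    df = Counter()
    for tokens in docs:
        # set() ensures a term is counted once per document.
        df.update(set(tokens))
    return df


def inverse_document_frequency(docs):
    """Compute idf(t) = log(N / df(t)); one common variant, assumed here."""
    n_docs = len(docs)
    df = document_frequency(docs)
    return {term: math.log(n_docs / count) for term, count in df.items()}


# Hypothetical toy corpus: each document is a list of tokens.
docs = [
    ["toyota", "honda", "car"],
    ["car", "price"],
    ["honda", "motorcycle"],
]

df = document_frequency(docs)
idf = inverse_document_frequency(docs)

for term in sorted(df):
    print(f"{term}\tdf={df[term]}\tidf={idf[term]:.3f}")
```

Under this variant, terms that appear in many documents receive a low idf and terms concentrated in few documents receive a high idf, which is the property that makes these measures usable as evidence about how general or specific a term is.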
© The Association for Natural Language Processing