Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Paper
Personal Name Disambiguation in Web Search Results Using a Semi-Supervised Clustering Approach
Kazunari Sugiyama, Manabu Okumura

2009 Volume 16 Issue 5 Pages 5_23-5_49

Abstract
Personal names are often submitted to search engines as query keywords. In response to a personal name query, however, a search engine returns a long list of results in which Web pages about several namesakes are mixed together. To address this problem, most previous work on disambiguating personal names in Web search results employs agglomerative clustering. In contrast, we adopt a semi-supervised clustering approach that integrates similar documents into the cluster of a seed document. The proposed approach is novel in that it controls the fluctuation of the cluster's centroid.
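The idea of growing a cluster from a seed document while limiting how far its centroid moves can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. The function names `cosine` and `seeded_cluster`, the `threshold` and `damping` parameters, the damped-average update rule, and the toy two-dimensional vectors are all assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length term-weight vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def seeded_cluster(seed, docs, threshold=0.5, damping=0.2):
    """Grow one cluster around a seed vector (hypothetical sketch).

    Repeatedly merges the unassigned document most similar to the
    current centroid, stopping when that similarity drops below
    `threshold`. The centroid moves only a fraction `damping` of the
    way toward each merged document, which is one assumed way to
    keep it from drifting far from the seed.
    """
    centroid = list(seed)
    members = []
    remaining = list(range(len(docs)))
    while remaining:
        best = max(remaining, key=lambda i: cosine(centroid, docs[i]))
        if cosine(centroid, docs[best]) < threshold:
            break
        members.append(best)
        remaining.remove(best)
        # Damped update: a small `damping` keeps the centroid near the seed.
        centroid = [(1 - damping) * c + damping * d
                    for c, d in zip(centroid, docs[best])]
    return members, centroid


# Toy data: documents 0 and 2 resemble the seed, document 1 does not.
seed = [1.0, 0.0]
docs = [[0.9, 0.1], [0.0, 1.0], [0.8, 0.3]]
members, centroid = seeded_cluster(seed, docs)
print(members, centroid)
```

With `damping` set to 1.0 the centroid would jump to each newly merged document, whereas values near 0 keep it anchored to the seed; the sketch exposes that trade-off as a single parameter.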
© 2009 The Association for Natural Language Processing