Transactions of the Institute of Systems, Control and Information Engineers
Online ISSN : 2185-811X
Print ISSN : 1342-5668
ISSN-L : 1342-5668
Papers
Extracting Vocation-Related Information for Distinguishing Different People with Identical Names on the Web
Hiroshi Ueda, Harumi Murakami, Shoji Tatsumi
JOURNAL FREE ACCESS

2009 Volume 22 Issue 6 Pages 229-240

Abstract
Distinguishing different people with identical names is becoming increasingly important in person searches on the Web. The aim of this research is to assign useful labels for identifying persons in “person clusters,” which are generated as a result of person searches on the Web. In this paper, we propose a method to label person clusters with “vocation-related information.” Vocation-related information is not limited to terms rigorously defined as vocations: it also includes broader terms that may be considered vocations and terms that are useful for inferring vocations. Our method is based on (a) extracting candidates for vocation-related information by using HTML structures and simple heuristics, and (b) generating vocation-related information by using term frequencies, synonym clustering, and Web search engines. Experimental results demonstrated the usefulness of the proposed method.
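To make stages (a) and (b) of the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. The abstract does not specify the HTML structures, heuristics, vocabulary, or synonym resources used, so the names `CANDIDATE_TAGS`, `VOCATION_TERMS`, `SYNONYMS`, `CandidateExtractor`, and `label_cluster` are all hypothetical. The sketch assumes that candidates are known terms found inside a few heading-like tags, and that synonyms are merged through a fixed lookup table before counting. The Web search engine step is omitted.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical settings for illustration only; none come from the paper.
CANDIDATE_TAGS = {"title", "h1", "h2", "th", "dt"}   # assumed "HTML structures"
VOCATION_TERMS = {"professor", "researcher", "engineer", "novelist", "writer"}
SYNONYMS = {"novelist": "writer"}                    # assumed synonym table


class CandidateExtractor(HTMLParser):
    """Stage (a): collect known terms that appear inside selected tags."""

    def __init__(self):
        super().__init__()
        self.depth = 0          # how many candidate tags are currently open
        self.candidates = []

    def handle_starttag(self, tag, attrs):
        if tag in CANDIDATE_TAGS:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in CANDIDATE_TAGS and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth == 0:
            return              # text outside the selected tags is ignored
        for word in data.lower().split():
            word = word.strip(".,:;()")
            if word in VOCATION_TERMS:
                self.candidates.append(word)


def label_cluster(pages):
    """Stage (b): rank candidate labels for one person cluster by frequency."""
    counts = Counter()
    for html in pages:
        parser = CandidateExtractor()
        parser.feed(html)
        parser.close()
        for term in parser.candidates:
            # Merge synonyms into one representative term before counting.
            counts[SYNONYMS.get(term, term)] += 1
    return [term for term, _ in counts.most_common()]
```

Calling `label_cluster` with the HTML strings of the pages in one person cluster returns the candidate labels ordered from most to least frequent. A fixed lookup table is the simplest stand-in for synonym clustering; a real system would need a data-driven grouping and a much larger vocabulary.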
© 2009 The Institute of Systems, Control and Information Engineers