システム制御情報学会論文誌
Online ISSN : 2185-811X
Print ISSN : 1342-5668
ISSN-L : 1342-5668
論文
Web上の同姓同名人物識別のための職業関連情報の抽出
上田 洋村上 晴美辰巳 昭治
著者情報
ジャーナル フリー

2009 年 22 巻 6 号 p. 229-240

詳細
抄録

Distinguishing different people with identical names is becoming more and more important in person searches on the Web. The aim of this research is to dispatch useful labels for identifying persons in “person clusters,” which are generated as a result of person searches on the Web. In this paper, we propose a method to label person clusters with “vocation-related information.” The vocation-related information includes broader terms that may be considered as vocations, and terms that are useful to infer vocations, not only those rigorously defined as vocations. Our method is based on (a) extracting candidates of vocation-related information by using HTML structures and simple heuristics, and (b) generating vocation-related information by using term frequencies,synonym clustering, and Web search engines. Experimental results revealed the usefulness of the proposed method.

著者関連情報
© 2009 システム制御情報学会
前の記事 次の記事
feedback
Top