人工知能学会論文誌
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
原著論文
共起語ネットワーク特徴の言語・文書種非依存性に基づくキーワード抽出と見出し語の予測による性能評価
山本 優樹折原 良平
著者情報
ジャーナル フリー

2009 年 24 巻 3 号 p. 303-312

詳細
抄録

A word co-occurrence graph based on co-occurrence of words within sentences is known to have characteristics of a small-world and scale-free network. We built a keyword extraction algorithm using it betweenness-pass parameter in addition to comprehensive network parameters that include clustering coefficient, average path length and the number of links. Making use of the relationship between an article and its headline in a newspaper, we applied SVM algorithm to learn properties of the network parameters that characterize keywords, and tuned a keyword evaluation function composed of these parameters. We show our algorithm outperforms a past study with a similar technique. Moreover, the learned model is successfully applicable to documents written in an other language andor documents of other types.

著者関連情報
© 2009 JSAI (The Japanese Society for Artificial Intelligence)
次の記事
feedback
Top