Transactions of the Japanese Society for Artificial Intelligence
Online ISSN : 1346-8030
Print ISSN : 1346-0714
ISSN-L : 1346-0714
Original Paper
Keyword Extraction using the Word Co-occurrence Network Properties that is Independent of Languages and Document Types and Its Evaluation by Prediction of Headline Words
Yuki YAMAMOTORyohei ORIHARA
Author information
JOURNAL FREE ACCESS

2009 Volume 24 Issue 3 Pages 303-312

Details
Abstract
A word co-occurrence graph based on co-occurrence of words within sentences is known to have characteristics of a small-world and scale-free network. We built a keyword extraction algorithm using it betweenness-pass parameter in addition to comprehensive network parameters that include clustering coefficient, average path length and the number of links. Making use of the relationship between an article and its headline in a newspaper, we applied SVM algorithm to learn properties of the network parameters that characterize keywords, and tuned a keyword evaluation function composed of these parameters. We show our algorithm outperforms a past study with a similar technique. Moreover, the learned model is successfully applicable to documents written in an other language andor documents of other types.
Content from these authors
© 2009 JSAI (The Japanese Society for Artificial Intelligence)
Next article
feedback
Top