人工知能学会第二種研究会資料
Online ISSN : 2436-5556
時系列テキストデータからの時間的出現依存関係に基づく重要単語の抽出
多田 知道岩沼宏治鍋島 英知
著者情報
研究報告書・技術報告書 フリー

2009 年 2009 巻 DMSM-A803 号 p. 12-

詳細
抄録

This paper shows a new method of extracting important words from newspaper corpus based on the temporal-dependency between word occurrences. This word extraction method plays an important role in event-sequence mining. TF·IDF is a well-known method to rank word's importance in a document. We already proposed a new word-extraction method of improving TFIDF method, called TF·IDayF,which considers temporal information of word occurrences and can extract important/characteristic words of expressing sequential events. However, this method does not consider any temporal dependency of word occurrences, which can be regarded as some causal relationships. In this paper, we propose a novel method for extracting important words by using temporal co-occurrence information of words in a newspaper corpus.

著者関連情報
© 2009 著作者
前の記事 次の記事
feedback
Top