JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
A Word Extraction Method from Sequential Text Data Based on Time Dependence Between Word Occuuerrences
Tomomichi TADAKoji IANUMAHidetomo NABESHIMA
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2009 Volume 2009 Issue DMSM-A803 Pages 12-

Details
Abstract

This paper shows a new method of extracting important words from newspaper corpus based on the temporal-dependency between word occurrences. This word extraction method plays an important role in event-sequence mining. TF・IDF is a well-known method to rank word's importance in a document. We already proposed a new word-extraction method of improving TFIDF method, called TF・IDayF,which considers temporal information of word occurrences and can extract important/characteristic words of expressing sequential events. However, this method does not consider any temporal dependency of word occurrences, which can be regarded as some causal relationships. In this paper, we propose a novel method for extracting important words by using temporal co-occurrence information of words in a newspaper corpus.

Content from these authors
© 2009 Authors
Previous article Next article
feedback
Top