Proceedings of the Fuzzy System Symposium
28th Fuzzy System Symposium
Conference information

main
Extraction of Similar Words Based on Time-correlation and Co-occurrence Probability from Tweets of the Same Topic
Yuichiro HisanoKazuhito SawaseHajime Nobuhara
Author information
CONFERENCE PROCEEDINGS OPEN ACCESS

Pages 394-397

Details
Abstract
In order to reduce various onomastic expressions for efficient tweet topic retrieval/clustering, a construction method of twitter dictionaries based on tweets extraction and their time-correlation is proposed. In the proposed method, similarities between keywords are calculated by the time-correlation of each word and co-occurrence probability. Furthermore, the proposed method divides the target time line to reduce the computational cost of twitter dictionaries construction. Through experiments with 101,714 tweets with the hashtags related to ``NHK kohaku-utagassen'', the effectiveness of the proposed division method compared with the method calculated using entire time line region is confirmed.
Content from these authors
© 2012 Japan Society for Fuzzy Theory and Intelligent Informatics
Previous article Next article
feedback
Top