Journal of Japan Industrial Management Association
Online ISSN : 2187-9079
Print ISSN : 1342-2618
ISSN-L : 1342-2618
Original Paper (Case Study)
A Case Study of Text Mining Tweet Data —How to Correct Information from the Affected Area in a Large-scale Disaster—
Yoshikazu SAKAMAKIEtsuko KAMEI
Author information
JOURNAL FREE ACCESS

2014 Volume 65 Issue 1 Pages 39-50

Details
Abstract
After the Great East Japan Earthquake, which occurred on March 11, 2011, most of the communication systems stopped because of power outages, and social networking services (SNS) such as Twitter made significant contributions as communication tools. A lot of information was transmitted on Twitter and details such as who needed help where, and what was lacking could be known. The government could also determine what was happening in the affected area in real time from tweet data. On the other hand, many false remarks and rumors were also circulated on Twitter. As a result, all information had to be checked manually, and it took long time before rescue parties and aid supplies could be sent. In this study, we propose how to find key words related to disasters automatically using a naïve Bayesian classifier and information value. Using this method, we show that it is possible to determine the current status of the affected area in real time. Additionally, we compare the naïve Bayesian classifier with a support vector machine (SVM) and show that the naïve Bayesian classifier can classify tweet data to the same accuracy level as that of the SVM.
Content from these authors
© 2014 Japan Industrial Management Association
Previous article
feedback
Top