SCIS & ISIS
SCIS & ISIS 2010
Session ID : FR-F3-3
Conference information
Classification of Advertising Spam Reviews Using Latent Semantic Analysis
Insuk ParkHanhoon Kang*SeongJoon Yoo
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract
In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding the classification of opinion spam documents, which is very rare in foreign studies; however, there are no studies that classify advertising reviews from Korean reviews. In this study, the Naïve Bayes Classifier was used to classify review documents and the POS-Tagging and bigram methods were used to extract specific words. The frequency calculation methods for the probability value of specific words were: 1) the general number of appearances of words 2) the frequency calculation of specific words through the suggested Latent Semantic Analysis (LSA), and by recalculating the result from 1) in 2), the performances of each method were compared. As a result, the methods from 2) showed 88.4% accuracy which is 5.3% higher than 83.1% which was the previous result from using the POS-Tagging+Bigram method. Therefore, it was proved that the method suggested in this study is effective at classifying or extracting advertising reviews from Korean product review documents. I
Content from these authors
© 2010 Japan Society for Fuzzy Theory and Intelligent Informatics
Previous article Next article
feedback
Top