SCIS & ISIS
SCIS & ISIS 2010
セッションID: FR-F3-3
会議情報
Classification of Advertising Spam Reviews Using Latent Semantic Analysis
Insuk ParkHanhoon Kang*SeongJoon Yoo
著者情報
会議録・要旨集 フリー

詳細
抄録
In this study, methods to classify advertising reviews from shopping mall reviews are suggested. Advertising reviews are mostly written by companies and contain advertising contents. There are a few studies regarding the classification of opinion spam documents, which is very rare in foreign studies; however, there are no studies that classify advertising reviews from Korean reviews. In this study, the Naïve Bayes Classifier was used to classify review documents and the POS-Tagging and bigram methods were used to extract specific words. The frequency calculation methods for the probability value of specific words were: 1) the general number of appearances of words 2) the frequency calculation of specific words through the suggested Latent Semantic Analysis (LSA), and by recalculating the result from 1) in 2), the performances of each method were compared. As a result, the methods from 2) showed 88.4% accuracy which is 5.3% higher than 83.1% which was the previous result from using the POS-Tagging+Bigram method. Therefore, it was proved that the method suggested in this study is effective at classifying or extracting advertising reviews from Korean product review documents. I
著者関連情報
© 2010 Japan Society for Fuzzy Theory and Intelligent Informatics
前の記事 次の記事
feedback
Top