Journal of Natural Language Processing
Online ISSN : 2185-8314
Print ISSN : 1340-7619
ISSN-L : 1340-7619
Estimation of Class Membership Probabilities by Using Multiple Classification Scores
KAZUKO TAKAHASHIHIROYA TAKAMURAMANABU OKUMURA
Author information
JOURNAL FREE ACCESS

2008 Volume 15 Issue 2 Pages 3-38

Details
Abstract
We propose a method for estimating class membership probabilities of a predictedclass in multiclass classification, using scores outputted by a classifier (classification scores), not only for the predicted class but also for other classes in a document classification.Class membership probabilities are important in many applications of document classification, in which multiclass classification is often applied.As a ethod for estimating class membership probabilities by using multiple scores, we propose two kinds of methods.One is generating an accuracy table with smoothing methods such as the moving average or a moving average with coverage, which indirectly estimates class membership probabilities by referring the accuracy table. The other is applying a logistic regression estimated parameters beforehand, which directly estimate these probabilities.Through experiments on two different datasets with both Support Vector Machines and Naive Bayes classifiers, we show that the use of multiple classification scores is much effective in both methods.We also show that the proposed smoothing method for the accuracy table works quite well, and that the method applying a logistic regression is more stable.Moreover, the estimated class membership probabilities by the proposed method are useful in the detection of the misclassified samples.
Content from these authors
© The Association for Natural Language Processing
Previous article Next article
feedback
Top