Journal of Japan Society for Fuzzy Theory and Intelligent Informatics
Online ISSN : 1881-7203
Print ISSN : 1347-7986
ISSN-L : 1347-7986
Original Papers
Performance Comparisons of Fuzzy c-Means Classifier with Many Parameters on Variously-sized Training Data Sets
Hidetomo ICHIHASHIKatsuhiro HONDAAkira NOTSU
Author information
JOURNAL FREE ACCESS

2011 Volume 23 Issue 2 Pages 254-263

Details
Abstract
Fuzzy c-means based classifier (FCMC) is a classifier based on clustering approaches. The classification accuracy on training sets can easily be improved by increasing the number of clusters. On the other hand, the accuracy on test sets (i.e., the generalization capability) is not necessarily improved by increasing the number of clusters. Especially when the number of training samples is relatively small, the classifier not only over-fits the data, but also obtains incorrect covariance matrices and cluster centers, since the number of samples in each cluster becomes small. Hence, the test set accuracy deteriorates. The performance of FCMC with 2 clusters in each class when the number of training samples is less than 1000 was already reported. This paper reports the scaling behavior of FCMC by testing with variously-sized training samples. The number of clusters of FCMC is increased up to 8. The number is not very large but FCMC in this paper has many parameters. LibSVM is one of the widely known state of the art tools of SVM classifier for large-sized data sets. The classification accuracy on test set, training time and testing time (i.e., detection time) of FCMC are compared with LibSVM by varying the number of training samples.
Content from these authors
© 2011 Japan Society for Fuzzy Theory and Intelligent Informatics
Previous article Next article
feedback
Top