セミハードクラスタリングとその識別器への応用-大量データでのSVMとの比較-

市橋 秀友; 野津 亮; 本多 克宏

doi:10.14864/fss.26.0.94.0

26th Fuzzy System Symposium

Session ID : MG4-4

DOI https://doi.org/10.14864/fss.26.0.94.0

Conference information

Host: Japan Society for Fuzzy Theory and Intelligent Informatics (SOFT)

Semi-hard clustering with application to classifier design - Comparisons with SVM on large data sets

*Hidetomo Ichihashi, Akira Notsu, Katsuhiro Honda

Author information

Keywords: Clustering, Classifier, Support Vector Machine

CONFERENCE PROCEEDINGS FREE ACCESS

Details

Abstract

This paper discusses the application of the fuzzy c-means based classifier (FCMC) to large scale data sets. Large scale data sets contain a huge number of samples (patterns). The number can be reduced by sampling, but the accuracy of the classifier on the test set may deteriorate, and the accuracy on the available data worsens. The FCM classifier uses covariance matrices whose size does not increase with the number of training samples, and the training time is proportional to the number of samples. By comparing the performance of FCMC with the support vector machine (SVM) classifier, which is known as one of the highest performance classifiers, the paper shows that FCMC nearly attains the accuracy of SVM and surpasses it in the training time and the testing time.

Corresponding author

Register with J-STAGE for free!