Proceedings of the Fuzzy System Symposium
30th Fuzzy System Symposium
Session ID : WB1-1

A study of fuzzy clustering for mixed numerical and categorical incomplete data II
Takashi Furukawa*, Shin-ichi Ohnishi, Takahiro Yamanoi
CONFERENCE PROCEEDINGS FREE ACCESS

Abstract
Fuzzy c-means (FCM) clustering is normally applied to numerical data. However, most data stored in databases are a mixture of categorical and numerical values. Moreover, clustering methods to date have been developed to analyze only complete data. Data-intensive classification systems sometimes encounter data sets that contain one or more missing feature values (incomplete data), and traditional clustering methods cannot be applied to such data. We therefore study clustering methods that can handle mixed numerical and categorical incomplete data. In this paper, we propose algorithms that combine an imputation method for missing categorical data with distances defined between numerical data that contain missing values. Finally, an experiment on real data shows that the proposed method is more effective than clustering without imputation as the missing ratio increases.
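The abstract does not state which distance or imputation rule the proposed algorithms use, so the Python sketch below is only an illustration of the two ingredients it names, not the authors' method. It assumes a partial-distance rule for numerical features (squared Euclidean distance over the observed features, rescaled by the ratio of total to observed features) and most-frequent-category imputation for categorical features. The function names `partial_sq_distance` and `impute_mode` are hypothetical.

```python
import math
from collections import Counter


def partial_sq_distance(x, v):
    """Squared distance between record x and cluster center v.

    Assumption: missing numerical values in x are encoded as NaN.
    Only observed features contribute, and the sum is rescaled by
    (total features / observed features).
    """
    observed = [(a, b) for a, b in zip(x, v) if not math.isnan(a)]
    if not observed:
        raise ValueError("record has no observed numerical features")
    total = sum((a - b) ** 2 for a, b in observed)
    return total * len(x) / len(observed)


def impute_mode(column, missing=None):
    """Fill missing categorical entries with the most frequent category.

    Assumption: missing categorical values are encoded as None.
    """
    observed = [c for c in column if c is not missing]
    if not observed:
        raise ValueError("column has no observed categories")
    mode = Counter(observed).most_common(1)[0][0]
    return [mode if c is missing else c for c in column]


# Usage: one record with a missing numerical feature, and one
# categorical column with a missing entry.
d = partial_sq_distance([1.0, float("nan"), 3.0], [0.0, 5.0, 1.0])
filled = impute_mode(["a", None, "b", "a"])
```

In an FCM-style loop, a distance of this kind would stand in for the ordinary squared Euclidean distance when memberships are updated. How the paper combines it with the categorical part is not described in the abstract.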
© 2014 Japan Society for Fuzzy Theory and Intelligent Informatics