SCIS & ISIS
SCIS & ISIS 2008
Session ID : FR-E3-3
Conference information

Variations of Fuzzy Clustering for Cooccurrence Matrix and Their Application to Text Analysis
*Chi-Hyon OhKatsuhiro HondaHidetomo Ichihashi
Author information
CONFERENCE PROCEEDINGS FREE ACCESS

Details
Abstract
In this study, we compare several variations of Fuzzy Clustering for Cooccurrence Matrix (FCCM) in applications to text analysis. The FCCM was proposed to partition individuals and items of the cooccurrence matrix by maximizing the degree of aggregation of each cluster. The total amount of products of cooccurrence variables and memberships for individuals and items is regarded as the degree of aggregation. Several variations of FCCM which employ two types of constraints for memberships i.e. probabilistic and possibilistic and two types of regularizations to obtain fuzzy clusters, entropy maximization and K-L information, exist. In the experiments, we apply our methods to a data set which represents frequency of keywords appearing in text documents and compare the results of each clustering method. They are used to find mutual relation (or co-occurrence structure) among text documents and keywords in the applications. Those tasks are known as text mining.
Content from these authors
© 2008 Japan Society for Fuzzy Theory and Intelligent Informatics
Previous article Next article
feedback
Top