IEEJ Transactions on Electronics, Information and Systems
Online ISSN : 1348-8155
Print ISSN : 0385-4221
ISSN-L : 0385-4221
<Information Processing, Software>
Estimation of an Optimized Number of Topics by Consensus Soft Clustering using NMF
Takeru Yokoi
Author information
JOURNAL FREE ACCESS

2012 Volume 132 Issue 1 Pages 53-60

Details
Abstract
We propose here a novel approach to explore an optimized number of topics in a document set using consensus clustering based on Non-negative Matrix Factorization (NMF). It is useful to automatically decide the number of topics from a document set since various approaches to extract topics heuristically decide it. Consensus clustering merges multiple results of clustering so that it achieves a robust clustering. In this paper, assuming that a robust clustering is achieved by the optimized number of clusters, we have proposed a novel consensus soft clustering algorithm based on NMF and estimated an optimized number of topics with exploring a robust classification of documents into the topics.
Content from these authors
© 2012 by the Institute of Electrical Engineers of Japan
Previous article Next article
feedback
Top