Journal of The Japanese Society for Quality Control
Online ISSN : 2432-1044
Print ISSN : 0386-8230
Report of Research Group
Trend of the SQC Based on Data Clustering-Studies of Categorical Data Analysis Based on Principal Points for Multivariate Binary Distribution-
Haruka YAMASHITA
Author information
JOURNAL RESTRICTED ACCESS

2016 Volume 46 Issue 4 Pages 387-392

Details
Abstract
 In statistics,there are many studies on principal points.The concept of principal points which is proposed by Flury allows us to carry out such an analysis in a variety of applications and also properties of principal points have been studied. Although principal points of a multivariate distribution have widely studied,there is no discussion of principal points for a multivariate binary distribution.
 Yamashita and Suzuki have define the principal points for a multivariate binary distribution. Since principal points for a multivariate binary distribution are selected from multivariate binary region,there is a problem of the amount of calculation,since this problem is an NP-hard problem. Yamashita and Suzuki have shown the submodularity of principal points for a multivariate binary distribution and proposed an approximation method based on the greedy algorithm.Using the property of submodularity of principal points for a multivariate binary distribution,the accuracy of approximations is at least(1-1/e)times the optimal solution proved by Nemhauser et al.Finally, we show the result of an application of the methods to questionnaire survey data.
Content from these authors
© 2016 The Japanese Society for Quality Control
Previous article Next article
feedback
Top