2025 Volume 33 Pages 646-656
Various models for evaluating anonymity have been proposed so far. Among them, k-anonymity is widely known as a typical anonymity measure, guaranteeing that at least k individuals in a database have the same values. Unfortunately, it is difficult to create highly useful anonymized data satisfying k-anonymity for high-dimensional data because of the curse of dimensionality. Several approaches relaxing k-anonymity have been proposed, such as km-anonymity, to overcome the problem. On the other hand, we have another privacy protection metric, developed by Dwork et al., called differential privacy. However, the complete protection index for differential privacy, i.e., the level of noise that can satisfy the desired privacy, has not been clarified. This paper shows relationships between km-anonymity and differential privacy under sampling, proposed by Li et al., that is, a weak notion of differential privacy. Numerical experiments are then performed to give relations among the parameters of km-anonymity and differential privacy under sampling. These experiments also show relationships between k-anonymity and km-anonymity as k-anonymity is a special case of km-anonymity in some sense.