Journal of Information Processing
Online ISSN : 1882-6652
ISSN-L : 1882-6652
Zipf Distribution Model for Quantifying Risk of Re-identification from Trajectory Data
Hiroaki KikuchiKatsumi Takahashi
著者情報
ジャーナル フリー

2016 年 24 巻 5 号 p. 816-823

詳細
抄録

In this paper, we propose a new mathematical model for evaluating a given anonymized dataset that risks being re-identified. Many anonymization algorithms have been proposed in the area called privacy-preserving data publishing (PPDP), but, no anonymization algorithms are suitable for all scenarios because many factors, e.g., a requirement of accuracy, a domain of attributes, a size of dataset, and sensitivities of attributes, are involved. In order to address the issues of anonymization, we propose a new mathematical model based on the Zipf distribution. Our model is simple, but it fits well with the real distribution of trajectory data. We demonstrate the primary property of our model and we extend it to a more complex environment. Using our model, we define the theoretical bound for reidentification, which yields the appropriate optimal level for anonymization.

著者関連情報
© 2016 by the Information Processing Society of Japan
前の記事 次の記事
feedback
Top