Behaviormetrika
Online ISSN : 1349-6964
Print ISSN : 0385-7417
ISSN-L : 0385-7417
Articles
COLLABORATIVE FILTERING FOR MASSIVE DATASETS BASED ON BAYESIAN NETWORKS
Maomi UenoTakahiro Yamazaki
著者情報
ジャーナル 認証あり

2007 年 35 巻 2 号 p. 137-158

詳細
抄録
This paper proposes a collaborative filtering method for massive datasets that is based on Bayesian networks. We first compare the prediction accuracy of four scoring-based learning Bayesian networks algorithms (AIC, MDL, UPSM, and BDeu) and two conditional-independence-based (CI-based) learning Bayesian networks algorithms (MWST, and Polytree-MWST) using actual massive datasets. The results show that (1) for large networks, the scoring-based algorithms have lower prediction accuracy than the CI-based algorithms and (2) when the scoring-based algorithms use a greedy search to learn a large network, algorithms which make a lot of arcs tend to have less prediction accuracy than those that make fewer arcs. Next, we propose a learning algorithm based on MWST for collaborative filtering of massive datasets. The proposed algorithm employs a traditional data mining technique, the “a priori” algorithm, to quickly calculate the amount of mutual information, which is needed in MWST, from massive datasets. We compare the original MWST algorithm and the proposed algorithm on actual data, and the comparison shows the effectiveness of the proposed algorithm.
著者関連情報
© 2007 The Behaviormetric Society
前の記事 次の記事
feedback
Top