IPSJ Online Transactions
Online ISSN : 1882-6660
ISSN-L : 1882-6660
Dividing Huge XML Trees Using the m-bridge Technique over One-to-one Corresponding Binary Trees
Takayuki Kawamura, Kiminori Matsuzaki

2014 Volume 7 Pages 111-121

Abstract
Tree data such as XML trees are steadily growing in size. Parallel and distributed processing is a promising way of dealing with such large data, but the data must first be divided. Since computation over trees often depends on relationships between parents and children and/or among siblings, the division should take these relationships into account. One technique for dividing trees is the “m-bridge,” which can be computed easily for trees of any shape. However, division with the m-bridge technique is sometimes unsatisfactory for shallow XML trees. In this study, we propose a tree division method for XML trees in which we apply the m-bridge technique to a one-to-one corresponding binary tree. We implement the tree division algorithm using the Simple API for XML (SAX) parser. An important feature of our algorithm is that it transforms and divides XML trees in the order in which the SAX parser reads them. We carried out experiments and discuss the properties of the proposed tree division algorithm. In addition, we discuss how the divided trees can be used, with query examples.
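
As a rough illustration of the transformation step mentioned in the abstract, the following Python sketch builds a binary tree from XML in SAX event order. It assumes that the "one-to-one corresponding binary tree" is the common first-child/next-sibling encoding; the abstract does not spell this out, so this is an assumption. The names BinaryNode, BinaryTreeBuilder, to_binary_tree, and height are hypothetical, and the m-bridge division itself is not shown.

```python
import xml.sax


class BinaryNode:
    """Binary-tree node (assumed first-child/next-sibling encoding)."""

    def __init__(self, tag):
        self.tag = tag
        self.left = None   # first child in the XML tree
        self.right = None  # next sibling in the XML tree


class BinaryTreeBuilder(xml.sax.ContentHandler):
    """Builds the binary tree incrementally, in the order SAX reports elements."""

    def __init__(self):
        super().__init__()
        self.root = None
        self.open_elements = []  # stack of elements whose end tag is still pending
        self.last_child = []     # last_child[i]: newest child of open_elements[i]

    def startElement(self, name, attrs):
        node = BinaryNode(name)
        if not self.open_elements:
            self.root = node
        else:
            previous = self.last_child[-1]
            if previous is None:
                # First child of the enclosing element: becomes its left child.
                self.open_elements[-1].left = node
            else:
                # Later child: becomes the right child of its previous sibling.
                previous.right = node
            self.last_child[-1] = node
        self.open_elements.append(node)
        self.last_child.append(None)

    def endElement(self, name):
        self.open_elements.pop()
        self.last_child.pop()


def to_binary_tree(xml_text):
    """Parse an XML string and return the root of the binary tree."""
    handler = BinaryTreeBuilder()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.root


def height(node):
    """Number of nodes on the longest root-to-leaf path (recursive, small inputs only)."""
    if node is None:
        return 0
    return 1 + max(height(node.left), height(node.right))


if __name__ == "__main__":
    # A shallow XML tree: one root with four children.
    shallow = to_binary_tree("<r><x/><x/><x/><x/></r>")
    print(height(shallow))
```

Under this encoding, a root with many children turns into a long chain of right links, so a shallow, wide XML tree becomes a deep binary tree. That is one plausible reading of why dividing the binary tree could behave differently from dividing the original tree.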
© 2014 by the Information Processing Society of Japan