Approaches and Schemes for Storing DTDIndependent XML Data in Relational Databases

The volume of XML data exchange is explosively increasing, and the need for efficient mechanisms of XML data management is vital. Many XML storage models have been proposed for storing XML DTD-independent documents in relational database systems. Benchmarking is the best way to highlight pros and cons of different approaches. In this study, we use a common benchmarking scheme, known as XMark to compare the most cited and newly proposed DTD-independent methods in terms of logical reads, physical I/O, CPU time and duration. We show the effect of Label Path, extracting values and storing in another table and type of join needed for each method-s query answering.




References:
[1] M. Fernndez, Y. Kadiyska, D. Suciu, A. Morishima and W. C. Tan,
"Silkroute: A framework for publishing relational data in xml,". ACM
Trans. Database Syst. Vol .27, No. 4, pp. 438-493, 2002.
[2] F. Yergeau, T. Bray, J. Paoli, C. M. Sperberg-McQueen and E. Maler,
"Extendible Markup Language (XML),". W3C Recommendation, 2004.
Available at http://www.w3.org/XML/
[3] P. O-Neil, E. O-Neil, S. Pal, I. Cseri and G. Schaller, "ORDPATHs:
Insert-Friendly XML Node Labels,". Proc. ACM SIGMOD. France pp.
903-908, 2004.
[4] T. Böhme and E. Rahm, "Supporting Efficient Streaming and Insertion
of XML Data in RDBMS,". Proc. DIWeb. Latvia, pp. 70-81, 2004.
[5] H. Lu, J. Xu Yu, G. Wang, S. Zheng, H. Jiang, G. Yu and A. Zhou,
"What makes the differences: benchmarking XML database
implementations,". ACM Trans. Internet Techn. Vol. 5, No. 1, pp 154-
194, 2005.
[6] D. Florescu and D. Kossmann, "Storing and Querying XML Data using
an RDMBS,". IEEE Data Eng. Bull. Vol. 22, No. 3, pp. 27-34, 1999.
[7] M. Yoshikawa, T. Amagasa, T. Shimura, and S. Uemura, "XRel: a pathbased
approach to storage and retrieval of XML documents using
relational databases,". ACM Trans. Internet Techn. Vol. 1, No. 1, pp.
110-141, 2001.
[8] H. Jiang, H. Lu, W. Wang and J. Xu Yu, "Path Materialization
Revisited: An Efficient Storage Model for XML Data,". Proc. of ACM
Australasian Database Tech. Conf. Australia, pp. 85-94, 2002.
[9] H. Jiang, H. Lu, W. Wang and J. Xu Yu, "XParent: An Efficient
RDBMS-Based XML Database System,". Proc. of IEEE ICDE. USA,
pp. 335-336, 2002.
[10] E. Cohen, H. Kaplan and T. Milo, "Labeling Dynamic XML Trees,".
Proc. of ACM PODS. USA, pp. 271-281, 2002.
[11] S. Pal, I. Cseri, O. Seeliger, G. Schaller, L. Giakoumakis and V. Zolotov,
"Indexing XML Data Stored in a Relational Database,". Proc. of VLDB.
Canada, pp. 1134-1145, 2004.
[12] S. Abiteboul, D. Quass, J. McHugh, J. Widom, J. Wiener, "The Lorel
Query Language for Semistructured Data,". Int. J. on Digital Libraries.
Vol. 1, No. 1, pp. 68-88, 1997.
[13] A. Deutsch, M. Fernandez, D. Florescu, A. Levy and D. Suciu "A Query
Language for XML,". Proc. of WWW. Canada, pp. 1155-1169, 1999.
[14] D. Chamberlin, J. Robie and D. Florescu, "Quilt: An XML Query
Language for Heterogeneous Data Sources,". Proc of WebDB (LNCS).
USA, pp. 53-62, 2000.
[15] J. Clark and S. DeRose, "XML Path Language (XPath) Version 1.0,".
W3C Recommendation, 1999. Available at
http://www.w3.org/TR/xpath.
[16] A. Bonifati and S. Ceri, "Comparative Analysis of Five XML Query
Languages,". ACM SIGMOD Record. Vol. 29, No. 1, pp. 68-79, 2000.
[17] D. Lee and W. W. Chu, "Comparative Analysis of Six XML Schema
Languages,". ACM SIGMOD Record. Vol 29, No. 3, pp. 76-87, 2000.
[18] A. Schmidt, M. Kersten, M. Windhouwer and F. Waas, "Efficient
Relational Storage and Retrieval of XML Documents,". Proc. of WebDB
(LNCS). USA, pp. 137-150, 2000.
[19] T. Böhme and E. Rahm, "Multi-User Evaluation of XML Data
Management Systems with XMach-1,". Proc. of EEXTT (LNCS).
Germany, pp. 148-159, 2003.
[20] A. Schmidt, F. Waas, M. Kersten, M. Carey, I. Manolescu, and R. Busse,
"Xmark: A Benchmark for XML Data Management,". Proc. of VLDB
China, pp. 974-985, 2002.
[21] S. Boag, D. Chamberlin, M. F. Fernández , D. Florescu, J. Robie, J.
Siméon, "W3C XML Query (XQuery),". W3C Candidate
Recommendation. 2005. Available at http://www.w3.org/XML/Query/
[22] M. Emadi, M. Rahgozar, A. Ardalan, A. Kazerani and M.M. Arian, "A
Comparative Study of DTD-Independent XML Data Storage
Approaches,". 11th International CSI Computer Conference (CSICC'06).
Iran, pp. 624-628, 2006.