Approaches and Schemes for Storing DTD-Independent XML Data in Relational Databases
The volume of XML data exchange is increasing explosively,
and efficient mechanisms for XML data management are vital.
Many storage models have been proposed for storing
DTD-independent XML documents in relational database
systems. Benchmarking is the best way to highlight the pros and cons
of the different approaches. In this study, we use a common
benchmarking scheme, XMark, to compare the most cited and the newly
proposed DTD-independent methods in terms of logical reads,
physical I/O, CPU time and duration. We show the effect of the Label
Path, of extracting values and storing them in a separate table, and of
the type of join each method needs to answer queries.
@article{"International Journal of Information, Control and Computer Sciences:62275",
  author = "Mehdi Emadi and Masoud Rahgozar and Adel Ardalan and Alireza Kazerani and Mohammad Mahdi Ariyan",
  title = "Approaches and Schemes for Storing DTD-Independent XML Data in Relational Databases",
  abstract = "The volume of XML data exchange is increasing explosively, and efficient mechanisms for XML data management are vital. Many storage models have been proposed for storing DTD-independent XML documents in relational database systems. Benchmarking is the best way to highlight the pros and cons of the different approaches. In this study, we use a common benchmarking scheme, XMark, to compare the most cited and the newly proposed DTD-independent methods in terms of logical reads, physical I/O, CPU time and duration. We show the effect of the Label Path, of extracting values and storing them in a separate table, and of the type of join each method needs to answer queries.",
  keywords = "XML Data Management, XPath, DTD-Independent XML Data.",
  volume = "2",
  number = "7",
  pages = "2546-6",
}