XML Data Management in Compressed Relational Database

XML is an important standard for data exchange and representation. Because relational database systems are mature, using them to store and query XML data offers practical advantages. However, storing XML in relations introduces obvious redundancy, which wastes disk space, bandwidth, and disk I/O when the XML data is queried. To store and query XML efficiently, it is therefore necessary to compress the XML data held in the relational database. This paper presents a compressed relational database technology that supports XML data. The original relational storage structure is well suited to XPath query processing, and the compression method preserves this property. In addition to traditional relational database techniques, the paper presents query processing techniques that operate on compressed relations and on the special structures used for XML, including techniques for processing XQuery in the compressed relational database.



