A Materialized Approach to the Integration of XML Documents: the OSIX System

The data exchanged on the Web are of different nature from those treated by the classical database management systems; these data are called semi-structured data since they do not have a regular and static structure like data found in a relational database; their schema is dynamic and may contain missing data or types. Therefore, the needs for developing further techniques and algorithms to exploit and integrate such data, and extract relevant information for the user have been raised. In this paper we present the system OSIX (Osiris based System for Integration of XML Sources). This system has a Data Warehouse model designed for the integration of semi-structured data and more precisely for the integration of XML documents. The architecture of OSIX relies on the Osiris system, a DL-based model designed for the representation and management of databases and knowledge bases. Osiris is a viewbased data model whose indexing system supports semantic query optimization. We show that the problem of query processing on a XML source is optimized by the indexing approach proposed by Osiris.




References:
[1] S. Abiteboul, S. Cluet , G. Ferran and M-C. Rousset: "The Xyleme
Project". Gemo Repot 248, INRIA, 2001.
[2] H.Ahmad, S. Kermanshahani, A. Simonet and M. Simonet: "A View-
Based Approach to the Integration of Structured and Semi-structured
Data", IEEE International Baltic Conference on Databases and
Information Systems-Communication of Baltic DBIS , 2006.
[3] H.Ahmad, S. Kermanshahani, A. Simonet and M. Simonet: "Data
Warehouse based Approach to the Integration of Semi-structured Data",
WCMT The 1st International Workshop on Web-based Contents
Management Technologies, Suzhou, China 2009
[4] X. Baril: "Un modèle de vues pour l-intégration de sources de données
XML: VIMIX ". PHD thesis, Languedoc University of Science and
Techniques, 2003.
[5] C. Bornhovd: "MIX - A Representation Model for the Integration of
Web- Based Data". Technical report, Dep.CS, Darmstadt University of
Technology, Germany, 1998.
[6] M. Cannataro, S. Cluet, G. Tradigo, P. Veltri and D. Vodislav:" Using
views to query XML. In Encyclopedia of Database Technologies and
Applications", pp.729-735 , 2005.
[7] H. Garcia-Molina: "The TSIMMIS approach to mediation: Data Models
and Languages". Journal of Intelligent Information Systems. 8(2) pp
117-132, 1997.
[8] A. Halevy: "Answering queries using views: A survey". The VLBD
Journal, 10(4), 270-294. 2001.
[9] S. Kermanshahani: "Semi-Materialized Framework: a Hybrid Approach
to Data Integration", CSTST Student Workshop, Paris, October 2008.
[10] I. Manolescu, D. Florescu and D. Kossman: "Answering XML Queries
Over Heterogeneous Data Sources". In proceedings of the 27 th
International Conference on VLDB, 2001.
[11] M. Roger, A. Simonet, M. Simonet, "Bringing Together Description
Logics and Databases in an Object-Oriented Model", DEXA 2002,
Database and Expert System Applications, Toulouse, Sept. 2002.
[12] M. H. Scholl, C. Laasch, M. Tresch, "Updatable Views in Object-
Oriented Databases", Proc. 2nd DOOD conf., pp 187-198, Dec. 1991.
[13] I. Sebi : "Interrogation de Documents XML à Travers des Vues". PhD
thesis, EDITE, CEDRIS Laboratory, 2007.
[14] A. Simonet, M. Simonet, "Classement d-instance et Evaluation des
Requ├¬tes en Osiris", in BDA-96 : Bases de Données Avancées, Cassis,
France, pp 273-288, Aug. 1996.
[15] D. Stanat, D. McAllister: "Discrete Mathematics in Computer Science",
Prentice Hall, 1977.
[16] G. Wiederhold. "Mediators in the architecture of future information
systems". IEEE Computer Magazine, 25(3), 38-49, 1992.
[17] M.-C. Wu, A. P. Buchmann. "Research issues in data warehousing". In
Datebanksysteme in Buro, Technik and Wissenschaft, pp. 61-82, 1997.