A Materialized Approach to the Integration of XML Documents: the OSIX System
Data exchanged on the Web differ in nature from the data handled
by classical database management systems. They are called
semi-structured data because they lack the regular, static
structure of data in a relational database: their schema is
dynamic, and data or types may be missing. This has created a need
for further techniques and algorithms to exploit and integrate such
data and to extract the information relevant to the user. In this
paper we present OSIX (Osiris based System for Integration of XML
Sources), a system with a data warehouse model designed for the
integration of semi-structured data and, more precisely, of XML
documents. The architecture of OSIX relies on the Osiris system, a
model based on description logics (DL) and designed for the
representation and management of databases and knowledge bases.
Osiris is a view-based data model whose indexing system supports
semantic query optimization. We show that query processing on an
XML source is optimized by the indexing approach proposed by
Osiris.
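
The abstract does not detail how OSIX or Osiris work internally, so the following is only a minimal sketch of the general idea it names: irregular XML records are materialized once into a warehouse, each record is classified into views at load time, and a query on a view is answered from the resulting index rather than by rescanning the source. The sample document, the view names (Adult, Minor, AgeUnknown) and the functions materialize, build_view_index and query are all hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical XML source: records share a tag but not a structure.
SOURCE = """
<people>
  <person id="p1"><name>Ann</name><age>34</age></person>
  <person id="p2"><name>Bob</name></person>
  <person id="p3"><name>Eve</name><age>17</age><email>eve@example.org</email></person>
</people>
"""

# Hypothetical views: each one is a predicate over a materialized record.
VIEWS = {
    "Adult": lambda r: r.get("age") is not None and int(r["age"]) >= 18,
    "Minor": lambda r: r.get("age") is not None and int(r["age"]) < 18,
    "AgeUnknown": lambda r: r.get("age") is None,
}


def materialize(xml_text):
    """Copy each <person> into the warehouse as a dict; absent children stay absent."""
    warehouse = {}
    for person in ET.fromstring(xml_text).iter("person"):
        warehouse[person.get("id")] = {child.tag: child.text for child in person}
    return warehouse


def build_view_index(warehouse):
    """Classify every record once, at load time, and index record ids by view."""
    index = defaultdict(set)
    for rid, record in warehouse.items():
        for view, holds in VIEWS.items():
            if holds(record):
                index[view].add(rid)
    return index


def query(warehouse, index, view):
    """Return the members of a view using the index, without re-testing any record."""
    return {rid: warehouse[rid] for rid in sorted(index[view])}


warehouse = materialize(SOURCE)
index = build_view_index(warehouse)
adults = query(warehouse, index, "Adult")
```

In this sketch the cost of evaluating the view predicates is paid once, when the warehouse is loaded; a later query on a view is a dictionary lookup. A record with a missing age is not rejected but classified into its own view, which is one simple way to accommodate the missing data that the abstract attributes to semi-structured sources.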
@article{"International Journal of Information, Control and Computer Sciences:49882",
  author = "H. Ahmad and S. Kermanshahani and A. Simonet and M. Simonet",
  title = "A Materialized Approach to the Integration of XML Documents: the OSIX System",
  abstract = "Data exchanged on the Web differ in nature from the data handled
by classical database management systems. They are called
semi-structured data because they lack the regular, static
structure of data in a relational database: their schema is
dynamic, and data or types may be missing. This has created a need
for further techniques and algorithms to exploit and integrate such
data and to extract the information relevant to the user. In this
paper we present OSIX (Osiris based System for Integration of XML
Sources), a system with a data warehouse model designed for the
integration of semi-structured data and, more precisely, of XML
documents. The architecture of OSIX relies on the Osiris system, a
model based on description logics (DL) and designed for the
representation and management of databases and knowledge bases.
Osiris is a view-based data model whose indexing system supports
semantic query optimization. We show that query processing on an
XML source is optimized by the indexing approach proposed by
Osiris.",
  keywords = "Data integration, semi-structured data, views, XML.",
  volume = "3",
  number = "4",
  pages = "879-7",
}