Multiversion Document Warehouse: An Approach to Multidimensional Analysis

Authors

  • Kaïs Khrouf MIR@CL Laboratory - University of Sfax - Tunisia
  • Jamel Feki MIR@CL Laboratory - University of Sfax - Tunisia
  • Chantal Soulé-Dupuy IRIT - University of Toulouse I – France

DOI:

https://doi.org/10.37380/jisib.v2i1.29

Keywords:

Multidimensional analysis of information, Heterogeneous documents, business intelligence

Abstract

Document warehouses allow the storage of selected and filtered heterogeneous documents, as well as their exploitation through multidimensional analyses techniques. However, the content of documents is dynamic and changes over time. In practice, decisional analysts may be interested in various versions of documents. The document warehouse should store and manage these versions. This paper presents an extended generic model for document warehouses allowing the management of multiversion documents. In addition, it proposes a multidimensional analysis of the document versions.

References

Ben-Messaoud I., Feki J. & Zurfluh G. (2010). Unification des structures des documents XML pour l’entreposage de documents : Atelier des Systèmes Décisionnels. (p. 1-12). Tunisie : Sfax.

Boussaid O., Ben-Messaoud R., Choquet R., & Anthoard S. (2006). XWarehousing: An XML-Based Approach for Warehousing ComplexData, (p. 39–54) 10th East European Conf. on Advances in Databases and Information Systems.

Dublin Core Metadata Initiative (DCMI): Dublin Core Metadata Element Set, Version 1.1, ISO Standard 15836, Downloaded September 2008 from http://dublincore.org/documents/dces/.

Inmon B. & Hackathorn R.D. (1994). Using the Data Warehouse. Wiley-QED Publication.

Kimball R. & Ross M. (2002). The Data Warehouse Toolkit (2 edition). New York: John Wiley & Sons.

Khrouf K., Ravat F. & Soulé-Dupuy C. (2003). Comparaison et fusion de structures logiques de documents semi-structurés. Ingénierie des Systèmes d’Information, 8, 127-151.

Khrouf K. & Soulé-Dupuy C. (2004). A Textual Warehouse Approach: a Web Data Repository, (p. 101-124). Idea Group Publishing

Khrouf K., Mbarki M., Ravat F., Soule-Dupuy C. & Valles-Parlangeau N. (2007). Les entrepôts de documents : gestion de versions. Colloque Veille Stratégique Scientifique & Technologique.

Nassis V., Rajugan R., Dillon T.S. & Rahayu J.W. (2004). Conceptual Design of XML Document Warehouses (p. 1-14). International Conference on Data Warehousing and Knowledge Discovery.

Park B-K., Han H., & Song I-Y. (2005). XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses, (p.32-42). International Conference on Data Warehousing and Knowledge Discovery.

Pérez-Martinez J.M., Berlanga-Llavori R.B., Aramburu-Cabo M.J., & Pedersen T.B. (2007). Contextualizing data warehouses with documents, Decision Support Systems. Elsevier.

Ravat F., Teste O., Tournier R., & Zurfluh G. (2008). Top_Keyword: an Aggregation Function for Textual Document OLAP, (p. 55-64). International Conference on Data Warehousing and Knowledge Discovery, Turin, Italiy.

Soutou C. (1999). Relational-objet sous oracle 8: Modélisation avec UML, Edition Eyrolles, Mars 1999.

Downloads

Issue

Section

Article