Tracing the lineage of view data in a warehousing environment

Article Properties
Abstract
Cite
Cui, Yingwei, et al. “Tracing the Lineage of View Data in a Warehousing Environment”. ACM Transactions on Database Systems, vol. 25, no. 2, 2000, pp. 179-27, https://doi.org/10.1145/357775.357777.
Cui, Y., Widom, J., & Wiener, J. L. (2000). Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems, 25(2), 179-227. https://doi.org/10.1145/357775.357777
Cui Y, Widom J, Wiener JL. Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems. 2000;25(2):179-227.
Journal Categories
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Science
Mathematics
Instruments and machines
Electronic computers
Computer science
Computer software
Science
Science (General)
Cybernetics
Information theory
Technology
Electrical engineering
Electronics
Nuclear engineering
Electronics
Computer engineering
Computer hardware
Description

Ever wondered where your data really comes from? This paper addresses the *view data lineage* problem in data warehousing: identifying the source data items that contribute to a specific data item in a materialized warehouse view. The research is aimed at developing algorithms for tracing lineage and mechanisms for ensuring consistent lineage tracing. A formal definition of the lineage problem is presented, and lineage tracing algorithms are developed for relational views with aggregation. Mechanisms for performing consistent lineage tracing in a multisource data warehousing environment are proposed. These results can form the basis of a tool that helps analysts to examine warehouse data, choose specific view tuples, and then “drill-through” to identify the exact source tuples that were used to derive the view tuples of interest. This enables a deeper understanding of data provenance and facilitates data quality management in warehousing environments.

Published in ACM Transactions on Database Systems, this paper aligns with the journal's focus on database management systems and data warehousing. The research on view data lineage is directly relevant to database research, adding to the journal's core topics.

Refrences
Citations
Citations Analysis
The first research to cite this article was titled An annotation management system for relational databases and was published in 2005. The most recent citation comes from a 2024 study titled An annotation management system for relational databases . This article reached its peak citation in 2018 , with 8 citations.It has been cited in 49 different journals, 10% of which are open access. Among related journals, the Proceedings of the VLDB Endowment cited this research the most, with 14 citations. The chart below illustrates the annual citation trends for this article.
Citations used this article by year