ABSTRACT
In this demonstration we present the Perm provenance management system (PMS). Perm is capable of computing, storing and querying provenance information for the relational data model. Provenance is computed by using query rewriting techniques to annotate tuples with provenance information. Thus, provenance data and provenance computations are represented as relational data and queries and, hence, can be queried, stored and optimized using standard relational database techniques. This demo shows the complete Perm system and lets attendants examine in detail the process of query rewriting and provenance retrieval in Perm, the most complete data provenance system available today. For example, Perm supports lazy and eager provenance computation, external provenance and various contribution semantics.
- P. Buneman, S. Khanna, and W. C. Tan. Why and where: A characterization of data provenance. In ICDT'01, pages 316--330, 2001. Google ScholarDigital Library
- Y. Cui, J. Widom, and J. L. Wiener. Tracing the lineage of view data in a warehousing environment. ACM Transactions on Database Systems, 25(2):179--227, 2000. Google ScholarDigital Library
- B. Glavic and G. Alonso. Perm: Processing provenance and data on the same data model through query rewriting. In ICDE'09, 2009. Google ScholarDigital Library
- B. Glavic and G. Alonso. Provenance for nested subqueries. In EDBT'09, 2009. Google ScholarDigital Library
- B. Glavic and K. R. Dittrich. Data provenance: A categorization of existing approaches. In BTW'07, pages 227--241, 2007.Google Scholar
Index Terms
- The perm provenance management system in action
Recommendations
Perm: Processing Provenance and Data on the Same Data Model through Query Rewriting
ICDE '09: Proceedings of the 2009 IEEE International Conference on Data EngineeringData provenance is information that describes how a given data item was produced. The provenance includes source and intermediate data as well as the transformations involved in producing the concrete data item. In the context of a relational databases, ...
Provenance management in Swift
The Swift parallel scripting language allows for the specification, execution and analysis of large-scale computations in parallel and distributed environments. It incorporates a data model for recording and querying provenance information. In this ...
Prospective and Retrospective Provenance Collection in Scientific Workflow Environments
SCC '10: Proceedings of the 2010 IEEE International Conference on Services ComputingProvenance, a record of the derivation history of scientific results, is critical for scientific workflows to support reproducibility, result interpretation, and problem diagnosis. Both prospective provenance, which captures an abstract workflow ...
Comments