ABSTRACT
Data citation is an interesting computational challenge, whose solution draws on several well-studied problems in database theory: query answering using views, and provenance. We describe the problem, suggest an approach to its solution, and highlight several open research problems, both practical and theoretical.
- Out of Cite, Out of Mind: The Current State of Practice, Polocy, and Technology for the Citation of Data, volume 12. CODATA-ICSTI Task Group on Data Citation Standards and Practices, September 2013.Google Scholar
- S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarDigital Library
- F. N. Afrati, C. Li, and J. D. Ullman. Using views to generate efficient evaluation plans for queries. J. Comput. Syst. Sci., 73(5):703--724, 2007. Google ScholarDigital Library
- P. Buneman, S. Davidson, and J. Frew. Why data citation is a computational problem. CACM, 59, 2016. Google ScholarDigital Library
- J. Cheney, L. Chiticariu, and W. C. Tan. Provenance in databases: Why, how, and where. Foundations and Trends in Databases, 1(4):379--474, 2009. Google ScholarDigital Library
- S. B. Davidson, D. Deutch, T. Milo, and G. Silvello. A model for fine-grained data citation. In CIDR 2017, 8th Biennial Conference on Innovative Data Systems Research, Chaminade, CA, USA, January 8-11, 2017, Online Proceedings, 2017.Google Scholar
- FORCE-11. Data Citation Synthesis Group: Joint Declaration of Data Citation Principles. FORCE11, San Diego, CA, USA, 2014.Google Scholar
- T. J. Green, G. Karvounarakis, and V. Tannen. Provenance semirings. In PODS, pages 31--40, 2007. Google ScholarDigital Library
- A. Y. Halevy. Answering queries using views: A survey. VLDB J., 10(4):270--294, 2001. Google ScholarDigital Library
- L. Popa and V. Tannen. An equational chase for path-conjunctive queries, constraints, and views. In Database Theory - ICDT '99, 7th International Conference, Jerusalem, Israel, January 10-12, 1999, Proceedings., pages 39--57, 1999. Google ScholarDigital Library
- S. Pröll and A. Rauber. Scalable data citation in dynamic, large databases: Model and reference implementation. In Proc. of the 2013 IEEE International Conference on Big Data, pages 307--312, 2013.Google ScholarCross Ref
Index Terms
- Data Citation: A Computational Challenge
Recommendations
Data Citation: Giving Credit Where Credit is Due
SIGMOD '18: Proceedings of the 2018 International Conference on Management of DataAn increasing amount of information is being published in structured databases and retrieved using queries, raising the question of how query results should be cited. Since there are a large number of possible queries over a database, one strategy is to ...
Data Citation Index: Promoting attribution, use and discovery of research data
NFAIS 2014 and ICSTI 2014The Data Citation Index forms part of the Thomson Reuters Web of Science research platform. Thomson Reuters makes partnerships with data repositories to create indexed data object metadata records integrated within the larger platform. The core features ...
Querying data provenance
SIGMOD '10: Proceedings of the 2010 ACM SIGMOD International Conference on Management of dataMany advanced data management operations (e.g., incremental maintenance, trust assessment, debugging schema mappings, keyword search over databases, or query answering in probabilistic databases), involve computations that look at how a tuple was ...
Comments