skip to main content
research-article

Data Transparency with Blockchain and AI Ethics

Published:21 August 2019Publication History
Skip Abstract Section

Abstract

Providing a 360° view of a given data item especially for sensitive data is essential toward not only protecting the data and associated privacy but also assuring trust, compliance, and ethics of the systems that use or manage such data. With the advent of General Data Protection Regulation, California Data Privacy Law, and other such regulatory requirements, it is essential to support data transparency in all such dimensions. Moreover, data transparency should not violate privacy and security requirements. In this article, we put forward a vision for how data transparency would be achieved in a de-centralized fashion using blockchain technology.

References

  1. Guruduth Banavar. 2016. Learning to trust artificial intelligence systems Accountability, compliance and ethics in the age of smart machines. Accessed on Oct. 12, 2018 from https://www.alain-bensoussan.com/wp-content/uploads/2017/06/34348524.pdf.Google ScholarGoogle Scholar
  2. Elisa Bertino. 2015. Data trustworthiness—Approaches and research challenges. In Data Privacy Management, Autonomous Spontaneous Security, and Security Assurance —9th International Workshop, DPM 2014, 7th International Workshop, SETOP 2014, and 3rd International Workshop, QASA 2014, Revised Selected Papers, Lecture Notes in Computer Science, Vol. 8872. Springer, 17--25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Elisa Bertino, Shawn Merrill, Alina Nesen, and Christine Utz. 2019. Redefining data transparency—A multi-dimensional approach. IEEE Computer 52 (2019), 16--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. IBM THINK Blog. 2017a. Artificial Intelligence and Machine Learning: Ethics, Governance and Compliance Resources. Accessed on Sept. 10, 2018 from https://fpf.org/artificial-intelligence-and-machine-learning-ethics-governance-and-compliance-resources/.Google ScholarGoogle Scholar
  5. IBM THINK Blog. 2017b. Transparency and Trust in the Cognitive Era. Accessed on Sept. 15, 2018 from https://www.ibm.com/blogs/think/2017/01/ibm-cognitive-principles/.Google ScholarGoogle Scholar
  6. Jeff Collman and Sorin Matei. 2016. Ethical Reasoning in Big Data: An Exploratory Analysis (1st ed.). Springer. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Izzat El Hajj, Alexander Merritt, Gerd Zellweger, Dejan Milojicic, Reto Achermann, Paolo Faraboschi, Wen-mei Hwu, Timothy Roscoe, and Karsten Schwan. 2016. SpaceJMP: Programming with multiple virtual address spaces. SIGOPS Oper. Syst. Rev. 50, 2 (March 2016), 353--368. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. GDPR. 2018. The EU General Data Protection Regulation (GDPR) is the most important change in data privacy regulation in 20 years. Accessed on Sept. 10, 2018 from https://eugdpr.org/.Google ScholarGoogle Scholar
  9. IEEE. 2018. The IEEE Global Initiative on Ethics of Autonomous and Intelligent Systems. Accessed on Nov. 20, 2018 from https://standards.ieee.org/industry-connections/ec/autonomous-systems.html.Google ScholarGoogle Scholar
  10. Besiki Stvilia, Les Gasser, Michael B. Twidale, and Linda C. Smith. 2007. A framework for information quality assessment. J. Am. Soc. Inf. Sci. Technol. (JASIST) 58, 12 (Oct. 2007), 1720--1733. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Columbia University. 2018. Columbia University and IBM Establish New Center to Accelerate Innovation in Blockchain and Data Transparency. Accessed on Oct. 07, 2018 from https://engineering.columbia.edu/press-releases/columbia-ibm-center-blockchain-data-transparency.Google ScholarGoogle Scholar
  12. Wikipedia. 2018. Facebook Cambridge Analytica Data Scandal. Accessed on Sept. 10, 2018 from https://en.wikipedia.org/wiki/Facebook.Google ScholarGoogle Scholar

Index Terms

  1. Data Transparency with Blockchain and AI Ethics

        Recommendations

        Reviews

        Fjodor J. Ruzic

        With the ever-growing use of digital technology, especially the use of big data technologies and machine learning, questions related to the proper use of data collected from social networks led to the introduction of data transparency. Hence, the authors open the discussion on crucial problems of data use "in many application areas ... dealing with personally identifiable information." There are many cases where personal data theft has led to privacy breaches and the possibility of identity theft. This fact asks for appropriate solutions, and the authors successfully aim their study at data transparency, knowing that it is not easy to achieve. Many cybersecurity techniques have been implemented, but our almost fully networked business, social, and personal lives need a robust and decentralized approach. This study recognizes blockchain's potential for a variety of applications, as it can "provide real-time data visibility and transparency while keeping private information secure" [1]. Many of us have seen the slogan "we value your privacy" (or something similar). The blockchain technology appears to make data transparent: Network participants have the ability to access holdings and transactions of public addresses using a block explorer, used to search the blocks of a blockchain, their contents, and their relevant details. [2] Additionally, trust and privacy are secured by national and international regulations. This technology is recognized as the right tool for storing personal information since it eliminates the need to give consent to use personal data. In this way, user identities will not be duplicated and distributed to service providers, social media companies, or online stores. In the work on finding suitable frameworks and approaches for data transparency, the authors introduce, in an excellent way, the term "data transparency dimensions." It is explained with brief descriptions of several key data transparency dimensions. For example, they explore how artificial intelligence (AI) and machine learning ethics influence data transparency in accordance with commonly accepted ethical principles. The introduction of data ethics through AI ethics-driven transparency is a really successful approach, giving business and social communities new possibilities for data transparency. In the networked decentralized infrastructure there is another infrastructure, that is, blockchain technology as an infrastructure for transparency. Hence, blockchain, a decentralized technology, "does not rely on a central point of control" and instead "relies on consensus protocols across a network of nodes to confirm any transaction performed on the network" [2]. Thus, "participants on the network all must agree unanimously to add a new block and must do it while ensuring its integrity" [2]. It is amazing how the authors present their ideas about data transparency-a crucial data quality dimension based on agreed-upon data ethics principles, where blockchain technology is used as an infrastructure for efficient transparency-in an easy-to-follow way. Besides the blockchain technology issues, the authors provide a novel approach for the research community: including data ethics as a key part of AI ethics. Data ethics "refers to assessing whether data have been gathered and used according to some ethical principles"; trust and transparency are important elements. They clearly state that "transparency is critical" and "data transparency should not violate privacy and security requirements." This work is promotional research on data ethics that helps in building digital trust and developing adequate regulatory requirements for data transparency. It is also strongly connected to identifying universal data ethics principles for the data science community.

        Access critical reviews of Computing literature here

        Become a reviewer for Computing Reviews.

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Journal of Data and Information Quality
          Journal of Data and Information Quality  Volume 11, Issue 4
          December 2019
          139 pages
          ISSN:1936-1955
          EISSN:1936-1963
          DOI:10.1145/3357606
          Issue’s Table of Contents

          Copyright © 2019 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 21 August 2019
          • Revised: 1 February 2019
          • Accepted: 1 February 2019
          • Received: 1 January 2019
          Published in jdiq Volume 11, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format