DOI: 10.1145/1140277.1140326

Analysis of a new intra-disk redundancy scheme for high-reliability RAID storage systems in the presence of unrecoverable errors

Published: 26 June 2006

Abstract

Today's data storage systems are increasingly adopting low-cost disk drives that have higher capacity but lower reliability, leading to more frequent rebuilds and to a higher risk of unrecoverable media errors. We propose a new XOR-based intra-disk redundancy scheme, called interleaved parity check (IPC), that enhances the reliability of RAID systems while incurring only negligible I/O performance degradation. The proposed scheme introduces an additional level of redundancy inside each disk, on top of the RAID redundancy across multiple disks. The RAID parity provides protection against disk failures, while the proposed scheme aims to protect against media-related unrecoverable errors.

We develop a new model capturing the effect of correlated unrecoverable sector errors and subsequently use it to analyze the proposed scheme as well as the traditional redundancy schemes based on Reed-Solomon (RS) codes and single-parity-check (SPC) codes. We derive closed-form expressions for the mean time to data loss (MTTDL) of RAID 5 and RAID 6 systems in the presence of unrecoverable errors and disk failures. We then combine these results for a comprehensive characterization of the reliability of RAID systems that incorporate the proposed IPC redundancy scheme. Our results show that in the practical case of correlated errors, the proposed scheme provides the same reliability as the optimum, albeit more complex, RS coding scheme. Finally, the throughput performance of various RAID systems that incorporate the intra-disk redundancy is evaluated by means of event-driven simulations. A detailed description of these contributions is given in [1].
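The abstract does not spell out the IPC layout, so the following is only a minimal sketch of the general idea of an XOR-based interleaved parity inside one disk segment. It assumes that data sector i of a segment belongs to interleave i mod m, that each interleave gets one XOR parity sector, and that unrecoverable sectors are reported by the drive as erasures (represented here as None). The function names `ipc_parity` and `ipc_recover` and the parameter `m` are hypothetical and are not taken from the paper.

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equally sized sectors."""
    return bytes(x ^ y for x, y in zip(a, b))


def ipc_parity(data_sectors, m):
    """Compute m parity sectors for one segment.

    Assumed layout: parity j is the XOR of all data sectors i with i % m == j.
    """
    size = len(data_sectors[0])
    parity = [bytes(size) for _ in range(m)]  # all-zero sectors
    for i, sector in enumerate(data_sectors):
        parity[i % m] = xor_bytes(parity[i % m], sector)
    return parity


def ipc_recover(data_sectors, parity, m):
    """Rebuild erased sectors (marked None) from the interleave parities.

    Each interleave tolerates one erasure, so a burst of up to m
    consecutive unreadable sectors can be repaired in this sketch.
    """
    recovered = list(data_sectors)
    for j in range(m):
        members = range(j, len(data_sectors), m)
        lost = [i for i in members if data_sectors[i] is None]
        if len(lost) > 1:
            raise ValueError(f"interleave {j} has {len(lost)} erasures")
        if lost:
            acc = parity[j]
            for i in members:
                if i != lost[0]:
                    acc = xor_bytes(acc, data_sectors[i])
            recovered[lost[0]] = acc
    return recovered


# Usage: 12 tiny "sectors", 4 interleaves, one burst of 4 lost sectors.
sectors = [bytes([i]) * 8 for i in range(12)]
parity = ipc_parity(sectors, m=4)
damaged = list(sectors)
damaged[5:9] = [None] * 4
restored = ipc_recover(damaged, parity, m=4)
```

Under these assumptions, a burst of consecutive sector errors is spread across different interleaves, which is why a plain XOR parity per interleave suffices for it. Two lost sectors in the same interleave are not recoverable within the disk and would have to be handled by the RAID-level parity across disks.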

Reference

[1] A. Dholakia et al. Analysis of a New Intra-Disk Redundancy Scheme for High-Reliability RAID Storage Systems in the Presence of Unrecoverable Errors. IBM Research Report RZ 3652, March 16, 2006.

      Published In

      SIGMETRICS '06/Performance '06: Proceedings of the joint international conference on Measurement and modeling of computer systems
      June 2006
      404 pages
      ISBN: 1595933190
      DOI: 10.1145/1140277

      ACM SIGMETRICS Performance Evaluation Review, Volume 34, Issue 1
      June 2006
      388 pages
      ISSN: 0163-5999
      DOI: 10.1145/1140103
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. RAID
      2. file and I/O systems
      3. reliability analysis
      4. stochastic modeling

      Qualifiers

      • Article

      Conference

      SIGMETRICS06

      Acceptance Rates

      Overall Acceptance Rate 459 of 2,691 submissions, 17%

      Cited By

      • (2018) Protecting Single Shingled Write Drives Against Latent Sector Failures. Proceedings of the 11th ACM International Systems and Storage Conference, pp. 26-36. DOI: 10.1145/3211890.3211893. Online publication date: 4-Jun-2018
      • (2015) Rebuttal to “Beyond MTTDL: A Closed-Form RAID-6 Reliability Equation”. ACM Transactions on Storage 11:2, pp. 1-10. DOI: 10.1145/2700311. Online publication date: 20-Mar-2015
      • (2009) Protecting against rare event failures in archival systems. 2009 IEEE International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems, pp. 1-11. DOI: 10.1109/MASCOT.2009.5366825. Online publication date: Sep-2009
      • (2009) Reliability modeling of RAID storage systems with latent errors. 2009 IEEE International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems, pp. 1-12. DOI: 10.1109/MASCOT.2009.5366195. Online publication date: Sep-2009
      • (2008) Pergamum. Proceedings of the 6th USENIX Conference on File and Storage Technologies, pp. 1-16. DOI: 10.5555/1364813.1364814. Online publication date: 26-Feb-2008
      • (2008) Enhancing data availability in disk drives through background activities. 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN), pp. 492-501. DOI: 10.1109/DSN.2008.4630120. Online publication date: Jun-2008
      • (2015) Discovering Information Propagation Patterns in Microblogging Services. ACM Transactions on Knowledge Discovery from Data 10:1, pp. 1-22. DOI: 10.1145/2742801. Online publication date: 22-Jul-2015
      • (2015) dDRAID: A technique for capacity and performance enhancement of RAID storage systems. 2015 Annual IEEE India Conference (INDICON), pp. 1-6. DOI: 10.1109/INDICON.2015.7443331. Online publication date: Dec-2015
