Evaluating the Impact of Irrecoverable Read Errors on Disk Array Reliability
Appeared in Proceedings of the IEEE 15th Pacific Rim International Symposium on Dependable Computing (PRDC09).
Abstract
We investigate the impact of irrecoverable read errors, also known as bad blocks, on the mean time to data loss (MTTDL) of mirrored disks, RAID level 5 arrays, and RAID level 6 arrays. Our study is based on the data collected by Bairavasundaram et al. from a population of 1.53 million disks over a period of 32 months. It indicates that irrecoverable read errors can reduce the MTTDL of the three arrays by up to 99 percent, effectively canceling most of the benefits of fast disk repairs. It also shows the benefits of frequent scrubbing scans, which map out bad blocks and thus prevent future irrecoverable read errors. As an example, once-a-month scrubbing scans were found to improve the MTTDL of the three arrays by at least 300 percent compared to once-a-year scrubbing scans.
Publication date:
November 2009
Authors:
Jehan-François Pâris
Ahmed Amer
Darrell D. E. Long
Thomas Schwarz
Projects:
Reliable Storage
Available media
Full paper text: PDF
Bibtex entry
@inproceedings{paris-prdc09,
  author       = {Jehan-François Pâris and Ahmed Amer and Darrell D. E. Long and Thomas Schwarz},
  title        = {Evaluating the Impact of Irrecoverable Read Errors on Disk Array Reliability},
  booktitle    = {Proceedings of the IEEE 15th Pacific Rim International Symposium on Dependable Computing (PRDC09)},
  month        = nov,
  year         = {2009},
}
    
