ACM Home Page
Please provide us with feedback. Feedback
Enhancing data cache reliability by the addition of a small fully-associative replication cache
Full text PdfPdf (265 KB)
Source
International Conference on Supercomputing archive
Proceedings of the 18th annual international conference on Supercomputing table of contents
Malo, France
SESSION: Cache table of contents
Pages: 12 - 19  
Year of Publication: 2004
ISBN:1-58113-839-3
Author
Wei Zhang  Southern Illinois University Carbondale, Carbondale, IL
Sponsors
SIGARCH: ACM Special Interest Group on Computer Architecture
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 51,   Citation Count: 4
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1006209.1006212
What is a DOI?

ABSTRACT

Soft error conscious cache design is a necessity for reliable computing. ECC or parity-based integrity checking technique in use today either compromises performance for reliability or vice versa, and the N modular redundancy (NMR) scheme is too costly for microprocessors and applications with stringent cost constraint. This paper proposes a novel and cost-effective solution to enhance data reliability with minimum impact on performance. The idea is to add a small fully-associative cache to store the replica(s) of every write to the L1 data cache. The replicas can be used to detect and correct soft errors. The replication cache can also be used to increase performance by reducing the L1 data cache miss rate. Our experiments show that more than 97% read hits of the L1 data cache can find replicas available in a replication cache of 8 blocks.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
3
 
4
 
5
 
6
P. Sweazey. SRAM organization, control, and speed, and their effect on cache memory design. Midcon/87, pages 434--437, Septembe, 1987.
 
7
 
8
C. L. Chen and M. Y Hsiao. Error-correcting codes for semiconductor memory applications: a state of the art review. In Reliable Computer Systems - Design and Evaluation, pages 771--786, Digital Press, 2nd edition, 1992.
 
9
W. Zhang, S. Gurumurthi, M. kandemir and A. Sivasubramaniam. ICR: in-cache replication for enhancing data cache reliability. Proceedings of DSN, 2003.
 
10
 
11
S. Park and B. Bose. Burst asymmetric/unidirectional error correcting/detecting codes. Proceeding of International Symposium on Fault-Tolerant Computing, pages 273--280, June 1990.
 
12
13
 
14
 
15
 
16
Understanding Soft and Firm Errors in Semiconductor Devices. Actel Whitepaper, 2002.
17
 
18
19
 
20
 
21
 
22
AMD Athlon 64 FX Processor Data Sheet.
 
23
P. Shivakumar and N. Jouppi. CACTI 3.0: An integrated cache timing, power and area model. WRL Research Report 2001.
 
24



Peer to Peer - Readers of this Article have also read: