Speculative execution for hiding memory latency
Source: ACM SIGARCH Computer Architecture News
Volume 33, Issue 3 (June 2005)
Special issue: MEDEA 2004 workshop
Pages: 49 - 56  
Year of Publication: 2005
ISSN:0163-5964
Authors
Alex Pajuelo  Universitat Politècnica de Catalunya, Barcelona-Spain
Antonio González  Universitat Politècnica de Catalunya, Barcelona-Spain
Mateo Valero  Universitat Politècnica de Catalunya, Barcelona-Spain
Publisher
ACM  New York, NY, USA
DOI: http://doi.acm.org/10.1145/1101868.1101877

ABSTRACT

L2 misses are one of the main causes of stalls in current and future microprocessors. In this paper we present a mechanism to speculatively execute instructions that are independent of L2-miss loads, even if no entry in the reorder buffer is available. The proposed mechanism generates future instances of instructions that are expected to be independent of the delinquent load. When these dynamic instructions are later fetched, they use the precomputed data and go directly to the commit stage without executing. The mechanism replicates strided loads that precede the L2-miss load and produce the data for the target independent instructions. Instructions following the L2-miss load check whether their source operands have been replicated; if so, multiple speculative instances of them are also generated. This mechanism is built on top of a superscalar processor with an aggressive prefetch scheme. Compared to this baseline, the mechanism achieves a 21% performance improvement.
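
The replication idea described in the abstract can be illustrated with a toy software model. The sketch below is not the authors' hardware design: the names `StrideTable`, `precompute`, and `run_instance`, the two-observation confidence rule, and the dictionary standing in for memory are all assumptions made for illustration. It shows only the general pattern: detect a strided load, precompute the results of a dependent operation for predicted future addresses, and let later instances reuse those results instead of executing.

```python
from dataclasses import dataclass


@dataclass
class StrideEntry:
    """Per-load state (hypothetical layout, not taken from the paper)."""
    last_addr: int
    stride: int = 0
    confident: bool = False


class StrideTable:
    """Toy stride detector indexed by the load's program counter."""

    def __init__(self):
        self.entries = {}

    def observe(self, pc, addr):
        """Record one executed instance of the load at `pc`."""
        entry = self.entries.get(pc)
        if entry is None:
            self.entries[pc] = StrideEntry(last_addr=addr)
            return
        new_stride = addr - entry.last_addr
        # Assumed rule: confident once the same non-zero stride repeats.
        entry.confident = new_stride == entry.stride and new_stride != 0
        entry.stride = new_stride
        entry.last_addr = addr

    def predict(self, pc, count):
        """Return the next `count` predicted addresses, or [] if unsure."""
        entry = self.entries.get(pc)
        if entry is None or not entry.confident:
            return []
        return [entry.last_addr + k * entry.stride for k in range(1, count + 1)]


def precompute(table, load_pc, memory, op, count):
    """Replicate the strided load and the operation that consumes it.

    Returns a map from predicted address to the precomputed result.
    """
    return {
        addr: op(memory[addr])
        for addr in table.predict(load_pc, count)
        if addr in memory
    }


def run_instance(addr, memory, op, precomputed):
    """Execute one later instance; reuse a precomputed result if present.

    Returns (result, reused), where `reused` is True when execution
    was skipped.
    """
    if addr in precomputed:
        return precomputed.pop(addr), True
    return op(memory[addr]), False
```

In this model, a reused result corresponds loosely to an instruction that goes to commit without executing; a real design would also need to validate the speculation, which this sketch omits.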


