ACM Home Page
Please provide us with feedback. Feedback
Improving superscalar instruction dispatch and issue by exploiting dynamic code sequences
Full text PdfPdf (1.76 MB)
Source International Symposium on Computer Architecture archive
Proceedings of the 24th annual international symposium on Computer architecture table of contents
Denver, Colorado, United States
Pages: 1 - 12  
Year of Publication: 1997
ISBN:0-89791-901-7
Also published in ...
Authors
Sriram Vajapeyam  Supercomputer Education and Research Centre and Dept. of Computer Science & Automation, Indian Institnte of Science, Bangalore, India 560012
Tulika Mitra  Dept. of Computer Science & Automation, Indian Institute of science, Bangalore, India 560012
Sponsor
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 49,   Citation Count: 25
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/264107.264119
What is a DOI?

ABSTRACT

Superscalar processors currently have the potential to fetch multiple basic blocks per cycle by employing one of several recently proposed instruction fetch mechanisms. However, this increased fetch bandwidth cannot be exploited unless pipeline stages further downstream correspondingly improve. In particular, register renaming a large number of instructions per cycle is difficult. A large instruction window, needed to receive multiple basic blocks per cycle, will slow down dependence resolution and instruction issue. This paper addresses these and related issues by proposing (i) partitioning of the instruction window into multiple blocks, each holding a dynamic code sequence; (ii) logical partitioning of the register file into a global file and several local files, the latter holding registers local to a dynamic code sequence; (iii) the dynamic recording and reuse of register renaming information for registers local to a dynamic code sequence. Performance studies show these mechanisms improve performance over traditional superscalar processors by factors ranging from 1.5 to a little over 3 for the SPEC Integer programs. Next, it is observed that several of the loops in the benchmarks display vector-like behavior during execution, even if the static loop bodies are likely complex for compile-time vectorization. A dynamic loop vectorization mechanism that builds on top of the above mechanisms is briefly outlined. The mechanism vectorizes up to 60% of the dynamic instructions for some programs, albeit the average number of iterations per loop is quite small.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

Aus92a
Bur96a
Con95a
Dit82a
Fra92a
Fra92b
 
Fra93a
Fra94a
 
Fra95a
 
Hao96a
Hil84a
 
Hwu87a
W. W. Hwu and Y. N. Patt, "Design Choices for the HPSm Microprocessor Chip," in Proc. 20th Annual Hawaii International Conference on System Sciences, Kona, HI, January 1987.
 
IBM90a
IBM, "Special Issue on the IBM RISC System/6000 Processor," IBM Journal of Research and Development, January 1990.
Lam92a
 
Mel88a
 
Mit97a
Tulika Mitra. "Performance Evaluation of Improved Superscalar Issue Mechanisms," in M.E. Project Report, Dept. of Computer Science, Indian Institute of Science, January 1997.
 
Pal96a
S. Palacharla. N. Jouppi, and J. E. Smith, "Quantifying the Complexity of Superscalar Processors," Univ. of Wisconsin-Madison Technical Report, vol. CS-T&96- 1328, November 1996, (Available at http:l/www.cs.wisc.edultrs.html; a version to appear in ISCA'97).
Pat85a
Pat85b
 
Rot96a
Rus78a
Smi84a
 
Smo95a
Spr94a
 
Tom67a
R. M. Tomasulo, "An Efficient Algorithm for Exploiting Multiple Arithmetic Units," IBM Journal of Research and Development, January 1967.
 
Uht92a
Wal91a
 
Wei95a
Yeh93b
Yeh93a

CITED BY  25
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Collaborative Colleagues:
Sriram Vajapeyam: colleagues
Tulika Mitra: colleagues

Peer to Peer - Readers of this Article have also read: