article

Free Access

Precise instruction scheduling without a precise machine model

Author:
Henry G. Baker

View Profile

Authors Info & Claims

ACM SIGARCH Computer Architecture News Volume 19 Issue 6Dec. 1991pp 4–8https://doi.org/10.1145/152766.152767

Published:01 December 1991Publication History

ACM SIGARCH Computer Architecture News

Abstract

A simple technique is presented which allows an optimizing compiler to more precisely compare the performance of alternative instruction sequences on a complex RISC architecture so that the better sequence can be chosen. This technique may be faster than current techniques, and has the advantage that minor modifications to the hardware do not require any changes to the compiler (not even recompilation), and yet have an immediate effect on instruction scheduling decisions.

References

Appel, Andrew. Private communication, July, 1991.Google Scholar
ATT. WE® DSP32C Digital Signal Processor Advance Data Sheet. ATT Microelectronics, Allentown, PA, May 1988.Google Scholar
Baker, Henry, and Parker, Clinton. Micro SPL. Synapse Computer Services, Sept. 1979.Google Scholar
Baker, Henry, and Parker, Clinton. "High Level Language Programs Run Ten Times Faster in Microstore". Tech. Rept., Synapse Computer Services, 1980.Google Scholar
Bradlee, David G., et al. "The Marion System for Retargetable Instruction Scheduling". Proc. ACM PLDI'91, Sigplan Not. 26, 6 (June 1991), 229-240. Google ScholarDigital Library
Chambers, C., and Ungar, D. "Customization: Optimizing Compiler Technology for SELF, A Dynamically-Typed Object-Oriented Programming Language". Proc. ACM PLDI'89, Sigplan Not. 24, 7 (July 1989), 146-160. Google ScholarDigital Library
Chambers, C., Ungar, D., and Lee, E. "An Efficient Implementation of SELF, A Dynamically-Typed Object-Oriented Programming Language". Proc. OOPSLA'89, Sigplan Not. 24, 10 (Oct. 1989), 49-70. Google ScholarDigital Library
Deutsch, L.P., and Schiffman, A.M. "Efficient Implementation of the Smalltalk-80 System". Proc. 11'th ACM POPL, Salt Lake City, UT, Jan. 1984, 297-302. Google ScholarDigital Library
Ellis, John R. Bulldog: A Compiler for VLIW Architectures. MIT Press, Cambridge, MA, 1986. Google ScholarDigital Library
Gibbons, P.B., and Muchnick, S.S. "Efficient instruction scheduling for a pipelined architecture". Proc. ACM Symp. on Compiler Constr., Sigplan Not. 21, 7 (July 1986), 11-16. Google ScholarDigital Library
Hennessy, John, and Gross, Thomas. "Postpass Code Optimization of Pipeline Constraints". ACM TOPLAS 5, 3 (July 1983), 422-448. Google ScholarDigital Library
Intel Corp. i860^TM [XR] 64-Bit Microprocessor Programmer's Reference Manual. #240329-002, 1989.Google Scholar
Intel Corp. i860^TMMicroprocessor Family Programmer's Reference Manual. #240875-001, 1991. Google ScholarDigital Library
Intel Corp. i860^TM 64-bit Microprocessor Simulator and Debugger Reference Manual, Ver. 3. #240437-003, Jan. 1990.Google Scholar
Keppel, David. "A Portable Interface for On-The-Fly Instruction Space Modification". Proc. 4'th ACM ASPLOS, Sigplan Not. 26, 4 (April 1991), 86-95. Google ScholarDigital Library
Knuth, Donald E. The Art of Computer Programming Vol. I: Fundamental Algorithms, 2nd Ed. Addison-Wesley, Reading, MA, 1973, 634 p. Google ScholarDigital Library
Kogge, P.M. The Architecture of Pipelined Computers. McGraw-Hill, New York, 1981.Google Scholar
Massalin, Henry. "Superoptimizer--A Look at the Smallest Program". Proc. ACM ASPLOS'87, Sigplan Not. 22, 10 (Oct. 1987), 122-126. Google ScholarDigital Library
Morris, W.G. "CCG: A Prototype Coagulating Code Generator". Proc. ACM PLDI'91, Sigplan Not. 26, 6 (June 1991), 45-58. Google ScholarDigital Library
Moyer, Steven A. "Performance of the iPSC/860 Node Architecture". IPC-TR-91-007, Inst. for Parallel Comp., Eng. & Applied Sci., U. of Va., May 1991. Google ScholarDigital Library
Scott, D.S., and Withers, G.R. "Performance and Assembly Language Programming of the iPSC/860 System". Tech. Report, Intel Supercomputer Systems Div., Beaverton, OR, 1990.Google Scholar
Texas Inst. TMS320C30: The Third Generation of the TMS320 Family of Digital Signal Processors. Texas Instruments, Feb. 1988.Google Scholar
Wirth, Niklaus. "From Programming Language Design to Computer Construction". CACM 28, 2 (Feb. 1985), 160-164. Google ScholarDigital Library
Xerox Corp. ALTO: A Personal Computer System Hardware Manual. Xerox PARC, Palo Alto, CA, Jan. 1977.Google Scholar

Index Terms

Recommendations

Precise Runahead Execution
Runahead execution improves processor performance by accurately prefetching long-latency memory accesses. When a long-latency load causes the instruction window to fill up and halt the pipeline, the processor enters runahead mode and keeps speculatively ...
Read More
Effective instruction scheduling with limited registers
Read More
Lazy instruction scheduling: keeping performance, reducing power
ISLPED '08: Proceedings of the 2008 international symposium on Low Power Electronics & Design

An important approach to reduce power dissipation is reducing the number of instructions executed by the processor. To achieve this goal, this paper introduces a novel instruction scheduling algorithm that executes an instruction only when its result is ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM SIGARCH Computer Architecture News Volume 19, Issue 6
Dec. 1991
20 pages
ISSN:0163-5964
DOI:10.1145/152766
Editor:
Doug DeGroot
Texas Instruments Inc., Dallas, TX
Issue’s Table of Contents
Copyright © 1991 Author
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 December 1991
Check for updates
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 236
  Total Downloads
- Downloads (Last 12 months)58
- Downloads (Last 6 weeks)8
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Precise instruction scheduling without a precise machine model

ACM SIGARCH Computer Architecture News

Abstract

References

Cited By

Index Terms

Recommendations

Precise Runahead Execution

Effective instruction scheduling with limited registers

Lazy instruction scheduling: keeping performance, reducing power

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Precise instruction scheduling without a precise machine model

ACM SIGARCH Computer Architecture News

Abstract

References

Cited By

Index Terms

Recommendations

Precise Runahead Execution

Effective instruction scheduling with limited registers

Lazy instruction scheduling: keeping performance, reducing power

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media