Hardware support for dynamic activation of compiler-directed computation reuse

Authors:
Daniel A. Connors

Department of Electrical and Computer Engineering & Computer Science, University of Colorado

Department of Electrical and Computer Engineering & Computer Science, University of Colorado
View Profile

,
Hillery C. Hunter

Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign

Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign
View Profile

,
Ben-Chung Cheng

Transmeta Corporation, Santa Clara, CA

Transmeta Corporation, Santa Clara, CA
View Profile

,
Wen-mei W. Hwu

Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign

Center for Reliable and High-Performance Computing, University of Illinois at Urbana-Champaign
View Profile

ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systemsNovember 2000Pages 222–233https://doi.org/10.1145/378993.379243

Published:12 November 2000Publication History

ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems

Pages 222–233

ABSTRACT

Compiler-directed Computation Reuse (CCR) enhances program execution speed and efficiency by eliminating dynamic computation redundancy. In this approach, the compiler designates large program regions for potential reuse. During run time, the execution results of these reusable regions are recorded into hardware buffers for future reuse. Previous work shows that CCR can result in significant performance enhancements in general applications. A major limitation of the work is that the compiler relies on value profiling to identify reusable regions, making it difficult to deploy the scheme in many software production environments. This paper presents a new hardware model that alleviates the need for value profiling at compile time. The compiler is allowed to designate reusable regions that may prove to be inappropriate. The hardware mechanism monitors the dynamic behavior of compiler-designated regions and selectively activates the profitable ones at run time. Experimental results show that the proposed design makes more effective utilization of hardware buffer resources, achieves rapid employment of computation regions, and improves reuse accuracy, all of which promote more flexible compiler methods of identifying reusable computation regions.

References

1.J. Auslander, M. Philipose, C. Chambers, S. Eggers, and B. Bershad. Fast, effective dynamic compilation. In Proceedings of the ACM SIGPLAN 1996 Conference on Programming Language Design and Implementation, volume 31, pages 149-159, June 1996. Google ScholarDigital Library
2.T. Autrey and M. Wolfe. Initial results for glacial variable analysis. International Journal of Parallel Programming, 26(1), February 1998. Google ScholarDigital Library
3.B. Calder, P. Feller, and A. Eustace. Value profiling. In Proceedings of the 30th Annual International Symposium on Microarchitecture, pages 259-269, December 1997. Google ScholarDigital Library
4.D. Callahan, K. Cooper, K. Kennedy, and L. Torczon. Interprocedural constant propagation. In Proceedings of the Symposium on Compiler Construction, 1986. Google ScholarDigital Library
5.W. H. Chen, C. H. Smith, and S. Fralick. A fast computational algorithm for the discrete cosine transform. IEEE Transactions on Communications, COM-25:1004-1009, September 1977.Google ScholarCross Ref
6.B. C. Cheng and W. W. Hwu. Interprocedural pointer analysis using access paths. In Proceedings of the ACM SIGPLAN '00 Conference on Programming Language Design and Implementation, June 2000. Google ScholarDigital Library
7.D. A. Connors and W. W. Hwu. Compiler-directed computation reuse (CCR). In Proceedings of the 32nd Annual International Symposium on Microarchitecture, pages 158-169, November 1999. Google ScholarDigital Library
8.M. D. Ernst, J. Cockrell, W. G. Griswold, and D. Notkin. Dynamically discovering likely program invariants to support program evolution. In Proceedings of the 19th International Conference on Software Engineering, pages 213-224, May 1999. Google ScholarDigital Library
9.Hwu. The Superblock: An effective technique for VLIW and superscalar compilation. The Journal of Supercomputing, 7(1):229-248, January 1993. Google ScholarDigital Library
10.C. Lee and W. Mangione-Smith. Mediabench: A tool for evaluating and synthesizing multimedia and communications systems. In Proceedings of the 30th Annual International Symposium on Microarchitecture, pages 330-335, December 1997. Google ScholarDigital Library
11.D. C. Lee, P. J. Crowley, J. L. Baer, T. E. Anderson, and B. N. Bershad. Execution characteristics of desktop applications on windows nt. In Proceedings of the 25th International Symposium on Computer Architecture, pages 27-38, June 1998. Google ScholarDigital Library
12.M. H. Lipasti, C. B. Wilkerson, and J. P. Shen. Value locality and load value prediction. In Proceedings of 7th International Conference on Architectural Support for Programming Languages and Operating Systems, pages 138-147, September 1996. Google ScholarDigital Library
13.M. C. Merten, A. R. Trick, and W. W. Hwu. A hardware-driven profiling scheme for identifying program hot spots to support runtime optimization. In Proceedings of the 1999 International Symposium on Computer Architecture, pages 136-147, May 1999. Google ScholarDigital Library
14.E. Rotenberg and J. E. Smith. Trace cache: a low latency approach to high bandwidth instruction fetching. In Proceedings of the 29th International Symposium on Microarchitecture, pages 24-34, December 1996. Google ScholarDigital Library
15.Y. Sazeides and J. E. Smith. The predictability of data values. In Proceedings of the 30th International Symposium on Microarchitecture, pages 248-258, December 1997. Google ScholarDigital Library
16.A. Sodani and G. S. Sohi. Dynamic instruction reuse. In Proceedings of the 25th International Symposium on Computer Architecture, pages 194-205, June 1998. Google ScholarDigital Library
17.M. N. Wegman and F. K. Zadeck. Constant propagation with conditional branches. In Proceedings of the 12th Symposium on Principles of Programming Languages, pages 291-299, January 1985. Google ScholarDigital Library
18.T. Xanthopoulos and A. Chandrakasan. A low-power IDCT macrocell for MPEG-2 exploiting data properties for minimal activity. IEEE Journal of Solid-State Circuits, pages 693-703, May 1999.Google Scholar

Index Terms

Recommendations

Hardware support for dynamic activation of compiler-directed computation reuse

Compiler-directed Computation Reuse (CCR) enhances program execution speed and efficiency by eliminating dynamic computation redundancy. In this approach, the compiler designates large program regions for potential reuse. During run time, the execution ...
Read More
Hardware support for dynamic activation of compiler-directed computation reuse
Special Issue: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems (ASPLOS '00)

Compiler-directed Computation Reuse (CCR) enhances program execution speed and efficiency by eliminating dynamic computation redundancy. In this approach, the compiler designates large program regions for potential reuse. During run time, the execution ...
Read More
Hardware support for dynamic activation of compiler-directed computation reuse

Compiler-directed Computation Reuse (CCR) enhances program execution speed and efficiency by eliminating dynamic computation redundancy. In this approach, the compiler designates large program regions for potential reuse. During run time, the execution ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
November 2000
271 pages
ISBN:1581133170
DOI:10.1145/378993
Chairmen:
Larry Rudolph
MIT, Cambridge, MA
,
Anoop Gupta
Microsoft
ACM SIGARCH Computer Architecture News Volume 28, Issue 5
Special Issue: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems (ASPLOS '00)
Dec. 2000
269 pages
ISSN:0163-5964
DOI:10.1145/378995
Editor:
Doug DeGroot
Dallas, TX
Issue’s Table of Contents
ACM SIGOPS Operating Systems Review Volume 34, Issue 5
Dec. 2000
269 pages
ISSN:0163-5980
DOI:10.1145/384264
Chairman:
William E. Weihl
Akami technologies, Inc., San Mateo, CA
Issue’s Table of Contents
Copyright © 2000 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 November 2000
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
ASPLOS IX Paper Acceptance Rate24of114submissions,21%Overall Acceptance Rate535of2,713submissions,20%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 19
  Total Citations
  View Citations
- 568
  Total Downloads
- Downloads (Last 12 months)87
- Downloads (Last 6 weeks)11
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Hardware support for dynamic activation of compiler-directed computation reuse

ASPLOS IX: Proceedings of the ninth international conference on Architectural support for programming languages and operating systems

ABSTRACT

References

Cited By

Index Terms

Recommendations

Hardware support for dynamic activation of compiler-directed computation reuse

Hardware support for dynamic activation of compiler-directed computation reuse

Hardware support for dynamic activation of compiler-directed computation reuse