| Design space optimization of embedded memory systems via data remapping |
| Full text |
Pdf
(382 KB)
|
| Source
|
Language, Compiler and Tool Support for Embedded Systems
archive
Proceedings of the joint conference on Languages, compilers and tools for embedded systems: software and compilers for embedded systems
table of contents
Berlin, Germany
SESSION: Synthesis and Design Space Exploration
table of contents
Pages: 28 - 37
Year of Publication: 2002
ISBN:1-58113-527-0
Also published in ...
|
|
Authors
|
|
Krishna V. Palem
|
Georgia Institute of Technology, Atlanta, GA
|
|
Rodric M. Rabbah
|
Georgia Institute of Technology, Atlanta, GA
|
|
Vincent J. Mooney, III
|
Georgia Institute of Technology, Atlanta, GA
|
|
Pinar Korkmaz
|
Georgia Institute of Technology, Atlanta, GA
|
|
Kiran Puttaswamy
|
Georgia Institute of Technology, Atlanta, GA
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 6, Downloads (12 Months): 53, Citation Count: 5
|
|
|
ABSTRACT
In this paper, we provide a novel compile-time data remapping algorithm that runs in linear time. This remapping algorithm is the first fully automatic approach applicable to pointer-intensive dynamic applications. We show that data remapping can be used to significantly reduce the energy consumed as well as the memory size needed to meet a user-specified performance goal (i.e., execution time) -- relative to the same application executing without being remapped. These twin advantages afforded by a remapped program -- reduced cache size and energy needs -- constitute a key step in a framework for design space exploration: for any given performance goal, remapping allows the user to reduce the primary and secondary cache size by 50%, yielding a concomitant energy savings of 57%. Additionally, viewed as a compiler optimization for a fixed processor, we show that remapping improves the energy consumed by the cache subsystem by 25%. All of the above savings are in the context of the cache subsystem in isolation. We also show that remapping yields an average 20% energy saving for an ARM-like processor and cache subsystem. All of our improvements are achieved in the context of DIS, Olden and SPEC2000 pointer-centric benchmarks.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
Brad Calder , Chandra Krintz , Simmi John , Todd Austin, Cache-conscious data placement, Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, p.139-149, October 02-07, 1998, San Jose, California, United States
|
| |
4
|
John B. Carter , Wilson C. Hsieh , Mark R. Swanson , Lixin Zhang , Erik Brunvand , Al Davis , Chen-Chi Kuo , Ravindra Kuramkote , Michael Parker , Lambert Schaelicke , Leigh Stoller , Terry Tateyama, Memory System Support for Irregular Applications, Selected Papers from the 4th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers, p.17-26, May 28-30, 1998
|
| |
5
|
|
 |
6
|
L. N. Chakrapani , P. Korkmaz , V. J. Mooney, III , K. V. Palem , K. Puttaswamy , W. F. Wong, The emerging power crisis in embedded processors: what can a poor compiler do?, Proceedings of the 2001 international conference on Compilers, architecture, and synthesis for embedded systems, November 16-17, 2001, Atlanta, Georgia, USA
[doi> 10.1145/502217.502246]
|
| |
7
|
L. Chakrapani, K. Palem, and W. Wong. Triceps: Enhancing the trimaran compiler infrastructure for strongarm code generation. Technical Report CREST-TR-01-001, Georgia Institute of Technology, May 2001
|
 |
8
|
Trishul M. Chilimbi , Bob Davidson , James R. Larus, Cache-conscious structure definition, Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, p.13-24, May 01-04, 1999, Atlanta, Georgia, United States
|
 |
9
|
Trishul M. Chilimbi , Mark D. Hill , James R. Larus, Cache-conscious structure layout, Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation, p.1-12, May 01-04, 1999, Atlanta, Georgia, United States
|
 |
10
|
|
| |
11
|
Data-Intensive Systems benchmark suite. www.aaec.com/projectweb/dis/
|
 |
12
|
|
 |
13
|
|
| |
14
|
P. Korkmaz, K. Puttaswamy, and V. Mooney. Energy modeling of a processor core using synopsys and of the memory hierarchy using the kamble and ghose model. Technical Report CREST-TR-02-002, Georgia Institute of Technology, Feb. 2002
|
 |
15
|
Monica D. Lam , Edward E. Rothberg , Michael E. Wolf, The cache performance and optimizations of blocked algorithms, Proceedings of the fourth international conference on Architectural support for programming languages and operating systems, p.63-74, April 08-11, 1991, Santa Clara, California, United States
|
| |
16
|
Olden benchmark suite. www.cs.princeton.edu/~mcc/olden.html
|
| |
17
|
K. Palem and R. Rabbah. Data remapping for design space optimization of embedded cache systems. Technical Report GIT-CC-02-10, Georgia Institute of Technology, Mar. 2002
|
 |
18
|
P. R. Panda , F. Catthoor , N. D. Dutt , K. Danckaert , E. Brockmeyer , C. Kulkarni , A. Vandercappelle , P. G. Kjeldsberg, Data and memory optimization techniques for embedded systems, ACM Transactions on Design Automation of Electronic Systems (TODAES), v.6 n.2, p.149-206, April 2001
[doi> 10.1145/375977.375978]
|
 |
19
|
|
| |
20
|
|
 |
21
|
|
 |
22
|
|
| |
23
|
Standard Performance Evaluation Corporation CPU2000 benchmark suite. www.spec.org
|
 |
24
|
|
| |
25
|
The simplescalar-arm power modeling project. www.eecs.umich.edu/~jringenb/power/
|
| |
26
|
S. Thompson, P. Packan, and M. Bohr. Mos scaling: Transistor challenges for the 21st century. Technical Report~Q3, Intel Technology Journal, July 1994
|
| |
27
|
Triceps: A trimaran-based arm code generator. www.trimaran.org/triceps.shtml
|
| |
28
|
|
| |
29
|
S. Wilton and N. Jouppi. An enhanced access and cycle time model for on-chip caches. Technical Report 93.5, WRL Research Report, July 1994
|
CITED BY 5
|
Qingfeng Zhuge , Zili Shao , Bin Xiao , Edwin H.-M. Sha, Design space minimization with timing and code size optimization for embedded DSP, Proceedings of the 1st IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, October 01-03, 2003, Newport Beach, CA, USA
|
|
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|