ACM Home Page
Please provide us with feedback. Feedback
Frequent loop detection using efficient non-intrusive on-chip hardware
Full text PdfPdf (278 KB)
Source International Conference on Compilers, Architecture and Synthesis for Embedded Systems archive
Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems table of contents
San Jose, California, USA
SESSION: Microprocessor architecture table of contents
Pages: 117 - 124  
Year of Publication: 2003
ISBN:1-58113-676-5
Authors
Ann Gordon-Ross  University of California, Riverside,Riverside, CA
Frank Vahid  University of California, Riverside,Riverside, CA
Sponsors
ACM: Association for Computing Machinery
SIGMICRO: ACM Special Interest Group on Microarchitectural Research and Processing
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 51,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/951710.951728
What is a DOI?

ABSTRACT

Dynamic software optimization methods are becoming increasingly popular for improving software performance and power. The first step in dynamic optimization consists of detecting frequently executed code, or "critical regions." Previous critical region detectors have been targeted to desktop processors. We introduce a critical region detector targeted to embedded processors, with the unique features of being very size and power efficient, and being completely non-intrusive to the software's execution - features needed in timing-sensitive embedded systems. Our detector not only finds the critical regions, but also determines their relative frequencies, a potentially important feature for selecting among alternative dynamic optimization methods. Our detector uses a tiny cache coupled with a small amount of logic. We provide results of extensive explorations across seventeen embedded system benchmarks. We show that highly accurate results can be achieved with only a 0.02% power overhead and acceptable size overhead. Our detector is currently being used as part of a dynamic hardware/software partitioning approach, but is applicable to a wide-variety of situations.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
Artisan, http://www.artisan.com.
3
 
4
 
5
Burger, D., Austin, T., Bennet, S. Evaluating future microprocessors: the simplescalar toolset. University of Wisconsin-Madison. Computer Science Department Tech. Report CS-TR-1308, July 2000.
 
6
 
7
 
8
 
9
 
10
11
 
12
IEEE, IEEE 1149.1 Standard Test Access Port and Boundary-Scan Architecture, http://standards .ieee.org, 2001.
13
 
14
 
15
Kiefendorff, K.. Transistor Budgets Go Ballistic. Microprocessor Report, Volume 12, Number 10, August 1998, pp. 34--43.
 
16
Klaiber, A. The technology behind crusoe processors. Transmeta Technical Brief. January 2000.
 
17
18
19
20
21
22
 
23
MIPS Technologies, http://www.mips.com/content/Products/Cores/32BitCores/MIPS324KFamily/ProductCatalog/P_MIPS324KFamily/productBrief
24
 
25
26
27
 
28
Synopsys Inc., http://www.synopsys.com.
 
29
 
30
Vtune Environment, Intel Corp., http://developer.intel.com/vtune
 
31
 
32
33


Collaborative Colleagues:
Ann Gordon-Ross: colleagues
Frank Vahid: colleagues

Peer to Peer - Readers of this Article have also read: