skip to main content
10.1145/1057661.1057729acmconferencesArticle/Chapter ViewAbstractPublication PagesglsvlsiConference Proceedingsconference-collections
Article

Load elimination for low-power embedded processors

Published: 17 April 2005 Publication History

Abstract

The performance requirements of emerging embedded applications are rapidly increasing. One attractive approach to increase the performance of processors, while keeping their energy consumption low, is to utilize instruction-level parallelism. Hence, we are witnessing a significant increase in the number of superscalar embedded processors. In this paper, we present a method to reduce the energy consumption in such processors. Particularly, we will show that a) the load instructions in representative applications exhibit a large address locality, i.e., two consecutive executions of the same load instruction is very likely to access the same data, and b) the register file utilization of these applications are usually low. To take advantage of these observations, we devise a load elimination scheme, which tries to store the data values of load instructions in the register file. Our results with 11 MediaBench applications reveal that this method eliminates 20.5% of all cache accesses, resulting in 11.5% reduction in the energy consumption.

References

[1]
Azam M, P.F., Wentai Liu. Low power data processing by elimination of redundant computations. in International Symposium on Low Power Electronics and Design. 1997. Monterey, CA.
[2]
Bekerman, M., et al. Early load address resolution via register tracking. in International Symposium on Computer Architecture. 2000.
[3]
Burger, D. and T. Austin, SimpleScalar Tool Set, Version 2.0. June 1997, University of Wisconsin.
[4]
Burtscher, M. and B. Zorn, Hybrid load value predictors. IEEE Transactions on Computers, July 2002. 51(7).
[5]
Cho, S., P.-C. Yew, and G. Lee. Decoupling local variable accesses in a wide-issue superscalar processor. in International Symposium on Computer Architecture. 1999.
[6]
EETimes, Two call on superscalar CPUs for handset apps. Oct. 2003.
[7]
Intel, C., SA-110 Microprocessor Technical Reference Manual.
[8]
Kessler, R., The Alpha 21264 Microproc. IEEE Micro, Mar/Apr 1999.
[9]
Lee, C., M. Potkonjak, and W.H. Mangione-Smith. MediaBench: A Tool for Evaluating and Synthesizing Multimedia and Communications Systems. in International Symposium on Microarchitecture. 1997.
[10]
Loechner, V., B. Meister, and P. Clauss, Precise data locality optimization of nested loops. The Journal of Supercomputing, Jan'02. 21-1.
[11]
STMicroelectronics, STPC Microcontrollers - Overview.'04.
[12]
Tyson, G.S. and T.M. Austin. Improving the accuracy and performance of performance of memory communication through renaming. in International Symposium on Microarchitecture. 1997.
[13]
Wilton, S. and N. Jouppi, An enhanced access and cycle time model for on-chip caches. July 1995, Digital Western Research Laboratory, 93/5.
[14]
Yang J, R.G. Energy-efficient load and store reuse. in International Symposium on Low Power Electronics and Design (ISLPED). 2001. Huntington Beach, CA.

Index Terms

  1. Load elimination for low-power embedded processors

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    GLSVLSI '05: Proceedings of the 15th ACM Great Lakes symposium on VLSI
    April 2005
    518 pages
    ISBN:1595930574
    DOI:10.1145/1057661
    • General Chair:
    • John Lach,
    • Program Chairs:
    • Gang Qu,
    • Yehea Ismail
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 April 2005

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. load elimination technique
    2. low power design

    Qualifiers

    • Article

    Conference

    GLSVLSI05
    Sponsor:
    GLSVLSI05: Great Lakes Symposium on VLSI 2005
    April 17 - 19, 2005
    Illinois, Chicago, USA

    Acceptance Rates

    Overall Acceptance Rate 312 of 1,156 submissions, 27%

    Upcoming Conference

    GLSVLSI '25
    Great Lakes Symposium on VLSI 2025
    June 30 - July 2, 2025
    New Orleans , LA , USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 215
      Total Downloads
    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media