skip to main content
10.1145/3061639.3062255acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article
Public Access

Greybox Design Methodology: A Program Driven Hardware Co-optimization with Ultra-Dynamic Clock Management

Authors Info & Claims
Published:18 June 2017Publication History

ABSTRACT

In this paper, a novel Greybox design methodology is proposed to establish a design and co-optimization flow across the boundary of conventional software and hardware design. The dynamic timing of each software instruction is simulated and associated with processor hardware design, which provides the basis of ultra-dynamic clock management. The proposed scheme effectively implements the instruction-based clock management and achieves 21.71% frequency speedup. Besides, a novel program-driven hardware optimization flow is proposed, in which software operations are mapped with hardware gate netlist and sorted by the usage frequency. The experiments on an ARM based pipeline design in commercial 65nm CMOS process show an extra 10% frequency speedup is obtained with high optimization efficiency. Overall, the proposed Greybox design method achieves frequency speedup by 31.56%, comparing with conventional design method.

References

  1. J. Howard, et al., "A 48-core IA-32 processor in 45 nm CMOS using on-die message-passing and DVFS for performance and power scaling", IEEE Journal of Solid-State Circuits, vol. 46, no. 1, pp. 173--183, Jan. 2011.Google ScholarGoogle ScholarCross RefCross Ref
  2. Z. Toprak-Deniz, et al., "Distributed system of digitally controlled microregulators enabling per-core DVFS for the POWER8 microprocessor", International Solid-State Circuits Conference (ISSCC), pp. 98--99, Feb. 2014.Google ScholarGoogle Scholar
  3. S. Kim, et al., "Enabling wide autonomous DVFS in a 22 nm graphics execution core using a digitally controlled fully integrated voltage regulator", IEEE Journal of Solid-State Circuits, vol. 51, no. 1, pp. 18--30, Jan. 2016.Google ScholarGoogle ScholarCross RefCross Ref
  4. T. Jia, et al., "Exploration of associative power management with instruction governed operation for ultra-low power design", Design Automation Conference (DAC), 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Xin, et al., "Identifying and predicting timing-critical instructions to boost timing speculation", International Symposium on Microarchitecture (MICRO), pp. 74--85, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Constantin, et al., "Exploiting dynamic timing margins in microprocessors for frequency-over-scaling with instruction-based clock adjustment", Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Cherupalli, et al., "Exploiting dynamic timing slack for energy efficiency in ultra-low-power embedded systems", International Symp. on Computer Architecture (ISCA), 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Das, et al., "RazorII: in situ error detection and correction for PVT and SER tolerance", IEEE Journal of Solid-State Circuits, vol. 44, no. 1, pp. 32--48, Jan. 2009.Google ScholarGoogle ScholarCross RefCross Ref
  9. Online resource, ARM, "ARMv5 Architecture Reference Manual", https://silver.arm.com/download/download.tm?pv=1073121Google ScholarGoogle Scholar
  10. Online resource, http://www.gem5.org/Main_PageGoogle ScholarGoogle Scholar
  11. J. Henning, et al., "SPEC CPU2006 benchmark descriptions", Computer Architecture News, 34(4), Sep. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Meixner, et al., "Argus: Low-cost, comprehensive error detection in simple cores", International Symposium on Microarchitecture (MICRO), pp. 210--222, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. August, et al, "A TDC-less ADPLL with 200-to-3200MHz range and 3mW power dissipation for mobile SoC clocking in 22nm CMOS", International Solid-State Circuits Conference (ISSCC), Feb. 2012.Google ScholarGoogle Scholar
  14. M. Perrott, "Tutorial on digital phase-locked loops", Custom Integrated Circuits Conference (CICC), 2009.Google ScholarGoogle Scholar
  15. U. Moon, et al., "Spectral analysis of time-domain phase jitter measurement", IEEE Transactions on Circuits and System II, vol. 49, no. 5, pp. 321--327, 2002.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    DAC '17: Proceedings of the 54th Annual Design Automation Conference 2017
    June 2017
    533 pages
    ISBN:9781450349277
    DOI:10.1145/3061639

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 18 June 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate1,770of5,499submissions,32%

    Upcoming Conference

    DAC '24
    61st ACM/IEEE Design Automation Conference
    June 23 - 27, 2024
    San Francisco , CA , USA

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader