Abstract
Recent empirical studies have shown that multicore scaling is fast becoming power limited, and consequently, an increasing fraction of a multicore processor has to be under clocked or powered off. Therefore, in addition to fundamental innovations in architecture, compilers and parallelization of application programs, there is a need to develop practical and effective dynamic energy management (DEM) techniques for multicore processors.
Existing DEM techniques mainly target reducing processor power consumption and temperature, and only few of them have addressed improving energy efficiency for multicore systems. With energy efficiency taking a center stage in all aspects of computing, the focus of the DEM needs to be on finding practical methods to maximize processor efficiency. Towards this, this article presents STEAM -- an optimal closed-loop DEM controller designed for multicore processors. The objective is to maximize energy efficiency by dynamic voltage and frequency scaling (DVFS). Energy efficiency is defined as the ratio of performance to power consumption or performance-per-watt (PPW). This is the same as the number of instructions executed per Joule. The PPW metric is actually replaced by PαPW (performanceα-per-Watt), which allows for controlling the importance of performance versus power consumption by varying α.
The proposed controller was implemented on a Linux system and tested with the Intel Sandy Bridge processor. There are three power management schemes called governors, available with Intel platforms. They are referred to as (1) Powersave (lowest power consumption), (2) Performance (achieves highest performance), and (3) Ondemand. Our simple and lightweight controller when executing SPEC CPU2006, PARSEC, and MiBench benchmarks have achieved an average of 18% improvement in energy efficiency (MIPS/Watt) over these ACPI policies. Moreover, STEAM also demonstrated an excellent prediction of core temperatures and power consumption, and the ability to control the core temperatures within 3ˆC of the specified maximum. Finally, the overhead of the STEAM implementation (in terms of CPU resources) is less than 0.25%. The entire implementation is self-contained and can be installed on any processor with very little prior knowledge of the processor.
- Advanced Configuration and Power Interface Specification. http://www.acpi.info/spec.htm.Google Scholar
- ARM. 2012. Big.LITTLE Processing with ARM Cortex-A15 and Cortex-A7. http://www.arm.com/files/downloads/big.LITTLEFinal.pdf. Google ScholarDigital Library
- A. Bartolini, M. Cacciari, A. Tilli, and L. Benini. 2011. A distributed and self-calibrating model-predictive controller for energy and thermal management of high-performance multicores. In Proceedings of the Design, Automation Test in Europe Conference Exhibition (DATE'2011). 1--6.Google Scholar
- C. Bienia, S. Kumar, J. P. Singh, and K. Li. 2008. The PARSEC benchmark suite: Characterization and architectural implications. In Proceedings of PACT. 72--81. Google ScholarDigital Library
- G. Box, G. M. Jenkins, and G. C. Reinsel. 1994. Time Series Analysis: Forecasting and Control. Prentice-Hall. Google ScholarDigital Library
- T. Chantem, R. P. Dick, and X. S. Hu. 2008. Temperature-aware scheduling and assignment for hard real-time applications on MPSoCs. In Proceedings of DATE. 288--293. Google ScholarDigital Library
- R. Cochran, C. Hankendi, A. K. Coskun, and S. Reda. 2011. Pack & cap: Adaptive DVFS and thread packing under power caps. In Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-44'11). ACM, New York, 175--185. Google ScholarDigital Library
- A. Coskun, T. Rosing, K. Whisnant, and K. Gross. 2008. Static and dynamic temperature-aware scheduling for multiprocessor SoCs. IEEE Trans. VLSI Syst. 16, 1127--1140. Google ScholarDigital Library
- G. Dhiman and T. S. Rosing. 2007. Dynamic voltage frequency scaling for multi-tasking systems using online learning. In Proceedings of the 2007 International Symposium on Low Power Electronics and Design (ISLPED'07). ACM, New York, 207--212. Google ScholarDigital Library
- H. Esmaeilzadeh, E. Blem, R. St. Amant, K. Sankaralingam, and D. Burger. 2011. Dark silicon and the end of multicore scaling. In Proceedings of the International Symposium on Computer Architecture. Google ScholarDigital Library
- Y. Fu, N. Kottenstette, Y. Chen, C. Lu, X. Koutsoukos, and H. Wang. 2010. Feedback thermal control for real-time systems. In Proceedings of the 16th IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS'2010). 111--120. Google ScholarDigital Library
- M. R. Guthaus, J. S. Ringenberg, D. Ernst, T. M. Austin, T. Mudge, and R. B. Brown. 2001. MiBench: A free, commercially representative embedded benchmark suite. In Proceedings of WWC. 3--14. Google ScholarDigital Library
- V. Hanumaiah and S. Vrudhula. 2012a. Energy-efficient operation of multi-core processors by DVFs, task migration and active cooling. IEEE Trans. Comput. 99, 1. Google ScholarDigital Library
- V. Hanumaiah and S. Vrudhula. 2012b. Temperature-aware DVFs for hard real-time applications on multi-core processors. IEEE Trans. Comput. 61, 10, 1484--1494. Google ScholarDigital Library
- V. Hanumaiah, S. Vrudhula, and K. S. Chatha. 2011. Performance optimal online DVFS and task migration techniques for thermally constrained multi-core processors. IEEE Trans. Comput.-Aid. Des. 30, 11, 1677--1690. Google ScholarDigital Library
- W. Huang, M. R. Stan, K. Skadron, K. Sankaranarayanan, and S. Ghosh. 2006. HotSpot: A compact thermal modeling method for CMOS VLSI systems. IEEE Trans. VLSI Syst. 14, 501--513. Google ScholarDigital Library
- Intel Corporation 2012. Intel 64 and IA-32 Architectures Software Developers Manual. Intel Corporation.Google Scholar
- C. Isci, G. Contreras, and M. Martonosi. 2006. Live, runtime phase monitoring and prediction on real systems with application to dynamic power management. In Proceedings of the International Symposium on Microarchitecture (MICRO). Google ScholarDigital Library
- H. Jung and M. Pedram. 2006. Stochastic dynamic thermal management: A Markovian decision-based approach. In Proceedings of ICCD. 452--457.Google Scholar
- W.-Y. Liang, P.-T. Lai, and C. W. Chiou. 2010. An energy conservation DVFs algorithm for the android operating system. J. Converg. 1, 1, 93--100.Google Scholar
- W. Liao, L. He, and K. M. Lepak. 2005. Temperature and supply voltage aware performance and power modeling at microarchitecture level. IEEE Trans. Comput.-Aid. Des. 24, 1042--1053. Google ScholarDigital Library
- P. S. Maybeck. 1979. Stochastic Models, Estimation, and Control. Mathematics in Science and Engineering. Academic Press. Google ScholarDigital Library
- S. Murali, A. Mutapcic, D. Atienza, R. Gupta, S. Boyd, and G. D. Micheli. 2007. Temperature-aware processor frequency assignment for MPSoCs using convex optimization. In Proceedings of CODES. 111--116. Google ScholarDigital Library
- J. Nocedal and S. J. Wright. 2006. Numerical Optimization 2nd Ed. Springer.Google Scholar
- V. Pallipadi and A. Starikovskiy. 2006. The ondemand governor - past, present, and future. In Proceedings of the Linux Symposium, Vol. 2, 223--238.Google Scholar
- R. Rao and S. Vrudhula. 2009. Fast and accurate prediction of the steady state throughput of multi-core processors under thermal constraints. IEEE Trans. Comput.-Aid. Des. 28, 1559--1572. Google ScholarDigital Library
- E. Rotem, A. Naveh, D. Rajwan, A. Ananthakrishnan, and E. Weissmann. 2012. Power-management architecture of the intel microarchitecture code-named sandy bridge. Micro, IEEE 32, 2, 20--27. Google ScholarDigital Library
- K. Skadron, T. Abdelzaher, and M. R. Stan. 2002. Control-theoretic techniques and thermal-RC modeling for accurate and localized dynamic thermal management. In Proceedings of HPCA. 17--28. Google ScholarDigital Library
- D. C. Snowdon, E. Le Sueur, S. M. Petters, and G. Heiser. 2009. Koala: A platform for OS-level power management. In Proceedings of the European Conference on Computer Systems (EuroSys'09). ACM, New York, 289--302. Google ScholarDigital Library
- SPEC CPU2006 Benchmarks. http://www.spec.org/cpu2006.Google Scholar
- Y. Wang, K. Ma, and X. Wang. 2009. Temperature-constrained power control for chip multiprocessors with online model estimation. SIGARCH Comput. Archit. News 37, 314--324. Google ScholarDigital Library
- P. Whittle and T. J. Sargent. 1983. Prediction and Regulation by Linear Least-Square Methods 2nd Ed. University of Minnesota Press.Google Scholar
- F. Zanini, D. Atienza, L. Benini, and G. De Micheli. 2009. Multicore thermal management with model predictive control. In Proceedings of the European Conference on Circuit Theory and Design, 2009. ECCTD 2009. 711--714.Google Scholar
Index Terms
- STEAM: A Smart Temperature and Energy Aware Multicore Controller
Recommendations
Energy-Efficient Operation of Multicore Processors by DVFS, Task Migration, and Active Cooling
Energy efficiency has taken center stage in all aspects of computing, regardless of whether it is performed on a portable battery-powered device, a desktop PC, on servers in a data center, or on a supercomputer. It is expressed as performance-per-watt (...
EDF-hv: An Energy-Efficient Semi-Partitioned Approach for Hard Real-Time Systems
RTNS '16: Proceedings of the 24th International Conference on Real-Time Networks and SystemsThe shift from uniprocessor to multi-core architectures has made it more difficult to design predictable hard real-time systems (HRTS) and achieving high processor utilization remains a major challenge. As energy efficiency becomes an important design ...
Improving Energy Efficiency in Memory-constrained Applications Using Core-specific Power Control
E2SC'17: Proceedings of the 5th International Workshop on Energy Efficient SupercomputingPower is increasingly the limiting factor in High Performance Computing (HPC) at Exascale and will continue to influence future advancements in supercomputing. Recent processors equipped with on-board hardware counters allow real time monitoring of ...
Comments