skip to main content
10.1145/3149412.3149413acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

Dynamic Application-aware Power Capping

Published: 12 November 2017 Publication History

Abstract

A future large-scale high-performance computing (HPC) cluster will likely be power capped since the surrounding infrastructure like power supply and cooling is constrained. For such a cluster, it may be impossible to supply thermal design power (TDP) to all components. The default power supply of current system guarantees TDP to each computing node will become unfeasible. Power capping was introduced to limit power consumption to a value below TDP, with the drawback of resulting performance limitations. We developed an alternative dynamic application-aware power scheduling (DAPS) strategy to enforce a predetermined power limit and at the same time improve the cluster-wide performance. The power scheduling decision is guided by the cap value, the hardware usage, and the application-specific performance sensitivity to power. Applying DAPS on a test platform comprising 12 computing nodes with three representative applications, we obtained a performance improvement up to 17% compared to a strategy that distributes power equally and statically across nodes.

References

[1]
Axel Auweter, Arndt Bode, Matthias Brehm, Luigi Brochard, Nicolay Hammer, Herbert Huber, Raj Panda, Francois Thomas, and Torsten Wilde. 2014. A case study of energy aware scheduling on supermuc. In International Supercomputing Conference. Springer, 394--409.
[2]
Karel De Vogeleer, Gerard Memmi, Pierre Jouvelot, and Fabien Coelho. 2013. The energy/frequency convexity rule: Modeling and experimental validation on mobile devices. In International Conference on Parallel Processing and Applied Mathematics. Springer, 793--803.
[3]
Qingyuan Deng, David Meisner, Abhishek Bhattacharjee, Thomas F Wenisch, and Ricardo Bianchini. 2012. Coscale: Coordinating cpu and memory system dvfs in server systems. In Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture. IEEE Computer Society, 143--154.
[4]
Martin Dimitrov. 2012. Intel Power Governor. https://software.intel.com/en-us/articles/intel-power-governor. (2012).
[5]
Daniel A Ellsworth, Allen D Malony, Barry Rountree, and Martin Schulz. 2015. Dynamic power sharing for higher job throughput. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. ACM, 80.
[6]
Institute for Combustion Technologe. 2015. Psopen. http://www.fz-juelich.de/ias/jsc/EN/Expertise/High-Q-Club/psOpen/_node.html. (2015).
[7]
SPEC HPG. 2007. SPEC MPI2007. https://www.spec.org/mpi2007/. (2007).
[8]
SPEC HPG. 2012. SPEC OMP2012. https://www.spec.org/omp2012/. (2012).
[9]
Tapasya Patki. 2013. librapl. https://github.com/tpatki/librapl/tree/master/libmsr. (2013).
[10]
Tapasya Patki, David K Lowenthal, Barry Rountree, Martin Schulz, and Bronis R De Supinski. 2013. Exploring hardware overprovisioning in power-constrained, high performance computing. In Proceedings of the 27th international ACM conference on International conference on supercomputing. ACM, 173-182.
[11]
Tapasya Patki, David K Lowenthal, Anjana Sasidharan, Matthias Maiterth, Barry L Rountree, Martin Schulz, and Bronis R De Supinski. 2015. Practical resource management in power-constrained, high performance computing. In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 121-132.
[12]
Barry Rountree, Dong H Ahn, Bronis R De Supinski, David K Lowenthal, and Martin Schulz. 2012. Beyond DVFS: A first look at performance under a hardware-enforced power bound. In Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2012 IEEE 26th International. IEEE, 947-953.
[13]
Barry Rountree, David K Lowenthal, Shelby Funk, Vincent W Freeh, Bronis R De Supinski, and Martin Schulz. 2007. Bounding energy consumption in large-scale MPI programs. In Proceedings of the 2007 ACM/IEEE conference on Supercomputing. ACM, 49.
[14]
Barry Rountree, David K Lownenthal, Bronis R De Supinski, Martin Schulz, Vincent W Freeh, and Tyler Bletsch. 2009. Adagio: making DVS practical for complex HPC applications. In Proceedings of the 23rd international conference on Supercomputing. ACM, 460-469.
[15]
Osman Sarood, Akhil Langer, Abhishek Gupta, and Laxmikant Kale. 2014. Maximizing throughput of overprovisioned hpc data centers under a strict power budget. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Press, 807-818.
[16]
Robert Schöne and Daniel Hackenberg. 2011. On-line analysis of hardware performance events for workload characterization and processor frequency scaling decisions. In Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering. ACM, 481-486.
[17]
Bo Wang, Dirk Schmidl, and Matthias S Müller. 2015. Evaluating the energy consumption of openmp applications on Haswell processors. In International Workshop on OpenMP. Springer, 233-246.

Cited By

View all
  • (2022)Evaluation of Heuristics to Manage a Data Center Under Power Constraints2022 IEEE 13th International Green and Sustainable Computing Conference (IGSC)10.1109/IGSC55832.2022.9969362(1-8)Online publication date: 24-Oct-2022
  • (2021)Efficient and Precise Profiling, Modeling and Management on Power and Performance for Power Constrained HPC SystemsIEICE Transactions on Electronics10.1587/transele.2020LHP0005E104.C:6(237-246)Online publication date: 1-Jun-2021
  • (2020)Exploring the Potential of using Power as a First Class Parameter for Resource Allocation in Apache Mesos Managed Clouds2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC)10.1109/UCC48980.2020.00040(216-226)Online publication date: Dec-2020
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
E2SC'17: Proceedings of the 5th International Workshop on Energy Efficient Supercomputing
November 2017
84 pages
ISBN:9781450351324
DOI:10.1145/3149412
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 November 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. RAPL
  2. TDP
  3. overprovisioning
  4. performance optimization
  5. power capping
  6. power scheduling

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SC '17
Sponsor:

Acceptance Rates

E2SC'17 Paper Acceptance Rate 10 of 21 submissions, 48%;
Overall Acceptance Rate 17 of 33 submissions, 52%

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Evaluation of Heuristics to Manage a Data Center Under Power Constraints2022 IEEE 13th International Green and Sustainable Computing Conference (IGSC)10.1109/IGSC55832.2022.9969362(1-8)Online publication date: 24-Oct-2022
  • (2021)Efficient and Precise Profiling, Modeling and Management on Power and Performance for Power Constrained HPC SystemsIEICE Transactions on Electronics10.1587/transele.2020LHP0005E104.C:6(237-246)Online publication date: 1-Jun-2021
  • (2020)Exploring the Potential of using Power as a First Class Parameter for Resource Allocation in Apache Mesos Managed Clouds2020 IEEE/ACM 13th International Conference on Utility and Cloud Computing (UCC)10.1109/UCC48980.2020.00040(216-226)Online publication date: Dec-2020
  • (2020)Operation-Aware Power CappingEuro-Par 2020: Parallel Processing10.1007/978-3-030-57675-2_5(68-82)Online publication date: 24-Aug-2020
  • (2019)Understanding the Impact of Dynamic Power Capping on Application Progress2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2019.00088(793-804)Online publication date: May-2019
  • (2019)Performance Prediction for Power-Capped Applications based on Machine Learning Algorithms2019 International Conference on High Performance Computing & Simulation (HPCS)10.1109/HPCS48598.2019.9188144(842-849)Online publication date: Jul-2019
  • (2018)Optimization of resources in parallel systems using a multiobjective artificial bee colony algorithmThe Journal of Supercomputing10.1007/s11227-018-2407-574:8(4019-4036)Online publication date: 1-Aug-2018

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media