Abstract
Server power and cooling power amount to a significant fraction of modern data centers' recurring costs. While data centers provision enough servers to guarantee response times under the maximum loading, data centers operate under much less loading most of the times (e.g., 30-70% of the maximum loading). Previous server-power proposals exploit this under-utilization to reduce the server idle power by keeping active only as many servers as necessary and putting the rest into low-power standby modes. However, these proposals incur higher cooling power due to hot spots created by concentrating the data center loading on fewer active servers, or degrade response times due to standby-to-active transition delays, or both. Other proposals optimize the cooling power but incur considerable idle power. To address the first issue of power, we propose PowerTrade, which trades-off idle power and cooling power for each other, thereby reducing the total power. To address the second issue of response time, we propose SurgeGuard to overprovision the number of active servers beyond that needed by the current loading so as to absorb future increases in the loading. SurgeGuard is a two-tier scheme which uses well-known over-provisioning at coarse time granularities (e.g., one hour) to absorb the common, smooth increases in the loading, and a novel fine-grain replenishment of the over-provisioned reserves at fine time granularities (e.g., five minutes) to handle the uncommon, abrupt loading surges. Using real-world traces, we show that combining PowerTrade and SurgeGuard reduces total power by 30% compared to previous low-power schemes while maintaining response times within 1.7%.
- AirPAK. Computational fluid dynamics (CFD) software by Ansys Inc. http://www.ansys.com/products/airpak/default.asp.Google Scholar
- O. Allen. Probability, statistics and queuing theory with computer science applications. 1990. Google ScholarDigital Library
- L. A. Barroso and U. Hölzle. The case for energy-proportional computing. IEEE Computer, 40(12):33--37, 2007. Google ScholarDigital Library
- C. Belady. Green grid data center power efficiency metrics, PUE and DCIE. White paper: Metrics & Measurements. http://www.thegreengrid.org, 2007.Google Scholar
- P. Bohrer, E. Elnozahy, T. Keller, M. Kistler, C. Lefurgy, C. McDowell, and R. Rajamony. The case for power management in web servers. Power aware computing, 2002. Google ScholarDigital Library
- G. Bolch, S. Greiner, H. Meer, and K. S. Trivedi. Queuing networks and markov chains: Modeling and performance evaluation with computer science applications. 1998. Google ScholarDigital Library
- J. S. Chase, D. C. Anderson, P. N. Thakar, A. M. Vahdat, and R. P. Doyle. Managing energy and server resources in hosting centers. In SOSP '01: Proceedings of the eighteenth ACM symposium on Operating systems principles, pages 103--116, 2001. Google ScholarDigital Library
- Y. Chen, A. Das, W. Qin, A. Sivasubramaniam, Q. Wang, and N. Gautam. Managing server energy and operational costs in hosting centers. In SIGMETRICS '05: Proceedings of ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, vol. 33, pages 303--314, 2005. Google ScholarDigital Library
- J. Choi, Y. Kim, A. Sivasubramaniam, J. Srebric, Q. Wang, and J. Lee. Modeling and managing thermal profiles of rack mounted servers with thermostat. In HPCA '07: High Performance Computer Architecture, pages 205--215, 2007. Google ScholarDigital Library
- Standard Performance Evaluation Corporation. http://www.spec.org/power_ssj2008/results/power_ssj2008.html.Google Scholar
- Linux Documentation. latest release 2.6, http://www.kernel.org/doc/documentation/power/states.txt.Google Scholar
- E. Elnozahy, M. Kistler, and R. Rajamony. Energy-efficient server clusters. In Proceedings of the 2nd Workshop on Power-Aware Computing Systems, pages 179--196, 2002. Google ScholarDigital Library
- U.S. EPA. Report to congress on server and data center energy efficiency. In U.S. Environmental Protection Agency, Tech Report, 2007.Google Scholar
- X. Fan, W.-D. Weber, and L. A. Barroso. Power provisioning for a warehouse-sized computer. In 34th International Symposium on Computer Architecture, pages 13--23, 2007. Google ScholarDigital Library
- J. Hamilton. Internet-scale service infrastructure efficiency. In ISCA '09 keynote, http://perspectives.mvdirona.com/2008/11/28/CostOfPowerInLargeScaleDataCenters.aspx, 2009. Google ScholarDigital Library
- T. Heath, B. Diniz, E. V. Carrera, W. M. Jr., and R. Bianchini. Energy conservation in heterogeneous server clusters. In PPoPP '05: 10th ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 186--195, 2005. Google ScholarDigital Library
- Traces in the Internet Traffic Archive. http://ita.ee.lbl.gov/html/traces.html.Google Scholar
- W. LeFebvre. CNN.com: Facing a world crisis. http://www.tcsa.org/lisa2001/cnn.txt.Google Scholar
- K. Lim, P. Ranganathan, J. Chang, C. Patel, T. Mudge, and S. Reinhardt. Understanding and designing new server architectures for emerging warehouse-computing environments. In ISCA '08: Proceedings of the 35th International Symposium on Computer Architecture, pages 315--326, 2008. Google ScholarDigital Library
- D. Meisner, B. T. Gold, and T. F. Wenisch. Powernap: Eliminating server idle power. In ASPLOS '09: Proceeding of the 14th conference on Architectural support for programming languages and operating systems, pages 205--216, 2009. Google ScholarDigital Library
- J. Moore, J. Chase, P. Ranganathan, and R. Sharma. Making scheduling "cool": Temperature-aware workload placement in data centers. In Proceedings of USENIX, 2005. Google ScholarDigital Library
- J. Moore, J. Chase, and P. Ranganathan. Weatherman: Automated, online and predictive thermal mapping and management for data centers. In ICAC '06: Proceedings of IEEE Conference on Autonomic Computing, pages 155--164, 2006. Google ScholarDigital Library
- T. Mukherjee, Q. Tang, C. Ziesman, S. K. S. Gupta, and P. Cayton. Software architecture for dynamic thermal management in data centers. In COMSWARE, 2007.Google Scholar
- R. Nathuji and K. Schwan. Virtualpower: Coordinated power management in virtualized enterprise systems. In SOSP '07: Proceedings of 21st ACM SIGOPS symposium on Operating systems principles, pages 265--278, 2007. Google ScholarDigital Library
- C. D. Patel, C. E. Bash, R. K. Sharma, and A. Beitelmal. Smart cooling of data centers. In Proceedings of the Pacific RIM/ASME International Electronics Packaging Technical Conference and Exhibition, IPACK, 2003.Google ScholarCross Ref
- E. Pinheiro, R. Bianchini, E. V. Carrera, and T. Heath. Load balancing and unbalancing for power and performance in cluster-based systems. In Workshop on Compilers and Operating Systems for Low Power, 2001.Google Scholar
- R. Raghavendra, P. Ranganathan, V. Talwar, Z. Wang, and X. Zhu. No "power" struggles: Coordinated multi-level power management for the data center. In ASPLOS XIII: Proceedings of the 13th conference on Architectural support for programming languages and operating systems, pages 48--59, 2008. Google ScholarDigital Library
- L. Ramos and R. Bianchini. C-Oracle: Predictive thermal management for data centers. In Proceedings of High-Performance Computer Architecture, HPCA, pages 111--122, 2008.Google ScholarCross Ref
- P. Ranganathan, P. Leech, D. Irwin, and J. Chase. Ensemble-level power management for dense blade servers. ISCA '06: Proceedings of the 33rd annual international symposium on Computer Architecture, 34(2):66--77, 2006. Google ScholarDigital Library
- C. Rusu, A. Ferreira, C. Scordino, and A. Watson. Energy-efficient real-time heterogeneous server clusters. In RTAS '06: Proceedings of the 12th IEEE Real-Time and Embedded Technology and Applications Symposium, pages 418--428, 2006. Google ScholarDigital Library
- R. K. Sharma, C. Bash, C. D. Patel, R. Friedrich, and J. Chase. Balance of power: Dynamic thermal management for internet data centers. IEEE Internet Computing, 9(1):42--49, 2005. Google ScholarDigital Library
- V. Sharma, A. Thomas, T. Abdelzaher, K. Skadron, and Z. Lu. Power-aware QoS management in web servers. In RTSS '03: Proceedings of the 24th International IEEE Real-Time Systems Symposium, pages 63--72, 2003. Google ScholarDigital Library
- R. F. Sullivan. Alternating cold and hot aisles provides more reliable cooling for server farms. In Uptime Institute, 2000.Google Scholar
- B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in shared hosting platforms. SIGOPS Oper. Syst. Rev., 36(SI):239--254, 2002. Google ScholarDigital Library
- P. Welch. On a generalized M/G/1 queuing process in which the first customer of each busy period receives exceptional service. In Operations Research, vol. 12, pages 736--752, 1964.Google ScholarDigital Library
Index Terms
- Joint optimization of idle and cooling power in data centers while maintaining response time
Recommendations
Joint optimization of idle and cooling power in data centers while maintaining response time
ASPLOS '10Server power and cooling power amount to a significant fraction of modern data centers' recurring costs. While data centers provision enough servers to guarantee response times under the maximum loading, data centers operate under much less loading most ...
Joint optimization of idle and cooling power in data centers while maintaining response time
ASPLOS XV: Proceedings of the fifteenth International Conference on Architectural support for programming languages and operating systemsServer power and cooling power amount to a significant fraction of modern data centers' recurring costs. While data centers provision enough servers to guarantee response times under the maximum loading, data centers operate under much less loading most ...
Optimal power allocation in server farms
SIGMETRICS '09Server farms today consume more than 1.5% of the total electricity in the U.S. at a cost of nearly $4.5 billion. Given the rising cost of energy, many industries are now seeking solutions for how to best make use of their available power. An important ...
Comments