skip to main content
10.1145/2168836.2168843acmconferencesArticle/Chapter ViewAbstractPublication PageseurosysConference Proceedingsconference-collections
research-article

GreenHadoop: leveraging green energy in data-processing frameworks

Authors Info & Claims
Published:10 April 2012Publication History

ABSTRACT

Interest has been growing in powering datacenters (at least partially) with renewable or "green" sources of energy, such as solar or wind. However, it is challenging to use these sources because, unlike the "brown" (carbon-intensive) energy drawn from the electrical grid, they are not always available. This means that energy demand and supply must be matched, if we are to take full advantage of the green energy to minimize brown energy consumption. In this paper, we investigate how to manage a datacenter's computational workload to match the green energy supply. In particular, we consider data-processing frameworks, in which many background computations can be delayed by a bounded amount of time. We propose GreenHadoop, a MapReduce framework for a datacenter powered by a photovoltaic solar array and the electrical grid (as a backup). GreenHadoop predicts the amount of solar energy that will be available in the near future, and schedules the MapReduce jobs to maximize the green energy consumption within the jobs' time bounds. If brown energy must be used to avoid time bound violations, GreenHadoop selects times when brown energy is cheap, while also managing the cost of peak brown power consumption. Our experimental results demonstrate that GreenHadoop can significantly increase green energy consumption and decrease electricity cost, compared to Hadoop.

References

  1. S. Akoush et al. Free Lunch: Exploiting Renewable Energy for Computing. In HotOS, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. B. Aksanli et al. Utilizing Green Energy Prediction to Schedule Mixed Batch and Service Jobs in Data Centers. In HotPower, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. H. Amur et al. Robust and Flexible Power-Proportional Storage. In SOCC, June 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Apache. Apache Hadoop. http://hadoop.apache.org/.Google ScholarGoogle Scholar
  5. Apache. Apache Nutch. http://nutch.apache.org/.Google ScholarGoogle Scholar
  6. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, December 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. DSIRE. Database of State Incentives for Renewables and Efficiency. http://www.dsireusa.org/.Google ScholarGoogle Scholar
  8. D. Feitelson et al. Parallel Job Scheduling -- A Status Report. In JSSPP, June 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Goiri et al. GreenSlot: Scheduling Energy Consumption in Green Datacenters. In Supercomputing, November 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Govindan et al. Benefits and Limitations of Tapping into Stored Energy for Datacenters. In ISCA, June 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Jossen et al. Operation conditions of batteries in PV applications. Solar Energy, 76 (6), 2004.Google ScholarGoogle Scholar
  12. K. Kant et al. Willow: A Control System for Energy and Thermal Adaptive Computing. In IPDPS, May 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. T. Kaushik et al. Evaluation and Analysis of GreenHDFS: A Self-Adaptive, Energy-Conserving Variant of the Hadoop Distributed File System. In CloudCom, December 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. A. Krioukov et al. Integrating Renewable Energy Using Data Analytics Systems: Challenges and Opportunities. Bulletin of the IEEE Computer Society Technical Committee, March 2011.Google ScholarGoogle Scholar
  15. A. Krioukov et al. Design and Evaluation of an Energy Agile Computing Cluster. Technical Report EECS-2012-13, University of California at Berkeley, January 2012.Google ScholarGoogle Scholar
  16. W. Lang and J. Patel. Energy Management for MapReduce Clusters. In VLDB, September 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. Le et al. Cost- And Energy-Aware Load Distribution Across Data Centers. In HotPower, October 2009.Google ScholarGoogle Scholar
  18. K. Le et al. Reducing Electricity Cost Through Virtual Machine Placement in High Performance Computing Clouds. In Supercomputing, November 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. K. Le et al. Capping the Brown Energy Consumption of Internet Services at Low Cost. In IGCC, August 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Leverich and C. Kozyrakis. On the Energy (In)efficiency of Hadoop Clusters. In HotPower, October 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. C. Li et al. SolarCore: Solar Energy Driven Multi-core Architecture Power Management. In HPCA, February 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Z. Liu et al. Greening Geographical Load Balancing. In SIGMETRICS, June 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Oikawa and R. Rajkumar. Linux/RK: A Portable Resource Kernel in Linux. In RTAS, May 1998.Google ScholarGoogle Scholar
  24. A. Qureshi et al. Cutting the Electric Bill for Internet-Scale Systems. In SIGCOMM, August 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. P. Ranganathan et al. Ensemble-level Power Management for Dense Blade Servers. In ISCA, June 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. N. Sharma et al. Cloudy Computing: Leveraging Weather Forecasts in Energy Harvesting Sensor Systems. In SECON, June 2010.Google ScholarGoogle ScholarCross RefCross Ref
  27. N. Sharma et al. Blink: Managing Server Clusters on Intermittent Power. In ASPLOS, March 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. SMA. Sunny Central 800CP, 2012.Google ScholarGoogle Scholar
  29. SolarBuzz. Marketbuzz, 2011.Google ScholarGoogle Scholar
  30. C. Stewart and K. Shen. Some Joules Are More Precious Than Others: Managing Renewable Energy in the Datacenter. In HotPower, October 2009.Google ScholarGoogle Scholar
  31. US Environmental Protection Agency. Report to Congress on Server and Data Center Energy Efficiency, August 2007.Google ScholarGoogle Scholar
  32. A. Yoo et al. SLURM: Simple Linux Utility for Resource Management. In JSPP, June 2003.Google ScholarGoogle ScholarCross RefCross Ref
  33. M. Zaharia et al. Job Scheduling for Multi-User MapReduce Clusters. In TR UCB/EECS-2009-55, Berkeley, August 2009.Google ScholarGoogle Scholar

Index Terms

  1. GreenHadoop: leveraging green energy in data-processing frameworks

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    EuroSys '12: Proceedings of the 7th ACM european conference on Computer Systems
    April 2012
    394 pages
    ISBN:9781450312233
    DOI:10.1145/2168836

    Copyright © 2012 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 10 April 2012

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Overall Acceptance Rate241of1,308submissions,18%

    Upcoming Conference

    EuroSys '24
    Nineteenth European Conference on Computer Systems
    April 22 - 25, 2024
    Athens , Greece

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader