skip to main content
10.1145/3127479.3131211acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
abstract

Rethinking reinforcement learning for cloud elasticity

Published: 24 September 2017 Publication History

Abstract

Cloud elasticity, i.e., the dynamic allocation of resources to applications to meet fluctuating workload demands, has been one of the greatest challenges in cloud computing. Approaches based on reinforcement learning have been proposed but they require a large number of states in order to model complex application behavior. In this work we propose a novel reinforcement learning approach that employs adaptive state space partitioning. The idea is to start from one state that represents the entire environment and partition this into finer-grained states adaptively to the observed workload and system behavior following a decision-tree approach. We explore novel statistical criteria and strategies that decide both the correct parameters and the appropriate time to perform the partitioning.

References

[1]
AWS | Auto Scaling, https://aws.amazon.com/autoscaling/.
[2]
Lolos, K., et al. Elastic Resource Management with Adaptive State Space Partitioning of Markov Decision Processes. arXiv:1702.02978 [cs] (Feb. 2017).
[3]
Rao, J., et al. VCONF: a Reinforcement Learning Approach to Virtual Machines Auto-configuration. In ICAC (2009), ACM, pp. 137--146.
[4]
Shen, Z., Subbiah, S., Gu, X., and Wilkes, J. Cloudscale: Elastic Resource Scaling for Multi-Tenant Cloud Systems. In SoCC (2011), ACM, p. 5.
[5]
Verma, A., et al. Large-scale Cluster Management at Google with Borg. In EuroSys (2015), ACM, p. 18.

Cited By

View all
  • (2024)Deep reinforcement learning-based methods for resource scheduling in cloud computing: a review and future directionsArtificial Intelligence Review10.1007/s10462-024-10756-957:5Online publication date: 23-Apr-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing
September 2017
672 pages
ISBN:9781450350280
DOI:10.1145/3127479
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 September 2017

Check for updates

Qualifiers

  • Abstract

Conference

SoCC '17
Sponsor:
SoCC '17: ACM Symposium on Cloud Computing
September 24 - 27, 2017
California, Santa Clara

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Deep reinforcement learning-based methods for resource scheduling in cloud computing: a review and future directionsArtificial Intelligence Review10.1007/s10462-024-10756-957:5Online publication date: 23-Apr-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media