Rethinking reinforcement learning for cloud elasticity

Authors: Konstantinos Lolos, Ioannis Konstantinou, Verena Kantere, Nectarios Koziris

SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing

Page 648

https://doi.org/10.1145/3127479.3131211

Published: 24 September 2017

Abstract

Cloud elasticity, i.e., the dynamic allocation of resources to applications to meet fluctuating workload demands, has been one of the greatest challenges in cloud computing. Approaches based on reinforcement learning have been proposed, but they require a large number of states in order to model complex application behavior. In this work we propose a novel reinforcement learning approach that employs adaptive state space partitioning. The idea is to start from a single state that represents the entire environment and to partition it adaptively into finer-grained states, based on the observed workload and system behavior, following a decision-tree approach. We explore novel statistical criteria and strategies that decide both the correct parameters and the appropriate time to perform the partitioning.
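The partitioning idea can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the learning rule or the statistical criteria, so the Q-learning update, the median split point, the Welch t-statistic test, and every name and default below (`Node`, `update`, `min_samples`, `t_crit`) are assumptions chosen only to make the example concrete. The tree starts as one leaf covering the whole environment, and a leaf is split on a parameter once the learning targets observed on the two sides of a candidate threshold differ significantly.

```python
import math
from dataclasses import dataclass, field
from statistics import mean, variance


@dataclass
class Node:
    """Tree node: a leaf is one RL state, an internal node tests one parameter."""
    n_actions: int
    q: list = field(default_factory=list)        # Q-value per action
    samples: list = field(default_factory=list)  # (observation, action, target)
    param: int = None        # index of the tested parameter (internal nodes)
    threshold: float = None  # split point on that parameter
    low: "Node" = None
    high: "Node" = None

    def __post_init__(self):
        if not self.q:
            self.q = [0.0] * self.n_actions

    def is_leaf(self):
        return self.param is None


def find_leaf(node, obs):
    """Descend from the root to the leaf (state) that contains obs."""
    while not node.is_leaf():
        node = node.high if obs[node.param] >= node.threshold else node.low
    return node


def welch_t(a, b):
    """Absolute Welch t-statistic between two samples; 0.0 if too few points."""
    if len(a) < 2 or len(b) < 2:
        return 0.0
    diff = abs(mean(a) - mean(b))
    se = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    if se == 0.0:
        return math.inf if diff > 0 else 0.0
    return diff / se


def best_split(leaf, n_params):
    """Assumed criterion: per parameter and action, split at the median and
    score how strongly the targets on the two sides differ."""
    best = (0.0, None, None)
    for p in range(n_params):
        for a in range(leaf.n_actions):
            rows = [(obs[p], t) for obs, act, t in leaf.samples if act == a]
            if len(rows) < 4:
                continue
            values = sorted(v for v, _ in rows)
            threshold = values[len(values) // 2]
            low = [t for v, t in rows if v < threshold]
            high = [t for v, t in rows if v >= threshold]
            score = welch_t(low, high)
            if score > best[0]:
                best = (score, p, threshold)
    return best


def update(root, obs, action, reward, next_obs,
           alpha=0.1, gamma=0.9, min_samples=20, t_crit=2.0):
    """One Q-learning step, followed by a split check on the visited leaf."""
    leaf = find_leaf(root, obs)
    target = reward + gamma * max(find_leaf(root, next_obs).q)
    leaf.q[action] += alpha * (target - leaf.q[action])
    leaf.samples.append((obs, action, target))
    if len(leaf.samples) >= min_samples:
        score, param, threshold = best_split(leaf, len(obs))
        if param is not None and score >= t_crit:
            # Children inherit the parent's Q-values and refine them later.
            leaf.low = Node(leaf.n_actions, q=list(leaf.q))
            leaf.high = Node(leaf.n_actions, q=list(leaf.q))
            leaf.param, leaf.threshold = param, threshold
            leaf.samples = []


def greedy(root, obs):
    """Index of the highest-valued action in the state containing obs."""
    q = find_leaf(root, obs).q
    return q.index(max(q))


def demo(rounds=10):
    """Toy workload: action 1 (scale out) pays off only when load >= 0.5."""
    root = Node(n_actions=2)
    for _ in range(rounds):
        for load in (0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9):
            for action in (0, 1):
                reward = 1.0 if (action == 1) == (load >= 0.5) else 0.0
                update(root, (load,), action, reward, (load,),
                       gamma=0.0, min_samples=16)
    return root
```

In the toy `demo()` run, the single initial state is split once on the load parameter, after which each of the two resulting states learns a different preferred action. A real deployment would observe several parameters (for example load, latency, and cluster size), and the choice of split test and of when to run it is exactly what the abstract says the work investigates.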

Published In

SoCC '17: Proceedings of the 2017 Symposium on Cloud Computing

September 2017

672 pages

ISBN:9781450350280

DOI:10.1145/3127479

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Abstract

Conference

SoCC '17: ACM Symposium on Cloud Computing

September 24 - 27, 2017

Santa Clara, California

Acceptance Rates

Overall Acceptance Rate 169 of 722 submissions, 23%
