skip to main content
extended-abstract

Controlling the number of active instances in a cloud environment

Published: 20 March 2018 Publication History

Abstract

We study a cloud environment in which computing instances may either be reserved in advance, or dynamically spawned to serve a fluctuating or unknown load. We first consider a centralized scheme where a system operator maintains the job queue and controls the spawning of additional capacity; through queueing models and their fluid and diffusion counterparts we explore the tradeoff between queueing delay and the service capacity variability. Secondly, we consider the setting of a dispatcher who must immediately send jobs, with no delay, to decentralized instances, and in addition may summon extra capacity. Here the capacity scaling problem couples with one of load balancing. We show how the popular join-the-idle-queue policy can be combined with an adequate rule for spawning instances, yielding an equilibrium with no queuing delay and controlling service capacity variability; we accommodate as well the case where spawned instances incur startup delay. Finally, we analyze the question of deciding, for a given pricing structure for the cloud service, how many fixed instances should be reserved in advance. The behavior of these policies is illustrated by simulations.

References

[1]
D. Gamarnik, J. N. Tsitsiklis, and M. Zubeldia. Delay, memory, and messaging tradeoffs in distributed service systems. In ACM SIGMETRICS 2016, Juan-les-Pins, France, pages 1-12, 2016.
[2]
D. Goldsztajn, A. Ferragut, and F. Paganini. A feedback control approach to dynamic speed scaling in computing systems. In CISS'17, Baltimore, MD, 2017.
[3]
M. Lin, A. Wierman, L. L. Andrew, and E. Thereska. Dynamic right-sizing for power-proportional data centers. IEEE/ACM Trans. Netw. 21(5):1378-1391, 2013.
[4]
Y. Lu, Q. Xie, G. Kliot, A. Geller, J. R. Larus, and A. Greenberg. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation, 68(11):1056-1071, 2011.
[5]
M. Mitzenmacher. The power of two choices in randomized load balancing. IEEE Trans. on Parallel Distrib. Syst, 12(10):1094-1104, 2001.
[6]
D. Mukherjee, S. Dhara, S. Borst, and J. van Leeuwaarden. Optimal service elasticity in large-scale distributed systems. In ACM SIGMETRICS 2017, Urbana-Champaign, IL, 2017.
[7]
N. D. Vvedenskaya, R. L. Dobrushin, and F. I. Karpelevich. Queueing system with selection of the shortest of two queues: An asymptotic approach. Problemy Peredachi Informatsii, 32(1):20-34, 1996.
[8]
L. Zheng, C. Joe-Wong, C. G. Brinton, C. W. Tan, S. Ha, and M. Chiang. On the Viability of a Cloud Virtual Service Provider. In ACM SIGMETRICS 2015, Portland, OR, pages 235-248, 2016.

Cited By

View all
  • (2024)Asynchronous Load Balancing and Auto-Scaling: Mean-Field Limit and Optimal DesignIEEE/ACM Transactions on Networking10.1109/TNET.2024.336813032:4(2960-2971)Online publication date: Aug-2024
  • (2024)An online bi-objective scheduling algorithm for service provisioning in cloud computingJournal of Network and Computer Applications10.1016/j.jnca.2023.103792222(103792)Online publication date: Feb-2024
  • (2023)Utility Maximizing Load Balancing PoliciesStochastic Systems10.1287/stsy.2022.010313:2(211-246)Online publication date: Jun-2023
  • Show More Cited By
  1. Controlling the number of active instances in a cloud environment

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM SIGMETRICS Performance Evaluation Review
    ACM SIGMETRICS Performance Evaluation Review  Volume 45, Issue 3
    December 2017
    253 pages
    ISSN:0163-5999
    DOI:10.1145/3199524
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 March 2018
    Published in SIGMETRICS Volume 45, Issue 3

    Check for updates

    Qualifiers

    • Extended-abstract

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Asynchronous Load Balancing and Auto-Scaling: Mean-Field Limit and Optimal DesignIEEE/ACM Transactions on Networking10.1109/TNET.2024.336813032:4(2960-2971)Online publication date: Aug-2024
    • (2024)An online bi-objective scheduling algorithm for service provisioning in cloud computingJournal of Network and Computer Applications10.1016/j.jnca.2023.103792222(103792)Online publication date: Feb-2024
    • (2023)Utility Maximizing Load Balancing PoliciesStochastic Systems10.1287/stsy.2022.010313:2(211-246)Online publication date: Jun-2023
    • (2021)Self-Learning Threshold-Based Load BalancingINFORMS Journal on Computing10.1287/ijoc.2021.110034:1(39-54)Online publication date: 16-Sep-2021
    • (2021)Joint Monitorless Load-Balancing and Autoscaling for Zero-Wait-Time in Data CentersIEEE Transactions on Network and Service Management10.1109/TNSM.2020.304505918:1(672-686)Online publication date: 1-Mar-2021
    • (2021)Automatic cloud instance provisioning with quality and efficiencyPerformance Evaluation10.1016/j.peva.2021.102209149:COnline publication date: 1-Sep-2021
    • (2020)Resource allocation based on redundancy models for high availability cloudComputing10.1007/s00607-019-00728-1102:1(43-63)Online publication date: 1-Jan-2020
    • (2019)An optimization approach to load balancing, scheduling and right sizing of cloud computing systems with data locality2019 IEEE 58th Conference on Decision and Control (CDC)10.1109/CDC40024.2019.9029663(1114-1119)Online publication date: Dec-2019
    • (2018)Analyzing the availability and performance of an e-health system integrated with edge, fog and cloud infrastructuresJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-018-0118-37:1(1-22)Online publication date: 1-Dec-2018
    • (2018)Feedback control of server instances for right sizing in the cloud2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)10.1109/ALLERTON.2018.8635636(749-756)Online publication date: 2-Oct-2018

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media