skip to main content
10.1145/2494621.2494628acmotherconferencesArticle/Chapter ViewAbstractPublication PagescacConference Proceedingsconference-collections
research-article

A virtual machine re-packing approach to the horizontal vs. vertical elasticity trade-off for cloud autoscaling

Published: 09 August 2013 Publication History

Abstract

An automated solution to horizontal vs. vertical elasticity problem is central to make cloud autoscalers truly autonomous. Today's cloud autoscalers are typically varying the capacity allocated by increasing and decreasing the number of virtual machines (VMs) of a predefined size (horizontal elasticity), not taking into account that as load varies it may be advantageous not only to vary the number but also the size of VMs (vertical elasticity). We analyze the price/performance effects achieved by different strategies for selecting VM-sizes for handling increasing load and we propose a cost-benefit based approach to determine when to (partly) replace a current set of VMs with a different set. We evaluate our repacking approach in combination with different auto-scaling strategies. Our results show a range of 7% up to 60% cost saving in total resource utilization cost of our sample applications and workloads.

References

[1]
Amazon EC2: http://aws.amazon.com/ec2/instance-types/.
[2]
Rightscale cloud management. http://www.rightscale.com/, 2012.
[3]
A. Ali-Eldin, J. Tordsson, and E. Elmroth. An adaptive hybrid elasticity controller for cloud infrastructures. In Network Operations and Management Symposium (NOMS), 2012 IEEE, pages 204--212. IEEE, 2012.
[4]
D. Breitgand, Z. Dubitzky, A. Epstein, A. Glikson, and I. Shapira. Sla-aware resource over-commit in an iaas cloud. In 8th international conference on systems virtualiztion management (svm) Network and service management (cnsm), 2012, pages 73--81, Oct.
[5]
R. Chi, Z. Qian, and S. Lu. A game theoretical method for auto-scaling of multi-tiers web applications in cloud. In Proceedings of the Fourth Asia-Pacific Symposium on Internetware, Internetware '12, pages 3:1--3:10, New York, NY, USA, 2012. ACM.
[6]
T. N. B. Duong, X. Li, R. Goh, X. Tang, and W. Cai. Qos-aware revenue-cost optimization for latency-sensitive services in iaas clouds. In 16th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), 2012 IEEE/ACM, pages 11--18, Oct.
[7]
H. Ghanbari, B. Simmons, M. Litoiu, C. Barna, and G. Iszlai. Optimal autoscaling in a IaaS cloud. In Proceedings of the 9th international conference on Autonomic computing, pages 173--178. ACM, 2012.
[8]
I. Goiri, J. Guitart, and J. Torres. Characterizing cloud federation for enhancing providers' profit. In 3rd International Conference on Cloud Computing (CLOUD), 2010 IEEE, pages 123--130. IEEE, 2010.
[9]
Z. Gong, X. Gu, and J. Wilkes. Press: Predictive elastic resource scaling for cloud systems. In International Conference on Network and Service Management (CNSM), 2010, pages 9--16. IEEE, 2010.
[10]
G. Jung, K. Joshi, M. Hiltunen, R. Schlichting, and C. Pu. A cost-sensitive adaptation engine for server consolidation of multitier applications. Middleware 2009, pages 163--183, 2009.
[11]
J. Kupferman, J. Silverman, P. Jara, and J. Browne. Scaling into the cloud. CS270-Advanced Operating Systems, 2009.
[12]
W. Li, J. Tordsson, and E. Elmroth. Modeling for dynamic cloud scheduling via migration of virtual machines. In Proceedings of Third International Conference on Cloud Computing Technology and Science, CLOUDCOM '11, pages 163--171, 2011.
[13]
W. Li, J. Tordsson, and E. Elmroth. Virtual machine placement for predictable and time-constrained peak loads. In Proceedings of the 8th international conference on Economics of Grids, Clouds, Systems, and Services, GECON'11, pages 120--134, Berlin, Heidelberg, 2012. Springer-Verlag.
[14]
T. Lorido-Botrán, J. Miguel-Alonso, and J. A. Lozano. Auto-scaling techniques for elastic applications in cloud environments. Technical report, Department of Computer Architecture and Technology, UPV/EHU, 2012.
[15]
M. Mao and M. Humphrey. Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In Proceedings of International Conference for High Performance Computing, Networking, Storage and Analysis, pages 49:1--49:12, New York, NY, USA, 2011.
[16]
M. Maurer, I. Brandic, and R. Sakellariou. Enacting slas in clouds using rules. Euro-Par 2011 Parallel Processing, pages 455--466, 2011.
[17]
M. Mazzucco, D. Dyachuk, and R. Deters. Maximizing cloud providers' revenues via energy aware allocation policies. In 3rd International Conference on Cloud Computing (CLOUD), 2010 IEEE, pages 131--138. IEEE, 2010.
[18]
P. Padala, K.-Y. Hou, K. G. Shin, X. Zhu, M. Uysal, Z. Wang, S. Singhal, and A. Merchant. Automated control of multiple virtualized resources. In Proceedings of the 4th ACM European conference on Computer systems, pages 13--26. ACM, 2009.
[19]
T. Patikirikorala and A. Colman. Feedback controllers in the cloud. Swinburne University, 2011.
[20]
M. Salehi and R. Buyya. Adapting market-oriented scheduling policies for cloud computing. Algorithms and Architectures for Parallel Processing, pages 351--362, 2010.
[21]
M. Sedaghat, F. Hernández, and E. Elmroth. Unifying cloud management: Towards overall governance of business level objectives. In 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pages 591--597. IEEE, 2011.
[22]
U. Sharma, P. Shenoy, S. Sahu, and A. Shaikh. Kingfisher: A system for elastic cost-aware provisioning in the cloud. Dept. of CS, UMASS, Tech. Rep. UM-CS-2010-005, 2010.
[23]
J. M. Tirado, D. Higuero, F. Isaila, and J. Carretero. Multi-model prediction for enhancing content locality in elastic server infrastructures. In 18th International Conference on High Performance Computing (HiPC), pages 1--9. IEEE, 2011.
[24]
J. Tordsson, R. S. Montero, R. Moreno-Vozmediano, and I. M. Llorente. Cloud brokering mechanisms for optimized placement of virtual machines across multiple providers. Future Gener. Comput. Syst., 28(2):358--367, Feb. 2012.
[25]
B. Urgaonkar, P. Shenoy, and T. Roscoe. Resource overbooking and application profiling in shared hosting platforms. ACM SIGOPS Operating Systems Review, 36(SI):239--254, 2002.
[26]
M. Zhu, Q. Wu, and Y. Zhao. A cost-effective scheduling algorithm for scientific workflows in clouds. In 31st International on Performance Computing and Communications Conference (IPCCC), 2012 IEEE, pages 256--265, Dec.

Cited By

View all
  • (2024)Proactive VNF Scaling and Placement in 5G O-RAN Using MLIEEE Transactions on Network and Service Management10.1109/TNSM.2023.329298621:1(174-186)Online publication date: Feb-2024
  • (2024)Priority-Aware Deployment of Autoscaling Service Function Chains Based on Deep Reinforcement LearningIEEE Transactions on Cognitive Communications and Networking10.1109/TCCN.2024.335856510:3(1050-1062)Online publication date: Jun-2024
  • (2023)Towards Proactive Risk-Aware Cloud Cost Optimization Leveraging Transient ResourcesIEEE Transactions on Services Computing10.1109/TSC.2023.325347316:4(3014-3026)Online publication date: 1-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CAC '13: Proceedings of the 2013 ACM Cloud and Autonomic Computing Conference
August 2013
247 pages
ISBN:9781450321723
DOI:10.1145/2494621
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • University of Arizona: University of Arizona
  • OGF: Open Grid Forum
  • Florida Intl University: Florida International Univeristy

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 August 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. autonomous computing
  2. autoscaling
  3. cloud computing
  4. horizontal elasticity
  5. vertical elasticity

Qualifiers

  • Research-article

Funding Sources

Conference

CAC '13
Sponsor:
  • University of Arizona
  • OGF
  • Florida Intl University
CAC '13: ACM Cloud and Autonomic Computing Conference
August 5 - 9, 2013
Florida, Miami, USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Proactive VNF Scaling and Placement in 5G O-RAN Using MLIEEE Transactions on Network and Service Management10.1109/TNSM.2023.329298621:1(174-186)Online publication date: Feb-2024
  • (2024)Priority-Aware Deployment of Autoscaling Service Function Chains Based on Deep Reinforcement LearningIEEE Transactions on Cognitive Communications and Networking10.1109/TCCN.2024.335856510:3(1050-1062)Online publication date: Jun-2024
  • (2023)Towards Proactive Risk-Aware Cloud Cost Optimization Leveraging Transient ResourcesIEEE Transactions on Services Computing10.1109/TSC.2023.325347316:4(3014-3026)Online publication date: 1-Jul-2023
  • (2023)HyScaler: A Dynamic, Hybrid VNF Scaling System for Building Elastic Service Function Chains Across Multiple ServersIEEE Transactions on Network and Service Management10.1109/TNSM.2023.327755220:4(4803-4814)Online publication date: Dec-2023
  • (2023)CILP: Co-Simulation-Based Imitation Learner for Dynamic Resource Provisioning in Cloud Computing EnvironmentsIEEE Transactions on Network and Service Management10.1109/TNSM.2023.326825020:4(4448-4460)Online publication date: Dec-2023
  • (2023)ML-Based Dynamic Scaling and Traffic Forecasting for 5G O-RAN2023 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech)10.1109/DASC/PiCom/CBDCom/Cy59711.2023.10361376(0444-0451)Online publication date: 14-Nov-2023
  • (2022)Tiny Autoscalers for Tiny Workloads: Dynamic CPU Allocation for Serverless Functions2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid54584.2022.00026(170-179)Online publication date: May-2022
  • (2022)Utilization aware and network I/O intensive virtual machine placement policies for cloud data centerJournal of Network and Computer Applications10.1016/j.jnca.2022.103442205:COnline publication date: 1-Sep-2022
  • (2021)Leveraging vCPU-utilization rates to select cost-efficient VMs for parallel workloadsProceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing10.1145/3468737.3494095(1-10)Online publication date: 6-Dec-2021
  • (2021)A Survey on the Use of Preferences for Virtual Machine Placement in Cloud Data CentersACM Computing Surveys10.1145/345051754:5(1-39)Online publication date: 25-May-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media