skip to main content
10.1145/2465829.2465834acmconferencesArticle/Chapter ViewAbstractPublication PagesicacConference Proceedingsconference-collections
research-article

Tradeoffs in compressing virtual machine checkpoints

Published:18 June 2013Publication History

ABSTRACT

Checkpoint replication is a prevalent way of maintaining virtual machine availability in the presence of host failures. Since checkpoint replication can impose heavy load on network resources, checkpoint compression has been suggested to reduce network usage. This paper presents the first detailed evaluation and characterization of the effectiveness and overheads of checkpoint compression methods for various workloads frequently seen in high-availability systems. We propose a lightweight compression method that exploits similarities in checkpoints to eliminate redundant network traffic, and compare it with two well-known methods, gzip and delta compression. Our results show that gzip and delta compression reduce network traffic significantly for various workloads, but incur high CPU and memory overheads, respectively. The proposed similarity compression is most effective for VM clusters running homogeneous workloads, while using both CPU and memory efficiently. Based on our extensive evaluation, we suggest guidelines for selecting and using these compression methods.

References

  1. FFmpeg. http://www.ffmpeg.org.Google ScholarGoogle Scholar
  2. HPC Challenge. http://icl.cs.utk.edu/hpcc.Google ScholarGoogle Scholar
  3. LVM2. http://sourceware.org/lvm2.Google ScholarGoogle Scholar
  4. The RUBiS benchmark. http://rubis.ow2.org.Google ScholarGoogle Scholar
  5. The TPC-C-like benchmark of VoltDB. http://community.voltdb.com/node/134.Google ScholarGoogle Scholar
  6. VMware fault tolerance (FT). http://www.vmware.com/products/fault-tolerance.Google ScholarGoogle Scholar
  7. VoltDB. http://community.voltdb.com.Google ScholarGoogle Scholar
  8. A. Agarwal, D. Shah, N. Kalmala, N. Panchaksharam, R. Bharadhwaj, S. Lokray, S. Sm, and T. Bean. Method and apparatus for transactional fault tolerance in a client-server system, Oct. 2009. Patent, US 7610510.Google ScholarGoogle Scholar
  9. S. Al-Kiswany, D. Subhraveti, P. Sarkar, and M. Ripeanu. VMFlock: virtual machine co-migration for the cloud. In Proc. of the 20th Symp. on High Performance Distributed Computing, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Anand, A. Gupta, A. Akella, S. Seshan, and S. Shenker. Packet caches on routers: the implications of universal redundant traffic elimination. In Proc. of the SIGCOMM Conf., 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Anand, V. Sekar, and A. Akella. SmartRE: an architecture for coordinated network-wide redundancy elimination. In Proc. of the SIGCOMM Conf., 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. C. Bressoud and F. B. Schneider. Hypervisor-based fault tolerance. ACM Trans. on Computer System., 14(1), 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchisonson, and A. Warfield. Remus: High-availability via asynchronous virtual machine replication. In Proc. of the 5th Symp. on Networked Systems Design and Implementation, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. U. Deshpande, X. Wang, and K. Gopalan. Live gang migration of virtual machines. In Proc. of the Symp. on High Performance Distributed Computing, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. B. Gerofi, Z. Vass, and Y. Ishikawa. Utilizing memory content similarity for improving the performance of replicated virtual machines. In Proc. of the 4th Conf. on Utility and Cloud Computing, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Gupta, S. Lee, M. Vrable, S. Savage, A. C. Snoeren, G. Varghese, G. M. Voelker, and A. Vahdat. Difference engine: Harnessing memory redundancy in virtual machines. In Proc. of the 8th Symp. on Operating Systems Design and Implementation, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K.-Y. Hou, M. Uysal, A. Merchant, K. G. Shin, and S. Singhal. HydraVM: Low-cost, transparent high availability for virtual machines. Technical report, HP Labs, 2011.Google ScholarGoogle Scholar
  18. R. Koller and R. Rangaswami. I/O Deduplication: Utilizing content similarity to improve I/O performance. In Proc. of the 8th Conf. on File and Storage Technologies, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M. Lu and T.-C. Chiueh. Fast memory state synchronization for virtualization-based fault tolerance. In Proc. of the 39th Conf. on Dependable Systems and Networks, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  20. D. T. Meyer, G. Aggarwal, B. Cully, G. Lefebvre, M. J. Feeley, N. C. Hutchinson, and A. Warfield. Parallax: virtual disks for virtual machines. In Proc. of the 3rd EuroSys Conf., 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. U. F. Minhas, S. R. B. Cully, A. Aboulnaga, K. Salem, and A. Warfield. RemusDB: Transparent high availability for database systems. PVLDB, 4(11), 2011.Google ScholarGoogle Scholar
  22. D. G. Murray, S. H, and M. A. Fetterman. Satori: Enlightened page sharing. In Proc. of the USENIX Annual Technical Conference, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. S. Quinlan and S. Dorward. Venti: A new approach to archival storage. In Proc. of the 1st Conf. on File and Storage Technologies, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Rajagopalan, B. Cully, R. O'Connor, and A. Warfield. Secondsite: disaster tolerance as a service. In Proc. of the 8th Conf. on Virtual Execution Environments, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Rhea, R. Cox, and A. Pesterev. Fast, inexpensive content-addressed storage in Foundation. In Proc. of the USENIX Annual Technical Conference, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. Riteau, C. Morin, and T. Priol. Shrinker: improving live migration of virtual clusters over WANs with distributed data deduplication and content-based addressing. In Proc. of the European Conference on Parallel Processing, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. P. Svard, B. Hudzia, J. Tordsson, and E. Elmroth. Evaluation of delta compression techniques for efficient live migration of large virtual machines. In Proc. of the 7th Conf. on Virtual Execution Environments, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Y. Tamura, K. Sato, S. Kihara, and S. Moriai. Kemari: Virtual machine synchronization for fault tolerance. In USENIX Annual Technical Conference (Poster), 2008.Google ScholarGoogle Scholar
  29. K. V. Vishwanath and N. Nagappan. Characterizing cloud computing hardware reliability. In Proc. of the 1st Symposium on Cloud Computing, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. C. A. Waldspurger. Memory resource management in VMware ESX server. In Proc. of the 5th Symp. on Operating Systems Design and Implementation, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Wood, K. K. Ramakrishnan, P. Shenoy, and J. Van der Merwe. Cloudnet: dynamic pooling of cloud resources by live WAN migration of virtual machines. In Proc. of the 7th Conf. on Virtual Execution Environments, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. X. Zhang, Z. Huo, J. Ma, and D. Meng. Exploiting data deduplication to accelerate live virtual machine migration. In Proc. of the International Conf. on Cluster Computing, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Tradeoffs in compressing virtual machine checkpoints

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          VTDC '13: Proceedings of the 7th international workshop on Virtualization technologies in distributed computing
          June 2013
          56 pages
          ISBN:9781450319850
          DOI:10.1145/2465829

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 18 June 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          VTDC '13 Paper Acceptance Rate5of10submissions,50%Overall Acceptance Rate5of10submissions,50%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader