skip to main content
research-article

zUpdate: updating data center networks with zero loss

Published: 27 August 2013 Publication History

Abstract

Datacenter networks (DCNs) are constantly evolving due to various updates such as switch upgrades and VM migrations. Each update must be carefully planned and executed in order to avoid disrupting many of the mission-critical, interactive applications hosted in DCNs. The key challenge arises from the inherent difficulty in synchronizing the changes to many devices, which may result in unforeseen transient link load spikes or even congestions. We present one primitive, zUpdate, to perform congestion-free network updates under asynchronous switch and traffic matrix changes. We formulate the update problem using a network model and apply our model to a variety of representative update scenarios in DCNs. We develop novel techniques to handle several practical challenges in realizing zUpdate as well as implement the zUpdate prototype on OpenFlow switches and deploy it on a testbed that resembles real DCN topology. Our results, from both real-world experiments and large-scale trace-driven simulations, show that zUpdate can effectively perform congestion-free updates in production DCNs.

References

[1]
Floodlight. http://floodlight.openflowhub.org/.
[2]
MOSEK. http://mosek.com/.
[3]
OpenFlow 1.0. http://www.openflow.org/documents/openflow-spec-v1.0.0.pdf.
[4]
M. Al-Fares, A. Loukissas, and A. Vahdat. A Scalable, Commodity Data Center Network Architecture. In SIGCOMM'08.
[5]
M. Alizadeh, A. Greenberg, D. A. Maltz, J. Padhye, P. Patel, B. Prabhakar, S. Sengupta, and M. Sridharan. Data Center TCP DCTCP. In SIGCOMM'10.
[6]
C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Warfield. Live Migration of Virtual Machines. In NSDI'05.
[7]
A. R. Curtis, J. C. Mogul, J. Tourrilhes, P. Yalag, P. Sharma, and S. Banerjee. Devoflow: Scaling Flow Management for High-Performance Networks. In SIGCOMM'11.
[8]
N. Feamster and H. Balakrishnan. Detecting BGP Configuration Faults with Static Analysis. In NSDI'05.
[9]
P. Francois, O. Bonaventure, B. Decraene, and P. A. Coste. Avoiding Disruptions During Maintenance Operations on BGP Sessions. IEEE Trans. on Netw. and Serv. Manag., 2007.
[10]
S. Ghorbani and M. Caesar. Walk the Line: Consistent Network Updates with Bandwidth Guarantees. In HotSDN'12.
[11]
J. P. John, E. Katz-Bassett, A. Krishnamurthy, T. Anderson, and A. Venkataramani. Consensus Routing: the Internet as a Distributed System. In NSDI'08.
[12]
P. Kazemian, M. Chang, H. Zeng, G. Varghese, N. McKeown, and S. Whyte. Real Time Network Policy Checking Using Header Space Analysis. In NSDI'13.
[13]
P. Kazemian, G. Varghese, and N. McKeown. Header Space Analysis: Static Checking for Networks. In NSDI'12.
[14]
E. Keller, S. Ghorbani, M. Caesar, and J. Rexford. Live Migration of an Entire Network (and its hosts). In HotNets'12.
[15]
A. Khurshid, W. Zhou, M. Caesar, and P. B. Godfrey. Veriflow: Verifying Network-Wide Invariants in Real Time. In HotSDN'12.
[16]
H. Mai, A. Khurshid, R. Agarwal, M. Caesar, P. B. Godfrey, and S. T. King. Debugging the Data Plane with Anteater. In SIGCOMM'11.
[17]
S. Raza, Y. Zhu, and C.-N. Chuah. Graceful Network State Migrations. Networking, IEEE/ACM Transactions on, 2011.
[18]
M. Reitblatt, N. Foster, J. Rexford, C. Schlesinger, and D. Walker. Abstractions for Network Update. In SIGCOMM'12.
[19]
L. Vanbever, S. Vissicchio, C. Pelsser, P. Francois, and O. Bonaventure. Seamless Network-Wide IGP Migrations. In SIGCOMM'11.
[20]
X. Wu, D. Turner, C.-C. Chen, D. A. Maltz, X. Yang, L. Yuan, and M. Zhang. NetPilot: Automating Datacenter Network Failure Mitigation. In SIGCOMM'12.

Cited By

View all
  • (2024)Joint Request Updating and Elastic Resource Provisioning With QoS Guarantee in CloudsIEEE/ACM Transactions on Networking10.1109/TNET.2023.327688132:1(110-126)Online publication date: 1-Feb-2024
  • (2024)Congestion-Free Rerouting of Network Flows: Hardness and an FPT AlgorithmNOMS 2024-2024 IEEE Network Operations and Management Symposium10.1109/NOMS59830.2024.10575579(1-7)Online publication date: 6-May-2024
  • (2023)Distributed Controller Placement in Software-Defined Networks with Consistency and Interoperability ProblemsJournal of Electrical and Computer Engineering10.1155/2023/64669962023Online publication date: 1-Jan-2023
  • Show More Cited By

Index Terms

  1. zUpdate: updating data center networks with zero loss

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM SIGCOMM Computer Communication Review
      ACM SIGCOMM Computer Communication Review  Volume 43, Issue 4
      October 2013
      595 pages
      ISSN:0146-4833
      DOI:10.1145/2534169
      Issue’s Table of Contents
      • cover image ACM Conferences
        SIGCOMM '13: Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
        August 2013
        580 pages
        ISBN:9781450320566
        DOI:10.1145/2486001
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 27 August 2013
      Published in SIGCOMM-CCR Volume 43, Issue 4

      Check for updates

      Author Tags

      1. congestion
      2. data center network
      3. network update

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)125
      • Downloads (Last 6 weeks)21
      Reflects downloads up to 20 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Joint Request Updating and Elastic Resource Provisioning With QoS Guarantee in CloudsIEEE/ACM Transactions on Networking10.1109/TNET.2023.327688132:1(110-126)Online publication date: 1-Feb-2024
      • (2024)Congestion-Free Rerouting of Network Flows: Hardness and an FPT AlgorithmNOMS 2024-2024 IEEE Network Operations and Management Symposium10.1109/NOMS59830.2024.10575579(1-7)Online publication date: 6-May-2024
      • (2023)Distributed Controller Placement in Software-Defined Networks with Consistency and Interoperability ProblemsJournal of Electrical and Computer Engineering10.1155/2023/64669962023Online publication date: 1-Jan-2023
      • (2023)Optimizing incremental SDN upgrades for load balancing in ISP networksTheoretical Computer Science10.1016/j.tcs.2023.113927962(113927)Online publication date: Jun-2023
      • (2023)AllSynth: A BDD-based approach for network update synthesisScience of Computer Programming10.1016/j.scico.2023.102992230(102992)Online publication date: Aug-2023
      • (2022)Automatic generation of network function accelerators using component-based synthesisProceedings of the Symposium on SDN Research10.1145/3563647.3563656(89-97)Online publication date: 19-Oct-2022
      • (2022)AllSynth: Transiently Correct Network Update Synthesis Accounting for Operator PreferencesTheoretical Aspects of Software Engineering10.1007/978-3-031-10363-6_23(344-362)Online publication date: 2022
      • (2021)Applying Buffer to SDN Switches: Benefits Analysis and Mechanism DesignIEEE Transactions on Cloud Computing10.1109/TCC.2018.28466209:1(54-65)Online publication date: 1-Jan-2021
      • (2021)Online Joint Optimization on Traffic Engineering and Network Update in Software-defined WANsIEEE INFOCOM 2021 - IEEE Conference on Computer Communications10.1109/INFOCOM42981.2021.9488837(1-10)Online publication date: 10-May-2021
      • (2021)Loss-freedom, Order-preservation and No-buffering: Pick Any Two During Flow Migration in Network Functions2021 IEEE 29th International Conference on Network Protocols (ICNP)10.1109/ICNP52444.2021.9651954(1-11)Online publication date: 1-Nov-2021
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media