skip to main content
10.1145/2212908.2212913acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
research-article

A reconfigurable optical/electrical interconnect architecture for large-scale clusters and datacenters

Published: 15 May 2012 Publication History

Abstract

Hybrid optical/electrical interconnects, using commercially available optical circuit switches at the core part of the network, have been recently proposed as an attractive alternative to fully-connected electronically-switched networks in terms of port density, bandwidth/port, cabling and energy efficiency. Although the shift from a traditionally packet-switched core to switching between server aggregations (or servers) at circuit granularity requires system redesign, the approach has been shown to fit well to the traffic requirements of certain classes of high-performance computing applications, as well as to the traffic patterns exhibited by typical data center workloads. Recent proposals for such system designs have looked at small/medium scale hybrid interconnects. In this paper, we present a hybrid optical/electrical interconnect architecture intended for large-scale deployments of high-performance computing systems and server co-locations. To reduce complexity, our architecture employs a regular shuffle network topology that allows for simple management and cabling. Thanks to using a single-stage core interconnect and multiple optical planes, our design can be both incrementally scaled up (in capacity) and scaled out (in the number of racks) without requiring major re-cabling and network re-configuration. Also, we are the first to our knowledge to explore the benefit of using multi-hopping in the optical domain as a means to avoid constant reconfiguration of optical circuit switches. We have prototyped our architecture at packet-level detail in a simulation framework to evaluate this concept. Our results demonstrate that our hybrid interconnect, by adapting to the changing nature of application traffic, can significantly exceed the throughput of a static interconnect of equal degree, while at times attaining a throughput comparable to that of a costly fully-connected network. We also show a further benefit brought by multi-hopping, that it reduces the performance drops by reducing the frequency of reconfiguration.

References

[1]
K. Barker and D. Kerbyson. Performance analysis of an optical circuit switched network for peta-scale systems. In Euro-Par 2007 Parallel Processing, volume 4641 of Lecture Notes in Computer Science, pages 858--867. Springer Berlin / Heidelberg, 2007.
[2]
K. J. Barker, A. Benner, R. Hoare, A. Hoisie, A. K. Jones, D. K. Kerbyson, D. Li, R. Melhem, R. Rajamony, E. Schenfeld, S. Shao, C. Stunkel, and P. Walker. On the feasibility of optical circuit switching for high performance computing systems. In Proc. ACM/IEEE SC 2005 Conf. Supercomp., 2005.
[3]
T. Benson, A. Akella, and D. A. Maltz. Network traffic characteristics of data centers in the wild. In Proceedings of the 10th annual conference on Internet measurement, IMC '10, pages 267--280, New York, NY, USA, 2010. ACM.
[4]
R. Cypher, A. Ho, S. Konstantinidou, and P. Messina. Architectural requirements of parallel scientific applications with explicit communication. In Proc. 20th Annual Int Comp. Arch. Symp, pages 2--13, 1993.
[5]
N. Desai, P. Balaji, P. Sadayappan, and M. Islam. Are nonblocking networks really needed for high-end-computing workloads? In Proc. IEEE Int Cluster Computing Conf, pages 152--159, 2008.
[6]
N. Farrington, G. Porter, S. Radhakrishnan, H. H. Bazzaz, V. Subramanya, Y. Fainman, G. Papen, and A. Vahdat. Helios: a hybrid electrical/optical switch architecture for modular data centers. SIGCOMM Comput. Commun. Rev., 40:339--350, August 2010.
[7]
V. Gupta and E. Schenfeld. Performance analysis of a synchronous, circuit-switched interconnection cached network. In Proceedings of the 8th international conference on Supercomputing, ICS '94, pages 246--255, New York, NY, USA, 1994. ACM.
[8]
S. Kamil, L. Oliker, A. Pinar, and J. Shalf. Communication requirements and interconnect optimization for high-end scientific applications. IEEE Transactions on Parallel and Distributed Systems, 21(2):188--202, 2010.
[9]
S. Kamil, A. Pinar, D. Gunter, M. Lijewski, L. Oliker, and J. Shalf. Reconfigurable hybrid interconnection for static and dynamic scientific applications. In Proceedings of the 4th international conference on Computing frontiers, CF '07, pages 183--194, New York, NY, USA, 2007. ACM.
[10]
J. Kim, W. J. Dally, S. Scott, and D. Abts. Technology driven, highly scalable dragonfly topology. SIGARCH Comput. Archit. News, 36(3):77--88, June 2008.
[11]
S. Kim and A. V. Veidenbaum. On shortest path routing in single stage shuffle-exchange networks. In Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, SPAA '95, pages 298--307, New York, NY, USA, 1995. ACM.
[12]
L. Oliker, A. Canning, J. Carter, C. Iancu, M. Lijewski, S. Kamil, J. Shalf, H. Shan, E. Strohmaier, S. Ethier, and T. Goodale. Scientific application performance on candidate petascale platforms. In In Proc. of the International Parallel & Distributed Processing Symposium (IPDPS), 2007.
[13]
L. Schares, X. J. Zhang, R. Wagle, D. Rajan, P. Selo, S. P. Chang, J. Giles, K. Hildrum, D. Kuchta, J. Wolf, and E. Schenfeld. A reconfigurable interconnect fabric with optical circuit switch and software optimizer for stream computing systems. In Proc. Conf. Optical Fiber Communication - incudes post deadline papers OFC 2009, pages 1--3, 2009.
[14]
A. Singla, A. Singh, K. Ramachandran, L. Xu, and Y. Zhang. Feasibility study on topology malleable data center networks (dcn) using optical switching technologies. In Proc. and the National Fiber Optic Engineers Conf. Optical Fiber Communication Conf. and Exposition (OFC/NFOEC), pages 1--3, 2011.
[15]
J. Vetter and A. Yoo. An empirical performance evaluation of scalable scientific applications. In Supercomputing, ACM/IEEE 2002 Conference, page 16, nov. 2002.
[16]
G. Wang, D. G. Andersen, M. Kaminsky, K. Papagiannaki, T. E. Ng, M. Kozuch, and M. Ryan. c-through: part-time optics in data centers. SIGCOMM Comput. Commun. Rev., 40:327--338, August 2010.

Cited By

View all
  • (2020)Network Reconfiguration Algorithm (NRA) for scheduling communication-intensive graphs in heterogeneous computing environmentCluster Computing10.1007/s10586-019-03002-323:2(1419-1438)Online publication date: 1-Jun-2020
  • (2017)Optical Switching in Datacenters: Architectures Based on Optical Circuit SwitchingOptical Switching in Next Generation Data Centers10.1007/978-3-319-61052-8_2(23-44)Online publication date: 30-Aug-2017
  • (2015)HOSAProceedings of the 12th ACM International Conference on Computing Frontiers10.1145/2742854.2742877(1-8)Online publication date: 6-May-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
CF '12: Proceedings of the 9th conference on Computing Frontiers
May 2012
320 pages
ISBN:9781450312158
DOI:10.1145/2212908
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 15 May 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. interconnection networks
  2. multi-hop topologies
  3. optical switching

Qualifiers

  • Research-article

Conference

CF'12
Sponsor:
CF'12: Computing Frontiers Conference
May 15 - 17, 2012
Cagliari, Italy

Upcoming Conference

CF '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)14
  • Downloads (Last 6 weeks)5
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Network Reconfiguration Algorithm (NRA) for scheduling communication-intensive graphs in heterogeneous computing environmentCluster Computing10.1007/s10586-019-03002-323:2(1419-1438)Online publication date: 1-Jun-2020
  • (2017)Optical Switching in Datacenters: Architectures Based on Optical Circuit SwitchingOptical Switching in Next Generation Data Centers10.1007/978-3-319-61052-8_2(23-44)Online publication date: 30-Aug-2017
  • (2015)HOSAProceedings of the 12th ACM International Conference on Computing Frontiers10.1145/2742854.2742877(1-8)Online publication date: 6-May-2015
  • (2015)Large-scale hybrid electronic/optical switching networks for datacenters and HPC systems2015 IEEE 4th International Conference on Cloud Networking (CloudNet)10.1109/CloudNet.2015.7335288(87-93)Online publication date: Oct-2015
  • (2014)A reconfigurable, regular-topology cluster/datacenter network using commodity optical switchesFuture Generation Computer Systems10.5555/2747903.274820030:C(78-89)Online publication date: 1-Jan-2014
  • (2014)A reconfigurable, regular-topology cluster/datacenter network using commodity optical switchesFuture Generation Computer Systems10.1016/j.future.2013.04.01630(78-89)Online publication date: Jan-2014
  • (2013)Software-defined massive multicore networking via freespace optical interconnectProceedings of the ACM International Conference on Computing Frontiers10.1145/2482767.2482802(1-9)Online publication date: 14-May-2013
  • (2013)SDN control for hybrid OCS/electrical datacenter networks: An enabler or just a convenience?2013 IEEE Photonics Society Summer Topical Meeting Series10.1109/PHOSST.2013.6614529(242-243)Online publication date: Jul-2013
  • (2013)Seeds for a Heterogeneous InterconnectProceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPSW.2013.260(84-92)Online publication date: 20-May-2013
  • (2013)A Network Configuration Algorithm Based on Optimization of Kirchhoff IndexProceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing10.1109/IPDPS.2013.116(407-417)Online publication date: 20-May-2013
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media