skip to main content
10.1145/1362622.1362670acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP

Published: 10 November 2007 Publication History

Abstract

Due to the growing need to tolerate network faults and congestion in high-end computing systems, supporting multiple network communication paths is becoming increasingly important. However, multi-path communication comes with the disadvantage of out-of-order arrival of packets (because packets may traverse different paths). While modern networking stacks such as the Internet Wide-Area RDMA Protocol (iWARP) over 10-Gigabit Ethernet (10GE) support multi-path communication, their current implementations do not handle out-of-order packets primarily owing to the overhead on in-order communication that it adds. Specifically, in iWARP, supporting out-of-order packets requires every packet to carry additional information causing significant overhead on packets that arrive in-order. Thus, in this paper, we analyze the trade-offs in designing a feature-complete iWARP stack, i.e., one that provides support for out-of-order arriving packets, and thus, multi-path systems, while focusing on the performance of in-order communication. We propose three feature-complete designs of iWARP and analyze the pros and cons of each of these designs using performance experiments based on several micro-benchmarks as well as an iso-surface visual rendering application. Our analysis reveals that the iWARP design providing the best overall performance depends on the particular characteristics of the upper layers and that different designs are optimal based on the metric of interest.

References

[1]
InfiniBand Trade Association. http://www.infinibandta.com.
[2]
S. Bailey and T. Talpey. Remote Direct Data Placement (RDDP), April 2005.
[3]
C. L. Bajaj, V. Pascucci, D. Thompson, and X. Y. Zhang. Parallel Accelerated Isocontouring for Out-of-core Visualization. In Proceedings of the IEEE Symposium on Parallel Visualization and Graphics, pages 97--104, San Francisco, CA, Oct 1999.
[4]
P. Balaji, H. W. Jin, K. Vaidyanathan, and D. K. Panda. Supporting iWARP Compatibility and Features for Regular Network Adapters. In RAIT, 2005.
[5]
P. Balaji, H. V. Shah, and D. K. Panda. Sockets vs RDMA Interface over 10 Gigabit Networks: An In-depth analysis of the Memory Traffic Bottleneck. In RAIT, 2004.
[6]
M. D. Beynon, T. Kurc, U. Catalyurek, C. Chang, A. Sussman, and J. Saltz. Distributed Processing of Very Large Datasets with DataCutter. Parallel Computing, October 2001.
[7]
M. D. Beynon, T. Kurc, U. Catalyurek, and J. Saltz. A Component-based Implementation of Iso-surface Rendering for Visualizing Large Datasets. Report CS-TR-4249 and UMIACS-TR-2001-34, University of Maryland, Department of Computer Science and UMIACS, 2001.
[8]
R. V. Boppana and S. Chalasani. Fault-Tolerant Wormhole Routing Algorithms for Mesh Networks. IEEE Transactions on Computers, pages 848--864, July 1995.
[9]
S. Herrmann M. Castagnoli, G. Brauer. Optimization of cyclic redundancy-check codes with 24 and 32 paritybits. In IEEE Transactions on Communication, 1993.
[10]
A. Cohen, S. Rangarajan, and H. Slye. On the Performance of TCP Splicing for URL-aware Redirection. In USENIX '99.
[11]
P. Culley, U. Elzur, R. Recio, and S. Bailey. Marker PDU Aligned Framing for TCP Specification, November 2002.
[12]
D. Dalessandro, A. Devulapalli, and P. Wyckoff. Design and Implementation of the iWARP Protocol in Software. In PDCS '05.
[13]
W. Feng, P. Balaji, C. Baron, L. N. Bhuyan, and D. K. Panda. Performance Characterization of a 10-Gigabit Ethernet TOE. In Hotl, 2005.
[14]
A. Foong, H. Hum, T. Huff, J. Patwardhan, and G. Regnier. TCP/IP Performance Revisited. In ISPASS, 2003.
[15]
J. Gao and H. Shen. Parallel view dependent isosurface extraction using multi-pass occlusion culling. In Proceedings ACM/IEEE Symposium on Parallel and Large Data Visualization and Graphics. ACM SIGGRAPH, 2001.
[16]
G. Hurwitz and W. Feng. Initial End-to-End Performance Evaluation of 10-Gigabit Ethernet. In Hotl, 2003.
[17]
NetEffect Inc. http://www.neteffect.com/product-features. html.
[18]
H. W. Jin, P. Balaji, C. Yoo, J. Y. Choi, and D. K. Panda. Exploiting NIC Architectural Support for Enhancing IP based Protocols on High Performance Networks. JPDC '05.
[19]
H. W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji, and D. K. Panda. Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC. In HPI-DC, 2005.
[20]
H. M. Khosravi and A. Foong. Performance Analysis of iSCSI and Effect of CRC Computation. In BEACON '04.
[21]
F. Petrini, W. C. Feng, A. Hoisie, S. Coll, and E. Frachtenberg. The Quadrics Network (QsNet): High-Performance Clustering Technology. In Hot Interconnects, 2001.
[22]
G. F. Pfister and V. A. Norton. Hot-spot Contention and Combining in Multistage Interconnection Networks. IEEE Transactions on Computers, 34:943--948, 1985.
[23]
D. V. Sarvate. Computation of cyclic redundancy checks via table look-up. In Communications of the ACM, volume 31, 1998.
[24]
IEEE 802 Standards. IEEE 802.1Q - Virtual LANs.
[25]
J. Stone and C. Partridge. When the CRC and TCP Checksum Disagree. In ACM SIGCOMM, 2000.
[26]
A. Vishnu, M. Koop, A. Moody, A. Mamidala, S. Narravula, and D. K. Panda. Hot-Spot Avoidance With Multi-Pathing Over InfiniBand: An MPI Perspective. In CCGrid, 2007.
[27]
A. Vishnu, A. R. Mamidala, and D. K. Panda. Performance Modeling of Subnet Management on Fat Tree InfiniBand Networks using OpenSM. In Workshop on System Management Tools on Large Scale Parallel Systems, 2005.

Cited By

View all
  • (2014)An Evaluation of One-Sided and Two-Sided Communication Paradigms on Relaxed-Ordering InterconnectProceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium10.1109/IPDPS.2014.116(1115-1125)Online publication date: 19-May-2014
  • (2011)RDMA Capable iWARP over DatagramsProceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium10.1109/IPDPS.2011.66(628-639)Online publication date: 16-May-2011
  • (2010)iWARP redefined: Scalable connectionless communication over high-speed Ethernet2010 International Conference on High Performance Computing10.1109/HIPC.2010.5713192(1-10)Online publication date: Dec-2010
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing
November 2007
723 pages
ISBN:9781595937643
DOI:10.1145/1362622
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 November 2007

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

SC '07
Sponsor:

Acceptance Rates

SC '07 Paper Acceptance Rate 54 of 268 submissions, 20%;
Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2014)An Evaluation of One-Sided and Two-Sided Communication Paradigms on Relaxed-Ordering InterconnectProceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium10.1109/IPDPS.2014.116(1115-1125)Online publication date: 19-May-2014
  • (2011)RDMA Capable iWARP over DatagramsProceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium10.1109/IPDPS.2011.66(628-639)Online publication date: 16-May-2011
  • (2010)iWARP redefined: Scalable connectionless communication over high-speed Ethernet2010 International Conference on High Performance Computing10.1109/HIPC.2010.5713192(1-10)Online publication date: Dec-2010
  • (2009)Compiler Techniques for Efficient Communications in Circuit Switched Networks for Multiprocessor SystemsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2008.8220:3(331-345)Online publication date: 1-Mar-2009
  • (2008)Sockets direct protocol for hybrid network stacksProceedings of the 15th international conference on High performance computing10.5555/1791889.1791939(478-490)Online publication date: 17-Dec-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media