skip to main content
10.1145/1323548.1323553acmconferencesArticle/Chapter ViewAbstractPublication PagesancsConference Proceedingsconference-collections
research-article

Frame shared memory: line-rate networking on commodity hardware

Published: 03 December 2007 Publication History

Abstract

Network processors provide an economical programmable platform to handle the high throughput and frame rates of modern and next-generation communication systems. However, these platforms have exchanged general-purpose capabilities for performance.
This paper presents an alternative; a software network processor (Soft-NP) framework using commodity general-purpose platforms capable of high-rate and throughput sequential frame processing compatible with high-level languages and general-purpose operating systems. A cache-optimized concurrent lock free queue provides the necessary low-overhead core-to-core communication for sustained sequential frame processing beyond the realized 1.41 million frames per second (Gigabit Ethernet) while permitting perframe processing time expansion with pipeline parallelism.

References

[1]
Advanced Micro Devices. AMD completes ATI acquisition and creates processing powerhouse. http://www.amd.com/us-en/Corporate/VirtualPressRoom/0,51_104_543~113741,00.html, October 2006.
[2]
K. Asanovic, R. Bodik, B. C. Catanzaro, J. J. Gebis, P. Husbands, K. Keutzer, D. A. Patterson, W. L. Plishker, J. Shalf, S. W. Williams, and K. A. Yelick. The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley, Dec 2006.
[3]
B. N. Bershad, T. E. Anderson, E. D. Lazowska, and H. M. Levy. Lightweight remote procedure call. In Proceedings of the twelfth ACM Symposium on Operating Systems Principles, 1989.
[4]
S. Blott, J. Brustoloni, and C. Martin. NetTap: An efficient and reliable PC-based platform for network programming, 1999.
[5]
J. C. Brustoloni. Interoperation of copy avoidance in network and file I/O. In Proceedings of the INFOCOM '99. Sixteenth Annual Joint Conference of the IEEE Computer and Communications Societies, 1999.
[6]
Celoxia and DRC. FPGA acceleration solution released for AMD Opteron™ processor-based systems. http://www.drccomputer.com/pdfs/PR_DRC_Celoxica_042406.pdf, April 2006.
[7]
CNET News.com. Intel pledges 80 cores in five years. http://news.com.com/2100-1006_3-6119618.html, September 2006.
[8]
L. Deri. nCap: Wire-speed packet capture and transmission. In Proceedings of 3rd IEEE/IFIP Workshop on End-to-End Monitoring Techniques and Services, 2005.
[9]
P. Druschel and L. L. Peterson. Fbufs: a high-bandwidth cross-domain transfer facility. In Proceedings of the fourteenth ACM symposium on Operating systems principles, 1993.
[10]
D. Dunning, G. Regnier, G. McAlpine, B. Shubert, F. Berry, A. M. Merritt, E. Gronke, and C. Dodd. The virtual interface architecture. IEEE Micro, 1998.
[11]
D. R. Engler, M. F. Kaashoek, and J. J. O'Toole. Exokernel: an operating system architecture for application-level resource management. In SOSP '95: Proceedings of the fifteenth ACM Symposium on Operating Systems Principles, pages 251--266, New York, NY, USA, 1995. ACM Press.
[12]
K. Gharachorloo, D. Lenoski, J. Laudon, P. B. Gibbons, A. Gupta, and J. L. Hennessy. Memory consistency and event ordering in scalable shared-memory multiprocessors. In 25 Years ISCA: Retrospectives and Reprints, pages 376--387, 1998.
[13]
J. Giacomoni, T. Moseley, and M. Vachharajani. Fastforward for efficient pipeline parallelism: A cache-optimized concurrent lock-free queue. In PPoPP '08: Proceedings of the The 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, NY, USA, February 2008. ACM Press.
[14]
M. Handley, E. Kohler, A. Ghosh, O. Hodson, and P. Radoslavov. Designing extensible IP router software. In Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation, 2005.
[15]
N. C. Hutchinson and L. L. Peterson. The x-Kernel: An architecture for implementing network protocols. IEEE Transactions on Software Engineering.
[16]
InfiniBand Trade Association. http://www.infinibandta.org.
[17]
Y. A. Khalidi and M. N. Thadani. An efficient zero-copy I/O framework for UNIX. Technical report, 1995.
[18]
E. Kohler, R. Morris, B. Chen, J. Jannotti, and M. F. Kaashoek. The click modular router. ACM Transactions on Computer Systems, 2000.
[19]
S. Kumar, J. Maschmeyer, and P. Crowley. Exploiting locality to ameliorate packet queue contention and serialization. In Proceedings of the 3rd conference on Computing Frontiers, 2006.
[20]
E. Ladan-Mozes and N. Shavit. An optimistic approach to lock-free FIFO queues. In DISC '04: Proceedings of the Distributed Computing, 18th International Conference, 2004.
[21]
L. Lamport. Specifying concurrent program modules. ACM Trans. Program. Lang. Syst., 1983.
[22]
libpcap. http://www.tcpdump.org.
[23]
libpcap-mmap. http://public.lanl.gov/cpw/.
[24]
H. Massalin and C. Pu. Threads and input/output in the Synthesis kernel. In Proceedings of the twelfth ACM Symposium on Operating Systems Principles, 1989.
[25]
S. McCanne and V. Jacobson. The BSD packet filter: A new architecture for user-level packet capture. In Proceedings of the USENIX Winter 1993 Conference Proceedings, 1993.
[26]
M. M. Michael and M. L. Scott. Nonblocking algorithms and preemption-safe locking on multiprogrammed shared -- memory multiprocessors. Journal of Parallel and Distributed Computing, 1998.
[27]
J. C. Mogul and K. K. Ramakrishnan. Eliminating receive livelock in an interrupt-driven kernel. ACM Transactions on Computer Systems, 1997.
[28]
J. Mudigonda, H. M. Vin, and R. Yavatkar. Overcoming the memory wall in packet processing: hammers or ladders? In Proceedings of the 2005 symposium on Architecture for Networking and Communications Systems, 2005.
[29]
R. Olsson. pktgen the linux packet generator. In Proc. linuxsymposium 2005, 2005.
[30]
V. S. Pai, P. Druschel, and W. Zwaenepoel. IO-lite: A unified I/O buffering and caching system. In Proceedings of the third Symposium on Operating Systems Design and Implementation, 1999.
[31]
R. Rashid, A. Tevanian, M. Young, D. Golub, R. Baron, D. Black, W. Bolosky, and J. Chew. Machine-independent virtual memory management for paged uniprocessor and multiprocessor architectures. In Proceedings of the second international conference on Architectual Support for Programming Languages and Operating Systems, 1987.
[32]
G. Regnier, S. Makineni, R. Illikkal, R. Iyer, D. Minturn, R. Huggahalli, D. Newell, L. Cline, and A. Foong. TCP onloading for data center servers. IEEE Computer, 2004.
[33]
G. Regnier, D. Minturn, G. McAlpine, V. A. Saletore, and A. Foong. ETA: Experience with an Intel Xeon processor as a packet processing engine. IEEE Micro, 2004.
[34]
M. B. Taylor, J. Kim, J. Miller, D. Wentzlaff, F. Ghodrat, B. Greenwald, H. Hoffman, P. Johnson, J.-W. Lee, W. Lee, A. Ma, A. Saraf, M. Seneski, N. Shnidman, V. Strumpen, M. Frank, S. Amarasinghe, and A. Agarwal. The Raw microprocessor: A computational fabric for software circuits and general-purpose programs. IEEE Micro, 2002.
[35]
J. Verdú, J. Garciá, M. Nemirovsky, and M. Valero. Architectural impact of stateful networking applications. In Proceedings of the 2005 symposium on Architecture for Networking and Communications Systems, 2005.
[36]
M. Welsh, D. Culler, and E. Brewer. SEDA: an architecture for well-conditioned, scalable internet services. In Proceedings of the eighteenth ACM Symposium on Operating Systems Principles, 2001.
[37]
W. A. Wulf and S. A. McKee. Hitting the memory wall: Implications of the obvious. SIGARCH Computer Architecture News, 1995.
[38]
B. Wun and P. Crowley. Network I/O acceleration in heterogeneous multicore processors. In Proceedings of the 14th IEEE Symposium on High-Performance Interconnects, 2006.

Cited By

View all
  • (2024)High Performance Cloud Ready Parallel Batch Processing using Auto Scaling Containers and Queue Technique2024 4th International Conference on Sustainable Expert Systems (ICSES)10.1109/ICSES63445.2024.10763354(630-639)Online publication date: 15-Oct-2024
  • (2020)EQueue: Elastic Lock-Free FIFO Queue for Core-to-Core Communication on Multi-Core ProcessorsIEEE Access10.1109/ACCESS.2020.29970718(98729-98741)Online publication date: 2020
  • (2019)Accelerating Wait-Free Algorithms: Pragmatic Solutions on Cache-Coherent Multicore ArchitecturesIEEE Access10.1109/ACCESS.2019.29207817(74653-74669)Online publication date: 2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ANCS '07: Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems
December 2007
212 pages
ISBN:9781595939456
DOI:10.1145/1323548
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 03 December 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. multi-core
  2. multiprocessors
  3. parallel programming
  4. software network processor

Qualifiers

  • Research-article

Conference

ANCS07

Acceptance Rates

ANCS '07 Paper Acceptance Rate 20 of 70 submissions, 29%;
Overall Acceptance Rate 88 of 314 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)High Performance Cloud Ready Parallel Batch Processing using Auto Scaling Containers and Queue Technique2024 4th International Conference on Sustainable Expert Systems (ICSES)10.1109/ICSES63445.2024.10763354(630-639)Online publication date: 15-Oct-2024
  • (2020)EQueue: Elastic Lock-Free FIFO Queue for Core-to-Core Communication on Multi-Core ProcessorsIEEE Access10.1109/ACCESS.2020.29970718(98729-98741)Online publication date: 2020
  • (2019)Accelerating Wait-Free Algorithms: Pragmatic Solutions on Cache-Coherent Multicore ArchitecturesIEEE Access10.1109/ACCESS.2019.29207817(74653-74669)Online publication date: 2019
  • (2017)A Flexible Communication Mechanism for Pipeline Parallelism2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC)10.1109/ISPA/IUCC.2017.00119(778-785)Online publication date: Dec-2017
  • (2012)B-Queue: Efficient and Practical Queuing for Fast Core-to-Core CommunicationInternational Journal of Parallel Programming10.1007/s10766-012-0213-x41:1(137-159)Online publication date: 9-Aug-2012
  • (2011)Performance of a software switch2011 IEEE 12th International Conference on High Performance Switching and Routing10.1109/HPSR.2011.5986035(256-263)Online publication date: Jul-2011
  • (2010)SIP server performance on multicore systemsIBM Journal of Research and Development10.1147/JRD.2009.203697654:1(79-90)Online publication date: 1-Jan-2010
  • (2009)Practice of parallelizing network applications on multi-core architecturesProceedings of the 23rd international conference on Supercomputing10.1145/1542275.1542307(204-213)Online publication date: 8-Jun-2009
  • (2008)Design of a scalable network programming frameworkProceedings of the 4th ACM/IEEE Symposium on Architectures for Networking and Communications Systems10.1145/1477942.1477945(10-18)Online publication date: 6-Nov-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media