Article

Sequencer virtualization

Authors:

Jamison D. Collins,

Gautham N. Chinya,

Hong WangAuthors Info & Claims

ICS '07: Proceedings of the 21st annual international conference on Supercomputing

Pages 148 - 157

https://doi.org/10.1145/1274971.1274993

Published: 17 June 2007 Publication History

Abstract

The Multiple Instruction Stream Processor (MISP) architecture introduces the sequencer as a new class of architectural resource, and provides a minimalist user-level MIMD instruction set extension for application programs to directly control execution of concurrent instruction streams on these sequencers. As with classic architectural resources, namely, registers and memory, the sequencer architectural resource can be subject to virtualization. This paper details the idea of Sequencer Virtualization (SV), a foundational architectural support to decouple architectural virtual sequencers from physical sequencers. SV enables more efficient utilization of sequencer resources at the microarchitectural level while maintaining a consistent programming interface at the architectural level. To evaluate the key tradeoffs for SV, we conduct extensive experiments by implementing a prototype SV system using a custom firmware on a large-scale multiprocessor system. Using the prototype SV system, we demonstrate that SV improves efficiency in sequencer utilization while incurring little performance overhead. In particular, for a set of real multithreaded workloads, SV can significantly improve sequencer utilization, achieving an average of 32% better wall-clock performance than MISP without SV support in a multi-programming environment.

References

[1]

V. Aslot, M. Domeika, R. Eigenmann, G. Gaertner, W. Jones, and B. Parady. SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance. In Proceedings of the International Workshop on OpenMP Applications and Tools, July 2001.

Digital Library

[2]

S. Balakrishnan, R. Rajwar, M. Upton, and K. Lai. The Impact of Performance Asymmetry in Emerging Multicore Architectures. In Proceedings of the 32nd Annual international Symposium on Computer Architecture, June 2005.

Digital Library

[3]

R. Blumofe and C. Leiserson. Scheduling Multithreaded Computations by Work Stealing. Journal of ACM, (5):720--749, September 1999.

Digital Library

[4]

R. Bryant, J. Hawkes, J. Steiner, J. Barnes, and J. Higdon. Scaling Linux to the Extreme: From 64 to 512 Processors. In Proceedings of the Ottawa Linux Symposium, July 2004.

[5]

K. Chakraborty, P. Wells, and G. Sohi. Computation Spreading: Employing Hardware Migration to Specialize CMP Cores on-the Fly. In Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, October 2006.

Digital Library

[6]

P. Dubey. Recognition, Mining and Synthesis Moves Computers to the Era of Tera. Technology@Intel Magazine, February 2005.

[7]

A. Eichenberger, K. O'Brien, K. O'Brien, P. Wu, T. Chen, P. Oden, D. Prener, J. Shepherd, B. So, Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind. Optimizing Compiler for the CELL Processor. In Proceedings of the 14th international Conference on Parallel Architectures and Compilation Techniques, 2005.

Digital Library

[8]

R. Hankins, G. Chinya, J. Collins, P. Wang, R. Rakvic, H. Wang, and J. Shen. Multiple Instruction Stream Processor. In Proceedings of the 33rd International Symposium on Computer Architecture, June 2006.

Digital Library

[9]

J. L. Hennessy and D. A. Patterson. Computer Architecture: A Quantitative Approach. Number ISBN 1-55860-724-2. Morgan Kauffmann Publishers, Inc.

Digital Library

[10]

Tera-scale Research Prototype: Connecting 80 Simple Sores on a Single Test Chip. ftp://download.intel.com/research/platform/terascale/terascaleresearchprototypebackgrounder.pdf.

[11]

Itanium Processor Microarchitecture Reference for Software Optimization. Intel Corporation, March 2000.

[12]

Intel Itanium 2 Processor Reference Manual for Software Development and Optimization. Intel Corporation, June 2002.

[13]

Intels Next Generation Integrated Graphics Architecture Intel Graphics Media Accelerator X3000 and 3000. Intel Corporation, 2006.

[14]

M. Litzkow, M. Livny, and M. Mutka. Condor: A Hunter of Idle Workstations. In Proceedings of the Eighth International Conference on Distributed Computing Systems, June 1988.

[15]

D. Marr, F. Binns, D. Hill, G. Hinton, D. Koufaty, J. Miller, and M. Upton. Hyper-Threading Technology Architecture and Microarchitecture. Intel Technology Journal, February 2002.

[16]

D. Oehmke, N. Binkert, T. Mudge, and S. Reinhardt. How to Fake 1000 Registers. In Proceedings of the 38th International Symposium on Microarchitecture, November 2005.

Digital Library

[17]

J. Owens, D. Luebke, N. Govindaraju, M. Harris, J. Krüger, A. Lefohn, and T. Purcell. A Survey of General-Purpose Computation on Graphics Hardware. In Eurographics, August 2005.

[18]

X. Tian, M. Girkar, S. Shah, D. Armstrong, E. Su, and P. Petersen. Compiler and Runtime Support for Running OpenMP Programs on Pentium and Itanium Architectures. In Proceedings of the 17th International Symposium on Parallel and Distributed Processing, April 2003.

Digital Library

[19]

R. Tomasulo. An Efficient Algorithm for Exploiting Multiple Arithmetic Units. IBM Journal Research and Development, (11):25--33, January 1967.

[20]

R. Uhlig, G. Neiger, D. Rodgers, A. Santoni, F. Martins, A. Anderson, S. Bennett, A. Kagi, F. Leung, and L. Smith. Intel Virtualization Technology. Computer Magazine, May 2005.

Digital Library

[21]

P. Wang, J. Collins, G. Chinya, H. Jiang, X. Tian, M. Girkar, N. Yang, G. Lueh, and H. Wang. EXOCHI: Architecture and Programming Environment for A Heterogeneous Multi-core Multithreaded System. In ACM SIGPLAN Conference on Programming Language Design and Implementation, June 2007.

Digital Library

[22]

P. Wang, J. Collins, H. Wang, D. Kim, B. Greene, K. Chan, A. Yunus, T. Sych, S. Moore, and J. Shen. Helper Threads via Virtual Multithreading on an Experimental Itanium 2 Processor-based Platform. In Proceedings of the 11th international Conference on Architectural Support For Programming Languages and Operating Systems, October 2004.

Digital Library

[23]

S. Zhou, X. Zheng, J. Wang, and P. Delisle. Utopia: a Load Sharing Facility for Large, Heterogeneous Distributed Computer Systems. Software - Practice and Experience, 23(12):1305--1336, December 1993.

Digital Library

Index Terms

Sequencer virtualization
1. Computer systems organization
  1. Architectures
    1. Parallel architectures
      1. Multiple instruction, multiple data

Recommendations

Hardware-Accelerated Cross-Architecture Full-System Virtualization

Hardware virtualization solutions provide users with benefits ranging from application isolation through server consolidation to improved disaster recovery and faster server provisioning. While hardware assistance for virtualization is supported by all ...
ARM virtualization: performance and architectural implications
ISCA '16: Proceedings of the 43rd International Symposium on Computer Architecture

ARM servers are becoming increasingly common, making server technologies such as virtualization for ARM of growing importance. We present the first study of ARM virtualization performance on server hardware, including multicore measurements of two ...
ARM virtualization: performance and architectural implications
ISCA'16

ARM servers are becoming increasingly common, making server technologies such as virtualization for ARM of growing importance. We present the first study of ARM virtualization performance on server hardware, including multicore measurements of two ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICS '07: Proceedings of the 21st annual international conference on Supercomputing

June 2007

315 pages

ISBN:9781595937681

DOI:10.1145/1274971

General Chair:
Burton Smith
Microsoft

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGARCH: ACM Special Interest Group on Computer Architecture

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 June 2007

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

ICS07

Sponsor:

SIGARCH

ICS07: International Conference on Supercomputing

June 17 - 21, 2007

Washington, Seattle

Acceptance Rates

Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
556
Total Downloads

Downloads (Last 12 months)7
Downloads (Last 6 weeks)0

Reflects downloads up to 15 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten