skip to main content
research-article

Using simulation to design extremescale applications and architectures: programming model exploration

Published: 29 March 2011 Publication History

Abstract

A key problem facing application developers is that they are expected to utilize extreme levels of parallelism soon after delivery of future leadership class machines, but developing applications capable of exposing sufficient concurrency is a time consuming process requiring experimentation. At the same time, due to the expense of building and operating an exascale machine, it will be necessary to apply tighter engineering margins to their design. Simple metrics such as the computation-communication ratio will not sufficiently specify machine requirements. Simulation fills this gap, allowing the study of extreme-scale architectures with the explicit inclusion of the complex interactions between the various hardware and software components, and can be used for correctness-checking as well as performance estimation. The simulator we discuss in this paper can be driven by reading trace files, typically generated by an actual application that has been run on real hardware, or by using a skeleton application. The skeleton application is designed to have the control flow of a real application, but with expensive computations and large data transfers replaced by discrete events for which the timings are determined by models. Using skeleton applications, we can predict application performance at levels of parallelism unobtainable on any current computational platform. The skeleton application can be modified to experiment with different communication strategies and programming models. Since the machine being simulated is in our control, we can experiment with different network topologies, routing algorithms, bandwidths, latencies, failure modes, core-to-node ratios, etc. In this paper, we use the Structural Simulation Toolkit macroscale components for coarse-grained simulation to illustrate the exploration of alternative programming models at extreme scale.

References

[1]
Int. Exascale Software Project. www.exascale.org.
[2]
Structural Simulation Toolkit macroscale. bitbucket.org/cljanss/sstmacro.
[3]
V. S. Adve et al. Compiler-optimized simulation of large-scale applications on high performance architectures. J. Par. Dist. Comp., 62(3), 2002.
[4]
R. C. Agarwal et al. A three-dimensional approach to parallel matrix multiplication. IBM J. of Research and Development, 39:575--582, 1995.
[5]
S. Ashby et al. Report on exascale computing, 2009. www.er.doe.gov/ascr/ASCAC/Reports/Exascale-Subcommittee-Report.pdf.
[6]
J. Decker et al. Exascale workshop panel meeting report, 2010. www.science.doe.gov/ascr/ProgramDocuments/Docs/TrivelpieceExascaleWorkshop.pdf.
[7]
J. Dongarra, P. Beckman, et al. The international exascale software roadmap. Int. J. High Performance Computer Applications, 25, 2011.
[8]
G. H. Golub and C. F. V. Loan. Matrix Computations. The Johns Hopkins University Press, USA, 1996.
[9]
K. Goto and R. A. van de Geijn. Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw., 34(3):1--25, 2008.
[10]
J. Gunnels, C. Lin, G. Morrow, and R. van de Geijn. A exible class of parallel matrix multiplication algorithms. IPPS/SPDP '98, 110--116, 1998.
[11]
A. Gupta and V. Kumar. Scalability of parallel algorithms for matrix multiplication. ICPP, 1993.
[12]
S. D. Hammond, et al. WARPP: a toolkit for simulating high-performance parallel scientific codes. Simutools '09, 19:1--19:10, 2009.
[13]
C. Hewitt, P. Bishop, and R. Steiger. A universal modular actor formalism for artificial intelligence. 3rd Int. Conf. Artificial intelligence, 235--245, 1973.
[14]
C. L. Janssen et al. A simulator for large-scale parallel architectures. Int. J. Par. Dist. Sys., 1(2):57--73, 2010.
[15]
P. Kogge et al. ExaScale computing study: Technology challenges in achieving exascale systems. Technical report, May 2008. www.darpa.mil/ipto/personnel/docs/ExaScale_Study_Initial.pdf.
[16]
Message Passing Interface Forum. MPI: A message-passing interface standard: Version 2.1, 2008. http://www.mpi-forum.org.
[17]
H. Simon, T. Zacharia, and R. Stevens. Modeing and simulation at the exascale for energy and the environment, 2007. http://www.science.doe.gov/ascr/ProgramDocuments/Docs/TownHall.pdf.
[18]
R. Susukita et al. Performance prediction of large-scale parallell system and application using macro-level simulation. SC08, 1--9, 2008.

Cited By

View all
  • (2022)On the analysis of makespan and performance of the task‐based execution model for enterprise application integration platforms: An empirical studySoftware: Practice and Experience10.1002/spe.308552:7(1717-1735)Online publication date: 28-Mar-2022
  • (2018)Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)10.1109/PAW-ATM.2018.00010(44-56)Online publication date: Nov-2018
  • (2018)Prometheus: Coherent Exploration of Hardware and Software Optimizations Using Aspen2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)10.1109/MASCOTS.2018.00032(244-250)Online publication date: Sep-2018
  • Show More Cited By

Index Terms

  1. Using simulation to design extremescale applications and architectures: programming model exploration

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM SIGMETRICS Performance Evaluation Review
      ACM SIGMETRICS Performance Evaluation Review  Volume 38, Issue 4
      Special issue on the 1st international workshop on performance modeling, benchmarking and simulation of high performance computing systems (PMBS 10)
      March 2011
      93 pages
      ISSN:0163-5999
      DOI:10.1145/1964218
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 29 March 2011
      Published in SIGMETRICS Volume 38, Issue 4

      Check for updates

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)4
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 20 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)On the analysis of makespan and performance of the task‐based execution model for enterprise application integration platforms: An empirical studySoftware: Practice and Experience10.1002/spe.308552:7(1717-1735)Online publication date: 28-Mar-2022
      • (2018)Semi-Static and Dynamic Load Balancing for Asynchronous Hurricane Storm Surge Simulations2018 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI (PAW-ATM)10.1109/PAW-ATM.2018.00010(44-56)Online publication date: Nov-2018
      • (2018)Prometheus: Coherent Exploration of Hardware and Software Optimizations Using Aspen2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)10.1109/MASCOTS.2018.00032(244-250)Online publication date: Sep-2018
      • (2018)Performance and Accuracy Trade-offs of HPC Application Modeling and Simulation2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2018.00087(774-783)Online publication date: May-2018
      • (2017)PANORAMAInternational Journal of High Performance Computing Applications10.1177/109434201559451531:1(4-18)Online publication date: 1-Jan-2017
      • (2017)A Wait-Free Hash MapInternational Journal of Parallel Programming10.1007/s10766-015-0376-345:3(421-448)Online publication date: 1-Jun-2017
      • (2016)Topology-aware performance optimization and modeling of adaptive mesh refinement codes for exascaleProceedings of the First Workshop on Optimization of Communication in HPC10.5555/3018058.3018061(17-28)Online publication date: 13-Nov-2016
      • (2016)Performance modeling of in situ renderingProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.5555/3014904.3014936(1-12)Online publication date: 13-Nov-2016
      • (2016)Performance Modeling of In Situ RenderingSC16: International Conference for High Performance Computing, Networking, Storage and Analysis10.1109/SC.2016.23(276-287)Online publication date: Nov-2016
      • (2016)Modeling the Energy-Time Performance of MIC Architecture System2016 IEEE 24th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS)10.1109/MASCOTS.2016.54(85-94)Online publication date: Sep-2016
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media