- 1. Advanced Computing Laboratory (LANL), "Scientific Template Library - Part of the DOE 2000 ACTS Toolkit," 1998. URL: http://www.acl.lanl.gov/SciTL/.
- 2. Advanced Computing Laboratory (LANL), "TAU Portable Profiling," 1998. URL: http://www.acl.lanl.gov/tau.
- 3. V.S. Adve, J. Mellor-Crummey, M. Anderson, K. Kennedy, J.C. Wang, and D.A. Reed, "Integrating Compilation and Performance Analysis for Data-Parallel Programs," Proc. of the Workshop on Debugging and Performance Tuning for Parallel Computing Systems, Jan. 1996.
- 4. F. Bassetti, K. Davis, and D. Quinlan, "A Comparison of Performance-Enhancing Strategies for Parallel Numerical Object-Oriented Frameworks," Proc. ISCOPE 97, LNCS Vol. 1343, Springer, Dec. 1997.
- 5. J. Dongarra and L.S. Blackford, "ScaLAPACK Tutorial," Proc. of Applied Parallel Computing, Industrial Computation and Optimization, Third International Workshop, PARA '96, Aug. 1996.
- 6. I. Foster and C. Kesselman, "Globus: A Metacomputing Infrastructure Toolkit," Proc. of the Workshop on Environments and Tools for Parallel Scientific Computing, Aug. 1996.
- 7. S.L. Graham, P.B. Kessler, and M.K. McKusick, "An Execution Profiler for Modular Programs," Software -- Practice and Experience, Vol. 13, pp. 671-85, 1983.
- 8. J.K. Hollingsworth, B.P. Miller, M.J.R. Gonçalves, O. Naim, Z. Xu, and L. Zheng, "MDL: A Language and Compiler for Dynamic Program Instrumentation," Proc. 1997 International Conference on Parallel Architectures and Compilation Techniques, pp. 201-12, Nov. 1997.
- 9. Y. Ishikawa, R.R. Oldehoeft, J.V.W. Reynders, and M. Tholburn (Eds.), Scientific Computing in Object-Oriented Parallel Environments, Proc. ISCOPE 97, LNCS Vol. 1343, Springer, Dec. 1997.
- 10. E. Johnson, D. Gannon, and P. Beckman, "HPC++: Experiments with the Parallel Standard Template Library," Proc. International Conference on Supercomputing, July 1997.
- 11. D. Lange and Y. Nakamura, "Object-Oriented Program Tracing and Visualization," IEEE Computer, Vol. 30, No. 5, pp. 63-70, May 1997.
- 12. S.R. Lee, J.C. Cummings, S.D. Nolen, and N.D. Keen, "MC++ and a Transport Physics Framework," Proc. ISCOPE 97, LNCS Vol. 1343, Springer, Dec. 1997.
- 13. B. Miller, M. Callaghan, J. Cargille, J. Hollingsworth, R. Irvin, K. Karavanic, K. Kunchithapadam, and T. Newhall, "The Paradyn Parallel Performance Measurement Tools," IEEE Computer, Vol. 28, No. 11, pp. 37-46, Nov. 1995.
- 14. B. Mohr, A. Malony, and J. Cuny, "TAU," in G.V. Wilson and P. Lu (Eds.), Parallel Programming using C++, MIT Press, 1996.
- 15. Pallas GmbH, "VAMPIR - Visualization and Analysis of MPI Resources," 1998. URL: http://www.pallas.de/pages/vampir.html.
- 16. D. Pase, "MPP Apprentice: A Non-Event Trace Performance Tool for the Cray T3D," Workshop on Debugging and Performance Tuning for Parallel Computing Systems, Oct. 1994.
- 17. J.V.W. Reynders et al., "Pooma: A Framework for Scientific Simulation on Parallel Architectures," in G.V. Wilson and P. Lu (Eds.), Parallel Programming using C++, pp. 553-594, MIT Press, 1996.
- 18. B. Stroustrup, The C++ Programming Language, Third Edition, Addison-Wesley, Massachusetts, June 1997.
- 19. Silicon Graphics, Inc., "Speed Shop User's Guide," 1997. URL: http://techpubs.sgi.com.
- 20. T. Veldhuizen, "Expression Templates," C++ Report, 7(5):26-31, June 1995.
- 21. T. Veldhuizen and M.E. Jernigan, "Will C++ be Faster than Fortran?" Proc. ISCOPE 97, LNCS Vol. 1343, Springer, Dec. 1997.
- 22. J. Yan, "Performance Tuning with AIMS - An Automated Instrumentation and Monitoring System for Multicomputers," Proc. 27th Hawaii Int. Conf. on System Sciences, Hawaii, Jan. 1994.
- 23. M. Zagha, B. Larson, S. Turner, and M. Itzkowitz, "Performance Analysis Using the MIPS R10000 Performance Counters," Proc. Supercomputing '96, IEEE Computer Society, Nov. 1996.