skip to main content
10.1145/335231.335255acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
Article
Free Access

Performance analysis of distributed applications using automatic classification of communication inefficiencies

Authors Info & Claims
Published:08 May 2000Publication History

ABSTRACT

We present a technique for performance analysis that helps users understand the communication behavior of their message passing applications. Our method automatically classifies individual communication operations and it reveals the cause of communication inefficiencies in the application. This classification allows the developer to focus quickly on the culprits of truly inefficient behavior, rather than manually foraging through massive amounts of performance data. Specifically, we trace the message operations of MPI applications and then classify each individual communication event using decision tree classification, a supervised learning technique. We train our decision tree using microbenchmarks that demonstrate both efficient and inefficient communication. Since our technique adapts to the target system's configuration through these microbenchmarks, we can simultaneously automate the performance analysis process and improve classification accuracy. Our experiments on four applications demonstrate that our technique can improve the accuracy of performance analysis, and dramatically reduce the amount of data that users must encounter

References

  1. 1.T.E. Anderson and E.D. Lazowska, "Quartz: A Tool for Tuning Parallel Program Performance," Prec. 1990 SIGMETRICS Conf. Measurement and Modeling Computer Systems, 1990, PiT. 115-25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.D. Bailey, E. Barszez et al., "The NAS Parallel Benchmarks (94)," NASA Ames Research Center, RNR Technical Report RNR-94-007, 1994.Google ScholarGoogle Scholar
  3. 3.M. Calzarossa, L. Massari et al., "Medea: A Tool for Workload Characterization of Parallel Systems," IEEE Parallel & Distributed Technology, 3(4):72-80, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.U.M. Fayyad, G. Piatetsky-Shapiro et al., Eds., Advances in knowledge discovery and data mining. Menlo Park, CA: AAAI Press: MIT Press, 1996, pp. xiv, 611. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.I. Foster, Designing and building parallel programs: concepts and tools for parallel software engineering. Reading, MA: Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.I. Foster and C. Kesselman, Eds., The Grid: blueprint for a new computing infrastructure. San Francisco: Morgan Kaufmann Publishers, 1999, pp. xxiv, 677. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.J.A. Gannon, K.J. Williams et ai., "Using perturbation tracking to compensate for intrusion in message-passing systems," Prec. 14th Int'l Conf. Distributed Computing Systems, 1994, pp. 414-21.Google ScholarGoogle Scholar
  8. 8.G.A. Geist, M.T. Heath et al., "A Users' Guide to PICL - A Portable Instrumented Communication Library," Oak Ridge National Laboratory, P.O.Box 2009, Bldg. 9207-A, Oak Ridge, TN 37831-8083 1991.Google ScholarGoogle Scholar
  9. 9.W. Gropp, E. Lusk, and A. Skjellum, Using MPI: portable parallel programming with the message.passing interface, 2nd ed. Cambridge, MA: MIT Press, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.M.T. Heath, A.D. Malony, and D.T. Rover, "Parallel performance visualization: from practice to theory," IEEE Parallel & Distributed Technology: Systems & Applications, 3(4):44-60, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.M. Ji, E.W. Felten, and K. Li, "Performance Measurements for Multithreaded Programs," Prec. 1998 ACM Int'l Conf. Measurement and Modeling of Computer Systems, SIGMETRICS 98, 1998, pp. 161-70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.W. Lee, S. J.Stolfo, and K. W.Mok, "Mining in a dataflow environment: experience in network intrusion detection," Prec. Fifth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, 1999, pp. 114-24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13.W. Meira, Jr; T.J. LeB!anc, and A. Poulos, "Waiting Time Analysis and Performance Visualization in Carnival," Prec. ACM SIGMETRICS Syrup. on Parallel and Distributed Tools, 1996, pp. 1-10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.B.P. Miller, M.D. Callaghan et al., "The Paradyn parallel performance measurement tool," IEEE Computer, 28(11):37-46, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.A.A. Mirin, R.H. Cohen et al., "Very High Resolution Simulation of Compressible Turbulence on the IBM-SP System," Prec. SC99, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.J.R. Quinlan, C4.5: programs for machine learning. San Mateo, CA: Morgan Kaufmann Publishers, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.R. Rajamony and A.L. Cox, "Performance debugging shared memory parallel programs using run-time dependence analysis," Performance Evaluation Review (Prec. 1997 ACM lnt'l Conf. Measurement and Modeling of Computer Systems, SIGMETRICS 97), 25(1):75-87, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.D.A. Reed, R.A. Aydt et al., "An Overview of the Pablo Performance Analysis Environment," Department of Computer Science, University of Illinois, 1304 West Springfield Avenue, Urbana, IL 61801 1992.Google ScholarGoogle Scholar
  19. 19.D.A. Reed, O.Y. Nickolayev, and P.C. Roth, "Real-Time Statistical Clustering and for Event Trace Reduction," Z Supercomputing Applications and High-Performance Computing, 11(2): 144-59, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. 20.S.R. Sarukkai, J. Yah, and J.K. Gotwals, "Normalized performance indices for message passing parallel programs," Prec. 8th ACM Int'l Conf. Supercomputing, 1994, pp. 323-32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.S. Shende, A.D. Malony et al., "Portable profiling and tracing for parallel, scientific applications using C++," Prec. SIGMETRICS Symp. Parallel and Distributed Tools (SPDT), 1998, pp. 134-45. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22.M. Snir, S. Otto et al., Eds., MP1-the complete reference, 2nd ed. Cambridge, MA: MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. 23.J. Stasko, J. Domingue et al., Eds., Software Visualization: Programming as a Multimedia Experience,. Cambridge, MA: MIT Press, 1998.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Performance analysis of distributed applications using automatic classification of communication inefficiencies

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image ACM Conferences
                  ICS '00: Proceedings of the 14th international conference on Supercomputing
                  May 2000
                  347 pages
                  ISBN:1581132700
                  DOI:10.1145/335231

                  Copyright © 2000 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 8 May 2000

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • Article

                  Acceptance Rates

                  ICS '00 Paper Acceptance Rate33of122submissions,27%Overall Acceptance Rate584of2,055submissions,28%

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader