Article

Free Access

Performance analysis of distributed applications using automatic classification of communication inefficiencies

Author:
Jeffrey Vetter

Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, California

Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, California
View Profile

ICS '00: Proceedings of the 14th international conference on SupercomputingMay 2000Pages 245–254https://doi.org/10.1145/335231.335255

Published:08 May 2000Publication History

ICS '00: Proceedings of the 14th international conference on Supercomputing

Pages 245–254

ABSTRACT

We present a technique for performance analysis that helps users understand the communication behavior of their message passing applications. Our method automatically classifies individual communication operations and it reveals the cause of communication inefficiencies in the application. This classification allows the developer to focus quickly on the culprits of truly inefficient behavior, rather than manually foraging through massive amounts of performance data. Specifically, we trace the message operations of MPI applications and then classify each individual communication event using decision tree classification, a supervised learning technique. We train our decision tree using microbenchmarks that demonstrate both efficient and inefficient communication. Since our technique adapts to the target system's configuration through these microbenchmarks, we can simultaneously automate the performance analysis process and improve classification accuracy. Our experiments on four applications demonstrate that our technique can improve the accuracy of performance analysis, and dramatically reduce the amount of data that users must encounter

References

1.T.E. Anderson and E.D. Lazowska, "Quartz: A Tool for Tuning Parallel Program Performance," Prec. 1990 SIGMETRICS Conf. Measurement and Modeling Computer Systems, 1990, PiT. 115-25. Google ScholarDigital Library
2.D. Bailey, E. Barszez et al., "The NAS Parallel Benchmarks (94)," NASA Ames Research Center, RNR Technical Report RNR-94-007, 1994.Google Scholar
3.M. Calzarossa, L. Massari et al., "Medea: A Tool for Workload Characterization of Parallel Systems," IEEE Parallel & Distributed Technology, 3(4):72-80, 1995. Google ScholarDigital Library
4.U.M. Fayyad, G. Piatetsky-Shapiro et al., Eds., Advances in knowledge discovery and data mining. Menlo Park, CA: AAAI Press: MIT Press, 1996, pp. xiv, 611. Google ScholarDigital Library
5.I. Foster, Designing and building parallel programs: concepts and tools for parallel software engineering. Reading, MA: Addison-Wesley, 1995. Google ScholarDigital Library
6.I. Foster and C. Kesselman, Eds., The Grid: blueprint for a new computing infrastructure. San Francisco: Morgan Kaufmann Publishers, 1999, pp. xxiv, 677. Google ScholarDigital Library
7.J.A. Gannon, K.J. Williams et ai., "Using perturbation tracking to compensate for intrusion in message-passing systems," Prec. 14th Int'l Conf. Distributed Computing Systems, 1994, pp. 414-21.Google Scholar
8.G.A. Geist, M.T. Heath et al., "A Users' Guide to PICL - A Portable Instrumented Communication Library," Oak Ridge National Laboratory, P.O.Box 2009, Bldg. 9207-A, Oak Ridge, TN 37831-8083 1991.Google Scholar
9.W. Gropp, E. Lusk, and A. Skjellum, Using MPI: portable parallel programming with the message.passing interface, 2nd ed. Cambridge, MA: MIT Press, 1999. Google ScholarDigital Library
10.M.T. Heath, A.D. Malony, and D.T. Rover, "Parallel performance visualization: from practice to theory," IEEE Parallel & Distributed Technology: Systems & Applications, 3(4):44-60, 1995. Google ScholarDigital Library
11.M. Ji, E.W. Felten, and K. Li, "Performance Measurements for Multithreaded Programs," Prec. 1998 ACM Int'l Conf. Measurement and Modeling of Computer Systems, SIGMETRICS 98, 1998, pp. 161-70. Google ScholarDigital Library
12.W. Lee, S. J.Stolfo, and K. W.Mok, "Mining in a dataflow environment: experience in network intrusion detection," Prec. Fifth ACM SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, 1999, pp. 114-24. Google ScholarDigital Library
13.W. Meira, Jr; T.J. LeB!anc, and A. Poulos, "Waiting Time Analysis and Performance Visualization in Carnival," Prec. ACM SIGMETRICS Syrup. on Parallel and Distributed Tools, 1996, pp. 1-10. Google ScholarDigital Library
14.B.P. Miller, M.D. Callaghan et al., "The Paradyn parallel performance measurement tool," IEEE Computer, 28(11):37-46, 1995. Google ScholarDigital Library
15.A.A. Mirin, R.H. Cohen et al., "Very High Resolution Simulation of Compressible Turbulence on the IBM-SP System," Prec. SC99, 1999. Google ScholarDigital Library
16.J.R. Quinlan, C4.5: programs for machine learning. San Mateo, CA: Morgan Kaufmann Publishers, 1993. Google ScholarDigital Library
17.R. Rajamony and A.L. Cox, "Performance debugging shared memory parallel programs using run-time dependence analysis," Performance Evaluation Review (Prec. 1997 ACM lnt'l Conf. Measurement and Modeling of Computer Systems, SIGMETRICS 97), 25(1):75-87, 1997. Google ScholarDigital Library
18.D.A. Reed, R.A. Aydt et al., "An Overview of the Pablo Performance Analysis Environment," Department of Computer Science, University of Illinois, 1304 West Springfield Avenue, Urbana, IL 61801 1992.Google Scholar
19.D.A. Reed, O.Y. Nickolayev, and P.C. Roth, "Real-Time Statistical Clustering and for Event Trace Reduction," Z Supercomputing Applications and High-Performance Computing, 11(2): 144-59, 1997.Google ScholarDigital Library
20.S.R. Sarukkai, J. Yah, and J.K. Gotwals, "Normalized performance indices for message passing parallel programs," Prec. 8th ACM Int'l Conf. Supercomputing, 1994, pp. 323-32. Google ScholarDigital Library
21.S. Shende, A.D. Malony et al., "Portable profiling and tracing for parallel, scientific applications using C++," Prec. SIGMETRICS Symp. Parallel and Distributed Tools (SPDT), 1998, pp. 134-45. Google ScholarDigital Library
22.M. Snir, S. Otto et al., Eds., MP1-the complete reference, 2nd ed. Cambridge, MA: MIT Press, 1998. Google ScholarDigital Library
23.J. Stasko, J. Domingue et al., Eds., Software Visualization: Programming as a Multimedia Experience,. Cambridge, MA: MIT Press, 1998.Google ScholarDigital Library

Index Terms

Recommendations

Analysis of classification margin for classification accuracy with applications

Classification margin is commonly used for describing the classification capability of a committee of classifiers. In this paper, we study the relation between classification margin and misclassification error, focusing on exploring useful information ...
Read More
Performance analysis of soft computing techniques for the automatic classification of fruits dataset

Different properties of numerous types of fruits and vegetable classification are still an intricate task. The soft computing strategies are used to recognize a fruit by blending the three basic features which characterize the object: color, shape and ...
Read More
Automatic Classification of Strike Techniques Using Limb Trajectory Data
Machine Learning and Data Mining for Sports Analytics
Abstract
The classification of trajectory data is required in a wide variety of movement tracking experiments. Automatic classification using machine learning techniques has the potential to greatly increase efficiency and reliability of these studies. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICS '00: Proceedings of the 14th international conference on Supercomputing
May 2000
347 pages
ISBN:1581132700
DOI:10.1145/335231
Chairmen:
John Reynders
Los Alamos National Lab, Los Alamos, NM
,
Alex Veidenbaum
Univ. of California at Irvine, Irvine
Copyright © 2000 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 May 2000
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
ICS '00 Paper Acceptance Rate33of122submissions,27%Overall Acceptance Rate584of2,055submissions,28%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 44
  Total Citations
  View Citations
- 619
  Total Downloads
- Downloads (Last 12 months)35
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Performance analysis of distributed applications using automatic classification of communication inefficiencies

ICS '00: Proceedings of the 14th international conference on Supercomputing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Analysis of classification margin for classification accuracy with applications

Performance analysis of soft computing techniques for the automatic classification of fruits dataset

Automatic Classification of Strike Techniques Using Limb Trajectory Data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Performance analysis of distributed applications using automatic classification of communication inefficiencies

ICS '00: Proceedings of the 14th international conference on Supercomputing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Analysis of classification margin for classification accuracy with applications

Performance analysis of soft computing techniques for the automatic classification of fruits dataset

Automatic Classification of Strike Techniques Using Limb Trajectory Data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media