ACM Home Page
Please provide us with feedback. Feedback
Using early phase termination to eliminate load imbalances at barrier synchronization points
Full text PdfPdf (258 KB)
Source
Conference on Object Oriented Programming Systems Languages and Applications archive
Proceedings of the 22nd annual ACM SIGPLAN conference on Object oriented programming systems and applications table of contents
Montreal, Quebec, Canada
SESSION: Isolation and repair table of contents
Pages: 369 - 386  
Year of Publication: 2007
ISBN:978-1-59593-786-5
Also published in ...
Author
Martin C. Rinard  Massachusetts Institute of Technology, Cambridge, MA
Sponsors
SIGPLAN: ACM Special Interest Group on Programming Languages
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 2,   Downloads (12 Months): 71,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1297027.1297055
What is a DOI?

ABSTRACT

We present a new technique, early phase termination, for eliminating idle processors in parallel computations that use barrier synchronization. This technique simply terminates each parallel phaseas soon as there are too few remaining tasks to keep all of the processors busy.

Although this technique completely eliminates the idling that would other wise occur at barrier synchronization points, it may also change the computation and therefore the result that the computation produces. We address this issue by providing probabilistic distortion models that characterize how the use of early phase termination distorts the result that the computation produces. Our experimental results show that for our set of benchmark applications, 1) early phase termination can improve the performance of the parallel computation, 2) the distortion is small (or can be made to be small with the use of an appropriate compensation technique) and 3) the distortion models provide accurate and tight distortion bounds. These bounds can enable users to evaluate the effect of early phase termination and confidently accept results from parallel computations that use this technique if they find the distortion bounds to be acceptable.

Finally, we identify a general computational pattern that works well with early phase termination and explain why computations that exhibit this pattern can tolerate the early termination of parallel tasks without producing unacceptable results.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
C. Ananian and M. Rinard. Efficient object-based software transactions. In Proceedings of the Workshop on Synchronization and Concurrency in Object-Oriented Languages, San Diego, CA, Oct. 2005.
 
2
 
3
J. Barnes and P. Hut. A hierarchical O(NlogN) force calculation algorithm. Nature, 324(4):446--449, Dec. 1986.
 
4
 
5
R. Browning, T. Li, B. Chui, J. Ye, R. Pease, Z. Czyzewski, and D. Joy. Empirical forms for the electron/atom elastic scattering cross sections from 0.1-30keV. J. Appl. Phys., 76(4):2016--2022, Aug. 1994.
 
6
R. Browning, T. Li, B. Chui, J. Ye, R. Pease, Z. Czyzewski, and D. Joy. Low-energy electron/atom elastic scattering cross sections for 0.1-30keV. Scanning, 17(4):250--253, July/August 1995.
 
7
 
8
 
9
A. Frommer and D. Szyld. On asynchronous iterations.
 
10
11
 
12
J. Harris, S. Lazaratos, and R. Michelena. Tomographic string inversion. In Proceedings of the 60th Annual International Meeting, Society of Exploration and Geophysics, Extended Abstracts, pages 82--85, 1990.
13
 
14
T. Kay and J. Kajiya. Ray tracing complex scenes. Computer Graphics (Proceedings of SIGGRAPH'86), 20(4):269--78, Aug. 1986.
 
15
C. Moler. Numerical Computing with Matlab. Society for Industrial and Applied Mathematics, 2004.
 
16
J. Nieh and M. Levoy. Volume rendering on scalable shared-memory MIMD architectures. Technical Report CSL-TR-92-537, Computer Systems Laboratory, Stanford Univ., Stanford, Calif., Aug. 1992.
 
17
18
19
20
 
21