ACM Home Page
Please provide us with feedback. Feedback
Hypergraph partitioning for automatic memory hierarchy management
Full text HtmlHtml (2 KB),  PdfPdf (351 KB)
Source Conference on High Performance Networking and Computing archive
Proceedings of the 2006 ACM/IEEE conference on Supercomputing table of contents
Tampa, Florida
SESSION: Technical papers table of contents
Article No. 98  
Year of Publication: 2006
ISBN:0-7695-2700-0
Authors
Sriram Krishnamoorthy  The Ohio State University
Umit Catalyurek  The Ohio State University
Jarek Nieplocha  The Ohio State University
Atanas Rountev  The Ohio State University
P. Sadayappan  The Ohio State University
Sponsors
IEEE : Institute of Electrical and Electronics Engineers
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 47,   Citation Count: 1
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1188455.1188558
What is a DOI?

ABSTRACT

In this paper, we present a mechanism for automatic management of the memory hierarchy, including secondary storage, in the context of a global address space parallel programming framework. The programmer specifies the parallelism and locality in the computation. The scheduling of the computation into stages, together with the movement of the associated data between secondary storage and global memory, and between global memory and local memory, is automatically managed. A novel formulation of hypergraph partitioning is used to model the optimization problem of minimizing disk I/O. Experimental evaluation of the proposed approach using a sub-computation from the quantum chemistry domain shows a reduction in the disk I/O cost by up to a factor of 11, and a reduction in turnaround time by up to 49%, as compared to alternative approaches used in state-of-the-art quantum chemistry codes.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
 
3
 
4
 
5
Chang, C., Kurc, T., Sussman, A., Çatalyürek, U. V., And Saltz, J. 2001. A hypergraph-based workload partitioning strategy for parallel data aggregation. In Proceedings of the Eleventh SIAM Conference on Parallel Processing for Scientific Computing, SIAM.
 
6
Crawford, T., And III, H. S. 2000. An Introduction to Coupled Cluster Theory for Computational Chemists. In Reviews in Computational Chemistry, K. Lipkowitz and D. Boyd, Ed., vol. 14. John Wiley & Sons, Ltd., 33--136.
7
 
8
Hendrickson, B., And Leland, R. 1994. The Chaco user's guide: Version 2.0. Tech. Rep. SAND94-2692, Sandia National Laboratories.
 
9
High Performance Computational Chemistry Group. 2004. NWChem, A Computational Chemistry Package for Parallel Computers, Version 4.6. Pacific Northwest National Laboratory.
10
11
 
12
13
 
14
Krishnamoorthy, S., Catalyurek, U., Nieplocha, J., Rountev, A., And Sadayappan, P. 2006. An extensible global address space frame-work with decoupled task and data abstractions. In Proc. IPDPS Workshop on Next Generation Software.
 
15
Krishnan, S., Krishnamoorthy, S., Baumgartner, G., Cociorva, D., Lam, C., Sadayappan, P., Ramanujam, J., Bernholdt, D., And Choppella, V. 2003. Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. In Proc. 10th Annual International Conference on High Performance Computing (HiPC), Springer Verlag, 406--417.
 
16
Krishnan, S., Krishnamoorthy, S., Baumgartner, G., Lam, C.-C., Ramanujam, J., Sadayappan, P., And Choppella, V. 2004. Efficient synthesis of out-of-core algorithms for tensor contractions using a nonlinear optimization solver. In The 18th International Parallel and Distributed Processing Symposium.
 
17
18
19
 
20
 
21
 
22
 
23
Sinha, A., And Kalé, L. 1993. A load balancing strategy for prioritized execution of tasks. In Seventh International Parallel Processing Symposium, 230--237.
 
24
Tuminaro, R. S., Heroux, M., Hutchinson, S. A., And Shadid, J. N. 1999. Official Aztec user's guide: Version 2.1. Tech. rep., Sandia National Laboratories.


Collaborative Colleagues:
Sriram Krishnamoorthy: colleagues
Umit Catalyurek: colleagues
Jarek Nieplocha: colleagues
Atanas Rountev: colleagues
P. Sadayappan: colleagues