ABSTRACT
We demonstrate that the Qbox code supports First-Principles Molecular Dynamics (FPMD) applications of unprecedented scale on the BlueGene/L supercomputer. Qbox is an FPMD implementation designed specifically for large-scale parallel platforms such as BlueGene/L. Strong scaling tests for a Materials Science application show an 86% scaling efficiency between 1024 and 32,768 CPUs. Performance measurements using hardware counters show that 36% of the peak FPU performance can be attained.
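For readers unfamiliar with the metric, the sketch below shows one common way to define strong scaling efficiency: the measured speedup on a fixed-size problem divided by the ideal speedup from the added processors. The abstract does not state which definition was used, so this definition is an assumption, and the function and parameter names are hypothetical. No timings from the paper are used; the only inputs are the 86% figure and the two CPU counts quoted above.

```python
def strong_scaling_efficiency(t_base, p_base, t_scaled, p_scaled):
    """Strong scaling efficiency for a fixed problem size.

    Assumed (standard) definition, not confirmed by the source:
    measured speedup divided by the ideal speedup p_scaled / p_base.
    """
    speedup = t_base / t_scaled
    ideal_speedup = p_scaled / p_base
    return speedup / ideal_speedup


def implied_speedup(efficiency, p_base, p_scaled):
    """Speedup implied by a given efficiency under the same definition."""
    return efficiency * (p_scaled / p_base)


if __name__ == "__main__":
    # Figures quoted in the abstract: 86% efficiency, 1024 -> 32,768 CPUs.
    # The CPU count grows 32-fold, so the implied speedup is about 27.5.
    print(implied_speedup(0.86, 1024, 32768))
```

Under this assumed definition, an 86% efficiency across a 32-fold increase in CPU count corresponds to a speedup of roughly 27.5 rather than the ideal 32.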