skip to main content
10.1145/1122971.1122983acmconferencesArticle/Chapter ViewAbstractPublication PagesppoppConference Proceedingsconference-collections
Article

Performance characterization of molecular dynamics techniques for biomolecular simulations

Published: 29 March 2006 Publication History

Abstract

Large-scale simulations and computational modeling using molecular dynamics (MD) continues to make significant impacts in the field of biology. It is well known that simulations of biological events at native time and length scales requires computing power several orders of magnitude beyond today's commonly available systems. Supercomputers, such as IBM Blue Gene/L and Cray XT3, will soon make tens to hundreds of teraFLOP/s of computing power available by utilizing thousands of processors. The popular algorithms and MD applications, however, were not initially designed to run on thousands of processors. In this paper, we present detailed investigations of the performance issues, which are crucial for improving the scalability of the MD-related algorithms and applications on massively parallel processing (MPP) architectures. Due to the varying characteristics of biological input problems, we study two prototypical biological complexes that use the MD algorithm: an explicit solvent and an implicit solvent. In particular, we study the AMBER application, which supports a variety of these types of input problems. For the explicit solvent problem, we focused on the particle mesh Ewald (PME) method for calculating the electrostatic energy, and for the implicit solvent model, we targeted the Generalized Born (GB) calculation. We uncovered and subsequently modified a limitation in AMBER that restricted the scaling beyond 128 processors. We collected performance data for experiments on up to 2048 Blue Gene/L and XT3 processors and subsequently identified that the scaling is largely limited by the underlying algorithmic characteristics and also by the implementation of the algorithms. Furthermore, we found that the input problem size of biological system is constrained by memory available per node. In conclusion, our results indicate that MD codes can significantly benefit from the current generation architectures with relatively modest optimization efforts. Nevertheless, the key for enabling scientific breakthroughs lies in exploiting the full potential of these new architectures.

References

[1]
mpiP: Lightweight, Scalable MPI Profiling. http://www.llnl.gov/CASC/mpip/.
[2]
P. K. Agarwal. Enzymes: An integrated view of structure, dynamics and function. Microbial Cell Factories, 5:2, 2006.
[3]
P. K. Agarwal. Role of Protein Dynamics in Reaction Rate Enhancement by Enzymes. Journal of American Chemical Society, 2005.
[4]
P. K. Agarwal, A. Geist, and A. Gorin. Protein Dynamics and Enzymatic Catalysis: Investigating the Peptidyl-Prolyl cis/trans Isomerization Activity of Cyclophilin A. Biochemistry, 43, 2004.
[5]
G. S. Almasi, C. Cascaval, J. G. Castanos, M. Denneau, W. E. Donath, M. Eleftheriou, M. Giampapa amd H. Ho, D. Lieber, J. E. Moreira, D. M. Newns, M. Snir, and H. S. Warren Jr. Demonstrating the Scalability of a Molecular Dynamics Application on a Petaflops Computer. International Journal of Parallel Programming, 30(4), 2002.
[6]
B.R. Brooks, R.E. Bruccoleri, B.D. Olafson, D.J. States, S. Swaminathan, and M. Karplus. CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. Journal of Computational Chemistry, 4, 1983.
[7]
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. The International Journal of High Performance Computing Applications, 2000.
[8]
T. Darden, D. York, and L. Pederson. Particle mesh Ewald--an Nlog(N) method for Ewald sums in large systems. Journal of Chemical Physics, 98, 1993.
[9]
B. G. Fitch, R. S. Germain, M. P. Mendell, J. Pitera, M. Pitman, A. Rayshubskiy, Y. Y. Sham, F. Suits, W. C. Swope, T. J. C. Ward, Y. Zhestkov, and R. Zhou. Blue Matter, an application framework for molecular simulation on Blue Gene. Journal of Parallel and Distributed Computing, 63, 2003.
[10]
N. Goodman. Biological data becomes computer literate: new advances in bioinformatics. Curr. Opin. Biotechnol., 13(1), 2002.
[11]
GROMACS. http://www.gromacs.org/.
[12]
K. Hiroaki. Computational systems biology. Nature, 420, 2002.
[13]
L. Kale, R. Skeel, M. Bhandarkar, R. Brunner, A. Gursoy, N. Krawetz, J. Phillips, A. Shinozaki, K. Varadarajan, and title = "NAMD2 : Greater scalability for parallel molecular dynamics K. Schulten". NAMD2: Greater scalability for parallel molecular dynamics. Journal of Computational Physics, 151, 1999.
[14]
M. Karplus and G.A. Petsko. Molecular dynamics simulations in biology. Nature, 347, 1990.
[15]
A. R. Leach. Molecular Modelling: Principles and Applications. Prentice Hall, 2001.
[16]
D. A. Pearlman, D.A. Case, J.W. Caldwell, W.S. Ross, III T.E. Cheatham, S. DeBolt, D. Ferguson, G. Seibel, and P.Kollman. AMBER, a package of computer programs for applying molecular mechanics, normalmode analysis, molecular dynamics and free energy calculationsto simulate the structural and energetic properties of molecules. Computer Physics Communication, 91, 1995.
[17]
J. C. Phillips, G. Zheng, and L. Kale. NAMD: Biomolecular simulation on thousands of processors. In Supercomputing, 2002.
[18]
S. J. Plimpton. Fast parallel algorithms for short-range molecular dynamics. Journal of Computational Physics, 117, 1995.
[19]
IBM Blue Gene team. Blue Gene: A vision for protein science using a petaflop supercomputer. IBM Systems Jornal, 40, 2001.
[20]
IBM Blue Gene/L Team. An Overview of the Blue Gene/L supercomputer. In Supercomputing 2002.
[21]
V. Tsui and D.A. Case. Theory and applications of the generalized born solvation model in macromolecular simulations. Biopolymers (Nucl. Acid. Sci.), 56, 2001.
[22]
J. Vetter, S. Alam, T. Dunigan, M. Fahey, P. Roth, and P. Worley. Early evaluation of the Cray XT3. In 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2006.
[23]
G. Zheng, T. Wilmarth, P. Jagadishprasad, and L. V. Kale. Simulation-based Performance Prediction for Large Parallel Machines. International Journal of Parallel Programming, 33(2-3), 2005.

Cited By

View all
  • (2016)OpenACC to FPGA: A Framework for Directive-Based High-Performance Reconfigurable Computing2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2016.28(544-554)Online publication date: May-2016
  • (2012)Modelling the Mechanical Behavior of Polymer-Based NanocompositesMaterials Science Forum10.4028/www.scientific.net/MSF.730-732.543730-732(543-548)Online publication date: Nov-2012
  • (2010)Simulating Biomolecules on the Petascale SupercomputersPetascale Computing10.1201/9781584889106.ch11(211-235)Online publication date: 31-Jan-2010
  • Show More Cited By

Index Terms

  1. Performance characterization of molecular dynamics techniques for biomolecular simulations

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    PPoPP '06: Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming
    March 2006
    258 pages
    ISBN:1595931899
    DOI:10.1145/1122971
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 March 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. computational biology
    2. molecular dynamics algorithms
    3. performance analysis
    4. workload characterization

    Qualifiers

    • Article

    Conference

    PPoPP06
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 230 of 1,014 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 07 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2016)OpenACC to FPGA: A Framework for Directive-Based High-Performance Reconfigurable Computing2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS)10.1109/IPDPS.2016.28(544-554)Online publication date: May-2016
    • (2012)Modelling the Mechanical Behavior of Polymer-Based NanocompositesMaterials Science Forum10.4028/www.scientific.net/MSF.730-732.543730-732(543-548)Online publication date: Nov-2012
    • (2010)Simulating Biomolecules on the Petascale SupercomputersPetascale Computing10.1201/9781584889106.ch11(211-235)Online publication date: 31-Jan-2010
    • (2010)Performance Analysis Framework for High-Level Language Applications in Reconfigurable ComputingACM Transactions on Reconfigurable Technology and Systems10.1145/1661438.16614433:1(1-23)Online publication date: 1-Jan-2010
    • (2008)Experimental evaluation of molecular dynamics simulations on multi-core systemsProceedings of the 15th international conference on High performance computing10.5555/1791889.1791907(131-141)Online publication date: 17-Dec-2008
    • (2008)An Evaluation of the Oak Ridge National Laboratory Cray XT3International Journal of High Performance Computing Applications10.1177/109434200708501922:1(52-80)Online publication date: 1-Feb-2008
    • (2008)A regression-based approach to scalability predictionProceedings of the 22nd annual international conference on Supercomputing10.1145/1375527.1375580(368-377)Online publication date: 7-Jun-2008
    • (2008)Performance Analysis with High-Level Languages for High-Performance Reconfigurable ComputingProceedings of the 2008 16th International Symposium on Field-Programmable Custom Computing Machines10.1109/FCCM.2008.18(23-30)Online publication date: 14-Apr-2008
    • (2008)Examining the Feasibility of Reconfigurable Models for Molecular Dynamics SimulationProceedings of the 8th international conference on Algorithms and Architectures for Parallel Processing10.1007/978-3-540-69501-1_13(109-120)Online publication date: 9-Jun-2008
    • (2007)Using FPGA Devices to Accelerate Biomolecular SimulationsComputer10.1109/MC.2007.10840:3(66-73)Online publication date: 1-Mar-2007
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media