skip to main content
10.1145/1713072.1713078acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Case studies in storage access by loosely coupled petascale applications

Published: 14 November 2009 Publication History

Abstract

A large number of real-world scientific applications can be characterized as loosely coupled: the communication among tasks is infrequent and can be performed by using file operations. While these applications may be ported to large scale machines designed for tightly coupled, massively parallel jobs, direct implementations do not perform well because of the large number of small, latency-bound file accesses. This problem may be overcome through the use of a variety of custom, hand-coded strategies applied at various subsystems of modern near-petascale computers- but is a labor intensive process that will become increasingly difficult at the petascale and beyond. This work profiles the essential operations in the I/O workload for five loosely coupled scientific applications. We characterize the I/O workload induced by these applications and offer an analysis to motivate and aid the development of programming tools, I/O subsystems, and filesystems.

References

[1]
N. Ali, A. Devulapalli, D. Dalessandro, P. Wyckoff, and P. Sadayappan. An OSD-based approach to managing directory operations in parallel file systems. In Proc. CLUSTER, 2008.
[2]
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. Basic local alignment search tool. J. Molecular Biology, 215(3), 1990.
[3]
J. Bent, D. Thain, A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, and M. Livny. Explicit control in a batch-aware distributed file system. In Proc. USENIX Symposium on Networked Systems Design and Implementation, 2004.
[4]
P. Carns, S. Lang, R. Ross, M. Vilayannur, J. Kunkel, and T. Ludwig. Small-file access in parallel file systems. In Proc. International Parallel and Distributed Processing Symposium, 2009.
[5]
P. A. Cheeseman, M. W. Deem, D. J. Earl, and W. I. Whitson. Adapting an application for use in a Condor based parameter sweep on TeraGrid. In Proc. TeraGrid 2007 Conference, 2007.
[6]
Y. Chen, W. Chen, M. H. Cobb, and Y. Zhao. PTMap -- A sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites. Proceedings of the National Academy of Sciences of the USA, 106(3), 2009.
[7]
A. Ching, K. Coloma, J. Li, W. keng Liao, and A. Choudhary. High-performance techniques for parallel I/O. In Handbook of Parallel Computing: Models, Algorithms and Applications, chapter 35. 2008.
[8]
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In Proc. Operating Systems Design and Implementation, 2004.
[9]
J. DeBartolo, A. Colubri, A. K. Jha, J. Fitzgerald, and T. R. S. Karl F. Freed. Mimicking the folding pathway to improve homology-free protein structure prediction. Proc. National Academy of Sciences, 106(10), 2009.
[10]
J. J. Dongarra and D. W. Walker. The quest for petascale computing. Computing in Science and Engineering, 3(3), 2001.
[11]
M. P. I. Forum. MPI: A message-passing interface standard, 1994.
[12]
Y. Gil, P. A. González-Calero, and E. Deelman. On the black art of designing computational workflows. In Proc. Workshop on Workflows in Support of Large-Scale Science, 2007.
[13]
G. Grider, L. Ward, R. Ross, and G. Gibson. A business case for extensions to the POSIX I/O API for high end, clustered, and highly concurrent computing, 2006. Available at: http://www.opengroup.org/platform/hecewg/uploads/40/10891/POSIXIO-API-Business-case-HEC-ggrider.pdf.
[14]
U. Hasson, J. I. Skipper, M. J. Wilde, H. C. Nusbaum, and S. L. Small. Improving the analysis, storage and sharing of neuroimaging data using relational databases and distributed computing. Neuroimage, 39(2), 2008.
[15]
D. M. Lorber and B. K. Shoichet. Hierarchical docking of databases of multiple ligand conformations. Current Topics in Medicinal Chemistry, 5(8), 2005.
[16]
G. Mackey, S. Sehrish, J. Bent, J. Lopez, S. Habib, and J. Wang. Introducing Map-Reduce to high end computing. In Proc. Petascale Data Storage Workshop, 2008.
[17]
H. M. Monti, A. R. Butt, and S. S. Vazhkudai./Scratch as a cache: Rethinking HPC center scratch storage. In Proc. International Conference on Supercomputing, 2008.
[18]
C. Moretti, J. Bulosan, D. Thain, and P. J. Flynn. All-pairs: An abstraction for data-intensive cloud computing. In Proc. International Parallel and Distributed Processing Symposium, 2008.
[19]
E. B. Nightingale, P. M. Chen, and J. Flinn. Speculative execution in a distributed file system. ACM Transactions on Computer Systems, 24(4), 2006.
[20]
J. Ousterhout. Scripting: Higher-level programming for the 21st century. IEEE Computer, Mar. 1998.
[21]
I. Raicu, I. Foster, Y. Zhao, P. Little, C. Moretti, A. Chaudhary, and D. Thain. The quest for scalable support of data-intensive workloads in distributed systems. In Proc. High Performance Distributed Computing, 2009.
[22]
I. Raicu, Z. Zhang, M. Wilde, I. Foster, P. Beckman, K. Iskra, and B. Clifford. Towards loosely-coupled programming on petascale systems. In Proc. SC'08, 2008.
[23]
I. Raicu, Y. Zhao, C. Dumitrescu, I. Foster, and M. Wilde. Falkon: A Fast and Light-weight tasK executiON framework. In Proc SC'07, 2007.
[24]
F. Schmuck and R. Haskin. GPFS: A shared-disk file system for large computing clusters. In Proc. USENIX Conference on File and Storage Technologies, 2002.
[25]
S. Shepler, B. Callaghan, D. Robinson, R. Thurlow, C. Beame, M. Eisler, and D. Noveck. Network File System (NFS) version 4 protocol. RFC 3530, 2003.
[26]
I. Taylor, E. Deelman, D. Gannon, and M. Shields, editors. Workflows for e-Science. Springer, 2007.
[27]
D. Thain and C. Moretti. Efficient access to many small files in a filesystem for grid computing. In Proc. Conference on Grid Computing, 2007.
[28]
D. Thain, T. Tannenbaum, and M. Livny. Distributed computing in practice: The Condor experience. Concurrency and Computation: Practice and Experience, 17(2--4), 2005.
[29]
R. Thakur, W. Gropp, and E. Lusk. On implementing MPI-IO portably and with high performance. In Proc. of the Sixth Workshop on I/O in Parallel and Distributed Systems, May 1999.
[30]
Z. Zhang, A. Espinosa, K. Iskra, I. Raicu, I. Foster, and M. Wilde. Design and evaluation of a collective I/O model for loosely-coupled petascale programming. In Proc. MTAGS Workshop and SC'08, 2008.
[31]
Y. Zhao, M. Hategan, B. Clifford, I. Foster, G. von Laszewski, I. Raicu, T. Stef-Praun, and M. Wilde. Swift: Fast, reliable, loosely coupled parallel computation. In Proc. Workshop on Scientific Workflows, 2007.

Cited By

View all
  • (2017)A cross-layer optimized storage system for workflow applicationsFuture Generation Computer Systems10.1016/j.future.2017.02.03875(423-437)Online publication date: Oct-2017
  • (2016)Support for Provisioning and Configuration Decisions for Data Intensive WorkflowsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.249769327:9(2725-2739)Online publication date: 1-Sep-2016
  • (2016)A Software-Defined Storage for Workflow Applications2016 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER.2016.56(350-353)Online publication date: Sep-2016
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
PDSW '09: Proceedings of the 4th Annual Workshop on Petascale Data Storage
November 2009
58 pages
ISBN:9781605588834
DOI:10.1145/1713072
  • Conference Chair:
  • Garth A. Gibson
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 November 2009

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Funding Sources

Conference

SC '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 17 of 41 submissions, 41%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2017)A cross-layer optimized storage system for workflow applicationsFuture Generation Computer Systems10.1016/j.future.2017.02.03875(423-437)Online publication date: Oct-2017
  • (2016)Support for Provisioning and Configuration Decisions for Data Intensive WorkflowsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2015.249769327:9(2725-2739)Online publication date: 1-Sep-2016
  • (2016)A Software-Defined Storage for Workflow Applications2016 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER.2016.56(350-353)Online publication date: Sep-2016
  • (2015)The Case for Workflow-Aware StorageJournal of Grid Computing10.1007/s10723-014-9307-613:1(95-113)Online publication date: 1-Mar-2015
  • (2014)Supporting storage configuration for I/O intensive workflowsProceedings of the 28th ACM international conference on Supercomputing10.1145/2597652.2597679(191-200)Online publication date: 10-Jun-2014
  • (2013)Predicting intermediate storage performance for workflow applicationsProceedings of the 8th Parallel Data Storage Workshop10.1145/2538542.2538560(33-38)Online publication date: 17-Nov-2013
  • (2013)Evaluating cloud computing techniques for smart power grid design using parallel scriptingProceedings of the 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2013.26(319-326)Online publication date: 13-May-2013
  • (2013)JETSJournal of Grid Computing10.1007/s10723-013-9259-211:3(341-360)Online publication date: 1-Sep-2013
  • (2012)Design and analysis of data management in scalable parallel scriptingProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/2388996.2389112(1-11)Online publication date: 10-Nov-2012
  • (2012)Job and data clustering for aggregate use of multiple production cyberinfrastructuresProceedings of the fifth international workshop on Data-Intensive Distributed Computing Date10.1145/2286996.2287000(3-12)Online publication date: 19-Jun-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media