skip to main content
10.1145/582034.582080acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article

High-performance remote access to climate simulation data: a challenge problem for data grid technologies

Published:10 November 2001Publication History

ABSTRACT

In numerous scientific disciplines, terabyte and soon petabyte-scale data collections are emerging as critical community resources. A new class of Data Grid infrastructure is required to support management, transport, distributed access to, and analysis of these datasets by potentially thousands of users. Researchers who face this challenge include the Climate Modeling community, which performs long-duration computations accompanied by frequent output of very large files that must be further analyzed. We describe the Earth System Grid prototype, which brings together advanced analysis, replica management, data transfer, request management, and other technologies to support high-performance, interactive analysis of replicated data. We present performance results that demonstrate our ability to manage the location and movement of large datasets from the user's desktop. We report on experiments conducted over SciNET at SC'2000, where we achieved peak performance of 1.55Gb/s and sustained performance of 512.9Mb/s for data transfers between Texas and California.

References

  1. "Climate Data Analysis Tool," http://www.pcmdi.llnl.gov/software/cdat/index.html.Google ScholarGoogle Scholar
  2. W. Allcock, J. Bester, J. Bresnahan, A. L. Chervenak, I. Foster, C. Kesselman, S. Meder, V. Nefedova, D. Quesnel, and S. Tuecke, "Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing," presented at Mass Storage Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. C. Baru, R. Moore, A. Rajasekar, and M. Wan, "The SDSC Storage Resource Broker," presented at Proc. CASCON'98 Conference, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Chervenak, I. Foster, C. Kesselman, C. Salisbury, and S. Tuecke, "The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Data Sets," J. Network and Computer Applications, pp. 187-200, 2001.Google ScholarGoogle Scholar
  5. K. Czajkowski, S. Fitzgerald, I. Foster, and C. Kesselman, "Grid Information Services for Distributed Resource Sharing," presented at IEEE International Symposium on High Performance Distributed Computing, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. I. Foster and C. Kesselman, "Globus: A Metacomputing Infrastructure Toolkit," International Journal of Supercomputer Applications, vol. 11, pp. 115-128, 1997.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. I. Foster, C. Kesselman, G. Tsudik, and S. Tuecke, "A Security Architecture for Computational Grids," in ACM Conference on Computers and Security, 1998, pp. 83-91. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Foster and C. Kesselman, "The Grid: Blueprint for a New Computing Infrastructure,".: Morgan Kaufmann, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Foster and C. Kesselman, "Globus: A Toolkit-Based Grid Architecture," in The Grid: Blueprint for a New Computing Infrastructure, I. Foster and C. Kesselman, Eds.: Morgan Kaufmann, 1999, pp. 259-278. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. I. Foster and C. Kesselman, "A Data Grid Reference Architecture," GriPhyN 2001-6, 2001.Google ScholarGoogle Scholar
  11. I. Foster, C. Kesselman, and S. Tuecke, "The Anatomy of the Grid: Enabling Scalable Virtual Organizations," Intl. J. Supercomputer Applications, vol. (to appear), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. P. A. Fox, J. Garcia, and P. Kellogg, "The HAO Data Service: Experience in Interdisciplinary Data Delivery," presented at Proc. of the CODATA 2000 Workshop, US National Academy, 2000.Google ScholarGoogle Scholar
  13. D. Gunter, B. Tierney, B. Crowley, M. Holding, and J. Lee, "NetLogger: a toolkit for distributed system performance analysis.," presented at 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. NTONC, "NTON Connection in support of SC2000," http://www.ntonc.org/docs/NTON_ConnectionsForSC2000v1.1.ppt, 2000.Google ScholarGoogle Scholar
  15. L. Qiu, Y. Zhang, and S. Keshav, "On Individual and Aggregate TCP Performance," presented at 7th Intl. Conference on Network Protocols (ICNP'99), Toronto, Canada, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. B. Tierney, "TCP Tuning Guide for Distributed Applications on Wide Area Networks," presented at Usenix; login, 2001.Google ScholarGoogle Scholar
  17. S. Vazhkudai, S. Tuecke, and I. Foster, "Replica Selection in the Globus Data Grid," presented at International Workshop on Data Models and Databases on Clusters and the Grid (DataGrid 2001), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. R. Wolski, "Forecasting Network Performance to Support Dynamic Scheduling Using the Network Weather Service," in Proc. 6th IEEE Symp. on High Performance Distributed Computing. Portland, Oregon, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. High-performance remote access to climate simulation data: a challenge problem for data grid technologies

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SC '01: Proceedings of the 2001 ACM/IEEE conference on Supercomputing
            November 2001
            756 pages
            ISBN:158113293X
            DOI:10.1145/582034

            Copyright © 2001 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 10 November 2001

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            SC '01 Paper Acceptance Rate60of240submissions,25%Overall Acceptance Rate1,516of6,373submissions,24%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader