ACM Home Page
Please provide us with feedback. Feedback
DART: a substrate for high speed asynchronous data IO
Full text PdfPdf (74 KB)
Source
High Performance Distributed Computing archive
Proceedings of the 17th international symposium on High performance distributed computing table of contents
Boston, MA, USA
POSTER SESSION: Poster session table of contents
Pages 219-220  
Year of Publication: 2008
ISBN:978-1-59593-997-5
Authors
Ciprian Docan  Rutgers University, Piscataway, NJ, USA
Manish Parashar  Rutgers University, Piscataway, NJ, USA
Scott Klasky  Oak Ridge National Laboratory, Oak Ridge, TN, USA
Sponsors
ACM: Association for Computing Machinery
SIGARCH: ACM Special Interest Group on Computer Architecture
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 22,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1383422.1383454
What is a DOI?

ABSTRACT

Large scale simulations of complex physics phenomena have long run times and generate massive amounts of data. Saving this data to external storage systems or transferring it to remote locations for analysis is a costly operation that quickly becomes a performance bottleneck. In this paper, we present DART (Decoupled and Asynchronous Remote Transfers), an efficient data transfer substrate that effectively minimizes the data I/O overhead on the running simulations. DART is a thin software layer built on RDMA technology to enable fast, low-overhead and asynchronous access to data from a running simulation, and support high-throughput, low-latency data transfers.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
R. Brightwell, T. Hudson, K. Pedretti, R. Riesen, and K. Underwood. Implementation and Performance of Portals 3.3 on the Cray XT3. IEEE International Conference on Cluster Computing, September 2005.
 
2
K. S. Hasan Abbasi, Matthew Wolf. Live data workspace: A flexible, dynamic and extensible platform for petascale applications. In Cluster Computing, Austin, TX, September 2007. IEEE International.
 
3
C. F. S. Inc. Lustre: A Scalable, High Performance File System. http://lustre.org/docs/whitepaper.pdf.
 
4
I. Inc. Gpfs: A Shared-Disk File System for Large Computing Clusters. www.almaden.ibm.com/StorageSystems/projects/gpfs/Fast02.pdf.
 
5
N. Stone, D. Balog, B. Gill, B. Johanson, J. Marsteller, P. Nowoczynski, D. Porter, R. Reddy, J. R. Scott, D. Simmel, J. Sommerfield, K. Vargo, and C. Vizino. Pdio: High-performance remote file i/o for portals-enabled compute nodes. In H. R. Arabnia, editor, PDPTA, pages 925--930. CSREA Press, 2006.
 
6
R. Thakur, W. Gropp, and E. Lusk. Data sieving and collective I/O in ROMIO. In Proceedings of the Seventh Symposium on the Frontiers of Massively Parallel Computation, pages 182--189. IEEE Computer Society Press, 1999.

Collaborative Colleagues:
Ciprian Docan: colleagues
Manish Parashar: colleagues
Scott Klasky: colleagues