skip to main content
10.1145/2493123.2465527acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

Scalable in situ scientific data encoding for analytical query processing

Authors Info & Claims
Published:17 June 2013Publication History

ABSTRACT

The process of scientific data analysis in high-performance computing environments has been evolving along with the advancement of computing capabilities. With the onset of exascale computing, the increasing gap between compute performance and I/O bandwidth has rendered the traditional method of post-simulation processing a tedious process. Despite the challenges due to increased data production, there exists an opportunity to benefit from "cheap" computing power to perform query-driven exploration and visualization during simulation time. To accelerate such analyses, applications traditionally augment raw data with large indexes, post-simulation, which are then repeatedly utilized for data exploration. However, the generation of current state-of-the-art indexes involve a compute- and memory-intensive processing, thus rendering them inapplicable in an in situ context. In this paper we propose DIRAQ, a parallel in situ, in network data encoding and reorganization technique that enables the transformation of simulation output into a query-efficient form, with negligible runtime overhead to the simulation run. DIRAQ begins with an effective core-local, precision-based encoding approach, which incorporates an embedded compressed index that is 3 -- 6x smaller than current state-of-the-art indexing schemes. DIRAQ then applies an in network index merging strategy, enabling the creation of aggregated indexes ideally suited for spatial-context querying that speed up query responses by up to 10x versus alternative techniques. We also employ a novel aggregation strategy that is topology-, data-, and memory-aware, resulting in efficient I/O and yielding overall end-to-end encoding and I/O time that is less than that required to write the raw data with MPI collective I/O.

References

  1. H. Abbasi, G. Eisenhauer, M. Wolf, K. Schwan, and S. Klasky. Just in time: adding value to the IO pipelines of high performance applications with JITStaging. In Proc. Symp. High Performance Distributed Computing (HPDC), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. H. Abbasi, J. Lofstead, F. Zheng, K. Schwan, M. Wolf, and S. Klasky. Extending I/O through high performance data services. In Proc. Conf. Cluster Computing (CLUSTER), Sep 2009.Google ScholarGoogle ScholarCross RefCross Ref
  3. H. Abbasi, M. Wolf, G. Eisenhauer, S. Klasky, K. Schwan, and F. Zheng. DataStager: scalable data staging services for petascale applications. In Proc. Symp. High Performance Distributed Computing (HPDC), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. C. Bennett, H. Abbasi, P.-T. Bremer, R. Grout, A. Gyulassy, T. Jin, S. Klasky, H. Kolla, M. Parashar, V. Pascucci, P. Pebay, D. Thompson, H. Yu, F. Zhang, and J. Chen. Combining in-situ and in-transit processing to enable extreme-scale scientific analysis. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Byna, J. Chou, O. Rübel, Prabhat, H. Karimabadi, W. S. Daughton, V. Roytershteyn, E. W. Bethel, M. Howison, K.-J. Hsu, K.-W. Lin, A. Shoshani, A. Uselton, and K. Wu. Parallel I/O, analysis, and visualization of a trillion particle simulation. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Chaarawi and E. Gabriel. Automatically selecting the number of aggregators for collective I/O operations. In Proc. Conf. Cluster Computing (CLUSTER), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. H. Chen, A. Choudhary, B. de Supinski, M. DeVries, E. R. Hawkes, S. Klasky, W.-K. Liao, K.-L. Ma, J. Mellor-Crummey, N. Podhorszki, R. Sankaran, S. Shende, and C. S. Yoo. Terascale direct numerical simulations of turbulent combustion using S3D. Journal of Computational Science & Discovery (CSD), 2(1), 2009.Google ScholarGoogle Scholar
  8. J. Chou, K. Wu, and Prabhat. FastQuery: a parallel indexing system for scientific data. In Proc. Conf. Cluster Computing (CLUSTER), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Chou, K. Wu, O. Rübel, M. Howison, J. Qiang, Prabhat, B. Austin, E. W. Bethel, R. D. Ryne, and A. Shoshani. Parallel index and query for large scale data analysis. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), Nov 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. M. del Rosario, R. Bordawekar, and A. Choudhary. Improved parallel I/O via a two-phase run-time access strategy. ACM SIGARCH Computer Architecture News, 21(5):31--38, Dec 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. Fryxell, K. Olson, P. Ricker, F. X. Timmes, M. Zingale, D. Q. Lamb, P. MacNeice, R. Rosner, J. W. Truran, and H. Tufo. FLASH: an adaptive mesh hydrodynamics code for modeling astrophysical thermonuclear flashes. Astrophysical Journal Supplement Series, 131:273--334, Nov 2000.Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Fu, R. Latham, M. Min, and C. D. Carothers. I/O threads to reduce checkpoint blocking for an electromagnetics solver on Blue Gene/P and Cray XK6. In Proc. Workshop on Runtime and Operating Systems for Supercomputers (ROSS), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Fu, M. Min, R. Latham, and C. D. Carothers. Parallel I/O performance for application-level checkpointing on the Blue Gene/P system. In Proc. Conf. Cluster Computing (CLUSTER), 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. K. Hornik, M. Stinchcombe, and H. White. Multilayer feedforward networks are universal approximators. Proc. Conf. Neural Networks, Jul 1989.Google ScholarGoogle ScholarCross RefCross Ref
  15. C. Igel and M. Hüsken. Empirical evaluation of the improved Rprop learning algorithm. Journal of Neurocomputing, 50:2003, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  16. J. Jenkins, I. Arkatkar, S. Lakshminarasimhan, N. Shah, E. R. Schendel, S. Ethier, C.-S. Chang, J. H. Chen, H. Kolla, S. Klasky, R. B. Ross, and N. F. Samatova. Analytics-driven lossless data compression for rapid in-situ indexing, storing, and querying. In Proc. Conf. Database and Expert Systems Applications, Part II (DEXA), 2012.Google ScholarGoogle ScholarCross RefCross Ref
  17. J. Kim, H. Abbasi, L. Chacon, C. Docan, S. Klasky, Q. Liu, N. Podhorszki, A. Shoshani, and K. Wu. Parallel in situ indexing for data-intensive computing. In Proc. Symp. Large Data Analysis and Visualization (LDAV), Oct 2011.Google ScholarGoogle ScholarCross RefCross Ref
  18. S. Kumar, V. Vishwanath, P. Carns, J. A. Levine, R. Latham, G. Scorzelli, H. Kolla, R. Grout, R. Ross, M. E. Papka, J. Chen, and V. Pascucci. Efficient data restructuring and aggregation for I/O acceleration in PIDX. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. K. L. Ma. In situ visualization at extreme scale: challenges and opportunities. Journal of Computer Graphics and Application (CG&A), pages 14--19, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. S. Nissen. Implementation of a fast artificial neural network library (fann). Technical report, Department of Computer Science University of Copenhagen (DIKU), Oct 2003. http://fann.sf.net.Google ScholarGoogle Scholar
  21. O. Rübel, Prabhat, K. Wu, H. Childs, J. Meredith, C. G. R. Geddes, E. Cormier-Michel, S. Ahern, G. H. Weber, P. Messmer, H. Hagen, B. Hamann, and E. W. Bethel. High performance multivariate visual data exploration for extremely large data. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. F. Schmuck and R. Haskin. GPFS: a shared-disk file system for large computing clusters. In Proc. Conf. File and Storage Technologies (FAST), Jan 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. R. Thakur and A. Choudhary. An extended two-phase method for accessing sections of out-of-core arrays. Journal of Scientific Programming, 5(4):301--317, Dec 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. T. Tu, H. Yu, J. Bielak, O. Ghattas, J. C. Lopez, K.-L. Ma, D. R. O'Hallaron, L. Ramirez-Guzman, N. Stone, R. Taborda-Rios, and J. Urbanic. Remote runtime steering of integrated terascale simulation and visualization. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. V. Vishwanath, M. Hereld, V. Morozov, and M. E. Papka. Topology-aware data movement and staging for I/O acceleration on Blue Gene/P supercomputing systems. In Proc. Conf. High Performance Computing, Networking, Storage and Analysis (SC), pages 1--11, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. K. Wu. FastBit: an efficient indexing technology for accelerating data-intensive science. In Journal of Physics: Conference Series (JPCS), volume 16, page 556, 2005.Google ScholarGoogle Scholar
  27. K. Wu, E. Otoo, and A. Shoshani. On the performance of bitmap indices for high cardinality attributes. In Proc. Conf Very Large Data Bases (VLDB), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. K. Wu, R. R. Sinha, C. Jones, S. Ethier, S. Klasky, K.-L. Ma, A. Shoshani, and M. Winslett. Finding regions of interest on toroidal meshes. Journal Computational Science & Discovery (CSD), 4(1), 2011.Google ScholarGoogle Scholar
  29. H. Yan, S. Ding, and T. Suel. Inverted index compression and query processing with optimized document ordering. In Proc. Conf. World Wide Web (WWW), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. R. M. Yoo, H. Lee, K. Chow, and H.-H. S. Lee. Constructing a non-linear model with neural networks for workload characterization. In Proc. Symp. Workload Characterization (IISWC), Oct 2006.Google ScholarGoogle ScholarCross RefCross Ref
  31. H. Yu, C. Wang, R. W. Grout, J. H. Chen, and K.-L. Ma. In situ visualization for large-scale combustion simulations. Journal of Computer Graphics and Applications (CG&A), 30(3):45 --57, May-Jun 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. J. Zhang, X. Long, and S. Torsten. Performance of compressed inverted list caching in search engines. In Proc. Conf. World Wide Web (WWW), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. F. Zheng, H. Abbasi, C. Docan, J. Lofstead, Q. Liu, S. Klasky, M. Parashar, N. Podhorszki, K. Schwan, and M. Wolf. PreDatA: preparatory data analytics on peta-scale machines. In Proc. Symp. Parallel Distributed Processing (IPDPS), Apr 2010.Google ScholarGoogle ScholarCross RefCross Ref
  34. M. Zukowski, S. Heman, N. Nes, and P. Boncz. Super-scalar RAM-CPU cache compression. In Proc. Conf. Data Engineering (ICDE), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Scalable in situ scientific data encoding for analytical query processing

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        HPDC '13: Proceedings of the 22nd international symposium on High-performance parallel and distributed computing
        June 2013
        276 pages
        ISBN:9781450319102
        DOI:10.1145/2493123
        • General Chairs:
        • Manish Parashar,
        • Jon Weissman,
        • Program Chairs:
        • Dick Epema,
        • Renato Figueiredo

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 June 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        HPDC '13 Paper Acceptance Rate20of131submissions,15%Overall Acceptance Rate166of966submissions,17%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader