skip to main content
10.1145/2484425.2484433acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Scale-up graph processing: a storage-centric view

Published:23 June 2013Publication History

ABSTRACT

The determinant of performance in scale-up graph processing on a single system is the speed at which the graph can be fetched from storage: either from disk into memory or from memory into CPU-cache. Algorithms that follow edges perform random accesses to the storage medium for the graph and this can often be the determinant of performance, regardless of the algorithmic complexity or runtime efficiency of the actual algorithm in use. A storage-centric viewpoint would suggest that the solution to this problem lies in recognizing that graphs represent a unique workload and therefore should be treated as such by adopting novel ways to access graph structured data. We approach this problem from two different aspects and this paper details two different efforts in this direction. One approach is specific to graphs stored on SSDs and accelerates random access using a novel prefetcher called RASP. The second approach takes a fresh look at how graphs are accessed and suggests that trading off the low cost of random access for the approach of sequentially streaming a large set of (potentially unrelated) edges can be a winning proposition under certain circumstances: leading to a system for graphs stored on any medium (main-memory, SSD or magnetic disk) called X-stream. RASP and X-stream therefore take - diametrically opposite - storage centric viewpoints of the graph processing problem. After contrasting the approaches and demonstrating the benefit of each, this paper ends with a description of planned future development of an online algorithm that selects between the two approaches, possibly providing the best of both worlds.

References

  1. Aapo Kyrola and Guy Blelloch. Graphchi: Large-scale graph computation on just a PC. In OSDI. USENIX Association, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Grzegorz Malewicz, Matthew H. Austern, Aart J. C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD, pages 135--146. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Joseph E. Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, and Carlos Guestrin. Powergraph: distributed graph-parallel computation on natural graphs. In OSDI, pages 17--30. USENIX Association, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Amitabha Roy, Karthik Nilakant, Valentin Dalibard, and Eiko Yoneki. Mitigating I/O latency in SSD-based graph traversal. Technical Report UCAM-CL-TR-823, University of Cambridge, November 2012.Google ScholarGoogle Scholar
  5. Philip Stutz, Abraham Bernstein, and William Cohen. Signal/collect: graph algorithms for the (semantic) web. In ISWC - Volume Part I, pages 764--780. Springer-Verlag, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Sungpack Hong, Tayo Oguntebi, and Kunle Olukotun. Efficient parallel graph exploration on multi-core CPU and GPU. In PACT, pages 78--88. IEEE Computer Society, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Virat Agarwal, Fabrizio Petrini, Davide Pasetto, and David A. Bader. Scalable graph exploration on multicore processors. In SC, pages 1--11. IEEE Computer Society, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Roger Pearce, Maya Gokhale, and Nancy M. Amato. Multithreaded asynchronous graph traversal for in-memory and semi-external memory. In SC, pages 1--11. IEEE Computer Society, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Sungpack Hong, Hassan Chafi, Edic Sedlar, and Kunle Olukotun. Green-marl: a DSL for easy and efficient graph analysis. In ASPLOS, pages 349--362. ACM, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. http://twitter.mpi-sws.org/.Google ScholarGoogle Scholar
  11. http://snap.stanford.edu/.Google ScholarGoogle Scholar
  12. http://www.graph500.org/.Google ScholarGoogle Scholar
  13. Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University, 1998.Google ScholarGoogle Scholar
  14. Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon. What is twitter, a social network or a news media? In WWW, pages 591--600. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. http://dimacs.rutgers.edu/Challenges/.Google ScholarGoogle Scholar
  16. http://www.cise.ufl.edu/research/sparse/matrices/LAW/sk-2005.html.Google ScholarGoogle Scholar
  17. http://webscope.sandbox.yahoo.com/.Google ScholarGoogle Scholar
  18. Jure Leskovec, Jon Kleinberg, and Christos Faloutsos. Graph evolution: Densification and shrinking diameters. ACM Trans. Knowl. Discov. Data, 1(1), March 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    GRADES '13: First International Workshop on Graph Data Management Experiences and Systems
    June 2013
    101 pages
    ISBN:9781450321884
    DOI:10.1145/2484425

    Copyright © 2013 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 23 June 2013

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

    Acceptance Rates

    Overall Acceptance Rate29of61submissions,48%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader