skip to main content
10.1145/2882903.2899398acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Quegel: A General-Purpose System for Querying Big Graphs

Published: 26 June 2016 Publication History

Abstract

Inspired by Google's Pregel, many distributed graph processing systems have been developed recently to process big graphs. These systems expose a vertex-centric programming interface to users, where a programmer thinks like a vertex when designing parallel graph algorithms. However, existing systems are designed for tasks where most vertices in a graph participate in the computation, and they are not suitable for processing light-workload graph queries which only access a small portion of vertices. This is because their programming model can seriously under-utilize the resources in a cluster for processing graph queries. In this demonstration, we introduce a general-purpose system for querying big graphs, called Quegel, which treats queries as first-class citizens in the design of its computing model. Quegel adopts a novel superstep-sharing execution model to overcome the weaknesses of existing systems. We demonstrate it is user-friendly to write parallel graph-querying programs with Quegel's interface; and we also show that Quegel is able to achieve real-time response time in various applications, including the two applications that we plan to demonstrate: point-to-point shortest-path queries and XML keyword search.

References

[1]
Apache Giraph: http://giraph.apache.org.
[2]
Quegel: http://www.cse.cuhk.edu.hk/quegel/.
[3]
J. E. Gonzalez, Y. Low, H. Gu, D. Bickson, and C. Guestrin. Powergraph: Distributed graph-parallel computation on natural graphs. In OSDI, pages 17--30, 2012.
[4]
R. Jin, N. Ruan, B. You, and H. Wang. Hub-accelerator: Fast and exact shortest path computation in large social networks. CoRR, abs/1305.0507, 2013.
[5]
A. Khan and S. Elnikety. Systems for big-graphs. PVLDB, 7(13):1709--1710, 2014.
[6]
Y. Lu, J. Cheng, D. Yan, and H. Wu. Large-scale distributed graph computing systems: An experimental evaluation. PVLDB, 8(3):281--292, 2014.
[7]
G. Malewicz, M. H. Austern, A. J. C. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD, pages 135--146, 2010.
[8]
S. Sakr, S. Elnikety, and Y. He. G-SPARQL: a hybrid engine for querying large attributed graphs. In CIKM, pages 335--344, 2012.
[9]
M. Sarwat, S. Elnikety, Y. He, and M. F. Mokbel. Horton+: A distributed system for processing declarative reachability queries over partitioned graphs. PVLDB, 6(14):1918--1929, 2013.
[10]
D. Yan, J. Cheng, Y. Lu, and W. Ng. Blogel: A block-centric framework for distributed computation on real-world graphs. PVLDB, 7(14):1981--1992, 2014.
[11]
D. Yan, J. Cheng, Y. Lu, and W. Ng. Effective techniques for message reduction and load balancing in distributed graph computation. In WWW, pages 1307--1317, 2015.
[12]
D. Yan, J. Cheng, M. T. Özsu, F. Yang, Y. Lu, J. C. S. Lui, Q. Zhang, and W. Ng. A general-purpose query-centric framework for querying big graphs. PVLDB, 9(7):564--575, 2016.

Cited By

View all
  • (2024)Automating Vectorized Distributed Graph ComputationProceedings of the ACM on Management of Data10.1145/36988332:6(1-27)Online publication date: 20-Dec-2024
  • (2023)Achieving Sub-second Pairwise Query over Evolving GraphsProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3575693.3576173(1-15)Online publication date: 27-Jan-2023
  • (2021)Parallel mining of large maximal quasi-cliquesThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00712-231:4(649-674)Online publication date: 26-Nov-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '16: Proceedings of the 2016 International Conference on Management of Data
June 2016
2300 pages
ISBN:9781450335317
DOI:10.1145/2882903
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 June 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. big data
  2. distributed
  3. graph
  4. pregel
  5. query

Qualifiers

  • Research-article

Funding Sources

  • Hong Kong Research Grants Council (RGC) General Research Funding (GRF)

Conference

SIGMOD/PODS'16
Sponsor:
SIGMOD/PODS'16: International Conference on Management of Data
June 26 - July 1, 2016
California, San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Automating Vectorized Distributed Graph ComputationProceedings of the ACM on Management of Data10.1145/36988332:6(1-27)Online publication date: 20-Dec-2024
  • (2023)Achieving Sub-second Pairwise Query over Evolving GraphsProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 210.1145/3575693.3576173(1-15)Online publication date: 27-Jan-2023
  • (2021)Parallel mining of large maximal quasi-cliquesThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00712-231:4(649-674)Online publication date: 26-Nov-2021
  • (2021)G-thinker: a general distributed framework for finding qualified subgraphs in a big graph with load balancingThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00688-z31:2(287-320)Online publication date: 4-Aug-2021
  • (2020)Optimizing Distance Computation in Distributed Graph SystemsIEEE Access10.1109/ACCESS.2020.30327278(191673-191682)Online publication date: 2020
  • (2019)PathQuery Pregel: high-performance graph query with bulk synchronous processingPattern Analysis and Applications10.1007/s10044-019-00841-z23:3(1493-1504)Online publication date: 1-Aug-2019
  • (2019)Distributed Landmark Selection for Lower Bound Estimation of Distances in Large GraphsWeb and Big Data10.1007/978-3-030-26072-9_16(223-239)Online publication date: 18-Jul-2019
  • (2018)Construing the big data based on taxonomy, analytics and approachesIran Journal of Computer Science10.1007/s42044-018-0024-31:4(237-259)Online publication date: 3-Oct-2018
  • (2017)Hands-On ExperiencesSystems for Big Graph Analytics10.1007/978-3-319-58217-7_3(23-36)Online publication date: 2-Jun-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media