Article

Rank-aware query optimization

Authors:
Ihab F. Ilyas

Purdue University, West Lafayette, Indiana

Purdue University, West Lafayette, Indiana
View Profile

,
Rahul Shah

Purdue University, West Lafayette, Indiana

Purdue University, West Lafayette, Indiana
View Profile

,
Walid G. Aref

Purdue University, West Lafayette, Indiana

Purdue University, West Lafayette, Indiana
View Profile

,
Jeffrey Scott Vitter

Purdue University, West Lafayette, Indiana

Purdue University, West Lafayette, Indiana
View Profile

,
Ahmed K. Elmagarmid

Purdue University, West Lafayette, Indiana

Purdue University, West Lafayette, Indiana
View Profile

SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of dataJune 2004Pages 203–214https://doi.org/10.1145/1007568.1007593

Published:13 June 2004Publication History

SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data

Pages 203–214

ABSTRACT

Ranking is an important property that needs to be fully supported by current relational query engines. Recently, several rank-join query operators have been proposed based on rank aggregation algorithms. Rank-join operators progressively rank the join results while performing the join operation. The new operators have a direct impact on traditional query processing and optimization.We introduce a rank-aware query optimization framework that fully integrates rank-join operators into relational query engines. The framework is based on extending the System R dynamic programming algorithm in both enumeration and pruning. We define ranking as an interesting property that triggers the generation of rank-aware query plans. Unlike traditional join operators, optimizing for rank-join operators depends on estimating the input cardinality of these operators. We introduce a probabilistic model for estimating the input cardinality, and hence the cost of a rank-join operator. To our knowledge, this paper is the first effort in estimating the needed input size for optimal rank aggregation algorithms. Costing ranking plans, although challenging, is key to the full integration of rank-join operators in real-world query processing engines. We experimentally evaluate our framework by modifying the query optimizer of an open-source database management system. The experiments show the validity of our framework and the accuracy of the proposed estimation model.

References

Sihem Amer-Yahia, SungRan Cho, and Divesh Srivastava. Tree pattern relaxation. In EDBT, 2002. Google ScholarDigital Library
J.C. Borda. M.émoire sur les élections au scrutin. Histoire de l'Académie Royale des Sciences, 1781.Google Scholar
Nicolas Bruno, Surajit Chaudhuri, and Luis Gravano. Top-k selection queries over relational databases: Mapping strategies and performance evaluation. TODS, 27(2), 2002. Google ScholarDigital Library
Nicolas Bruno, Luis Gravano, and Amelie Marian. Evaluating top-k queries over web-accessible databases. In ICDE, 2002.Google ScholarCross Ref
Michael J. Carey and Donald Kossmann. Reducing the braking distance of an SQL query engine. In VLDB. Google ScholarDigital Library
Michael J. Carey and Donald Kossmann. On saying "Enough already!" in SQL. In SIGMOD, 1997. Google ScholarDigital Library
Kevin Chen-Chuan Chang and Seung won Hwang. Minimal probing: supporting expensive predicates for top-k queries. In SIGMOD, 2002. Google ScholarDigital Library
Yuan-Chi Chang, Lawrence Bergman, Vittorio Castelli, Chung-Sheng Li, Ming-Ling Lo, and John R. Smith. The onion technique: indexing for linear optimization queries. In SIGMOD, pages 391--402, 2000. Google ScholarDigital Library
Surajit Chaudhuri, Luis Gravano, and Amelie Marian. Optimizing top-k selection queries over multimedia repositories. IEEE Transactions on Knowledge and Data Engineering, to appear. Google ScholarDigital Library
M.-J. Condorcet. Éssai sur l'application de l'analyse à la probabilité des décisions rendues à la puralité des voix, 1785.Google Scholar
Donko Donjerkovic and Raghu Ramakrishnan. Probabilistic optimization of top N queries. In VLDB, 1999. Google ScholarDigital Library
Cynthia Dwork, S. Ravi Kumar, Moni Naor, and D. Sivakumar. Rank aggregation methods for the web. In World Wide Web, 2001. Google ScholarDigital Library
Ronald Fagin. Combining fuzzy information from multiple systems. Journal of Computer and System Sciences (JCSS), 58(1), Feb 1999. Google ScholarDigital Library
Ronald Fagin, Amnon Lotem, and Moni Naor. Optimal aggregation algorithms for middleware. In PODS, Santa Barbara, California, May 2001. Google ScholarDigital Library
G. Graefe and W. J. McKenna. The volcano optimizer generator: Extensibility and efficient search. In ICDE, 1993. Google ScholarDigital Library
Goetz Graefe and David J. DeWitt. The exodus optimizer generator. In SIGMOD, 1987. Google ScholarDigital Library
Ulrich Güntzer, Wolf-Tilo Balke, and Werner Kießling. Optimizing multi-feature queries for image databases. In VLDB, 2000. Google ScholarDigital Library
Ulrich Güntzer, Wolf-Tilo Balke, and Werner Kießling. Towards efficient multi-feature queries in heterogeneous environments. In ITCC, 2001. Google ScholarDigital Library
Peter J. Haas and Joseph M. Hellerstein. Ripple joins for online aggregation. In SIGMOD, june 1999. Google ScholarDigital Library
W. Hong and M. Stonebraker. Optimization of parallel query execution plans in XPRS. Distributed and Parallel Databases, 1(1), Jan. 1993. Google ScholarDigital Library
Vagelis Hristidis, Luis Gravano, and Yannis Papakonstantinou. Efficient ir-style keyword search over relational databases. In VLDB, 2003. Google ScholarDigital Library
Vagelis Hristidis, Nick Koudas, and Yannis Papakonstantinou. Prefer: A system for the efficient execution of multi-parametric ranked queries. In SIGMOD, 2001. Google ScholarDigital Library
Ihab F. Ilyas, Walid G. Aref, and Ahmed K. Elmagarmid. Joining ranked inputs in practice. In VLDB, 2002. Google ScholarDigital Library
Ihab F. Ilyas, Walid G. Aref, and Ahmed K. Elmagarmid. Supporting top-k join queries in relational databases. In VLDB, 2003. Google ScholarDigital Library
Guy M. Lohman. Grammar-like functional rules for representing query optimization alternatives. In SIGMOD, 1988. Google ScholarDigital Library
Apostol Natsev, Yuan-Chi Chang, John R. Smith, Chung-Sheng Li, and Jeffrey S. Vitter. Supporting incremental join queries on ranked inputs. In VLDB, 2001. Google ScholarDigital Library
Surya Nepal and M. V. Ramakrishna. Query processing issues in image (multimedia) databases. In ICDE, 1999.Google ScholarCross Ref
P. G. Selinger, M. M. Astrahan, D. D. Chamberlin, R. A. Lorie, and T. G. Price. Access path election in a relational database management system. In SIGMOD, 1979. Google ScholarDigital Library
Panayiotis Tsaparas, Themistoklis Palpanas, Yannis Kotidis, Nick Koudas, and Divesh Srivastava. Ranked join indices. In ICDE, 2003.Google ScholarCross Ref
Annita N. Wilschut and Peter M. G. Apers. Dataflow query execution in a parallel main-memory environment. Distributed and Parallel Databases, 1(1), 1993. Google ScholarDigital Library

Rank-aware query optimization
1. Information systems
  1. Data management systems
    1. Database management system engines
  2. Information retrieval
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory

Recommendations

Adaptive rank-aware query optimization in relational databases

Rank-aware query processing has emerged as a key requirement in modern applications. In these applications, efficient and adaptive evaluation of top-k queries is an integral part of the application semantics. In this article, we introduce a rank-aware ...
Read More
Rank-aware query processing and optimization
Read More
Query Optimization for Ontology-Mediated Query Answering
WWW '24: Proceedings of the ACM on Web Conference 2024

Ontology-mediated query answering (OMQA) consists in asking database queries on knowledge bases (KBs); a KB is a set of facts called the KB's database, which is described by domain knowledge called the KB's ontology. A widely-investigated OMQA technique ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data
June 2004
988 pages
ISBN:1581138598
DOI:10.1145/1007568
Conference Chairs:
Arnd Christian König
Microsoft Research
,
Stefan Dessloch
University of Kaiserslautern, Germany
,
General Chair:
Patrick Valduriez
INRIA, France
,
Program Chair:
Gerhard Weikum
University of the Saarland
Copyright © 2004 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 13 June 2004
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate785of4,003submissions,20%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 101
  Total Citations
  View Citations
- 1,282
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Rank-aware query optimization

SIGMOD '04: Proceedings of the 2004 ACM SIGMOD international conference on Management of data

ABSTRACT

References

Cited By

Recommendations

Adaptive rank-aware query optimization in relational databases

Rank-aware query processing and optimization

Query Optimization for Ontology-Mediated Query Answering