|
ABSTRACT
This paper describes the Query Rewrite facility of the Starburst extensible database system, a novel phase of query optimization. We present a suite of rewrite rules used in Starburst to transform queries into equivalent queries for faster execution, and also describe the production rule engine which is used by Starburst to choose and execute these rules. Examples are provided demonstrating that these Query Rewrite transformations lead to query execution time improvements of orders of magnitude, suggesting that Query Rewrite in general—and these rewrite rules in particular—are an essential step in query optimization for modern database systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
ABC+76
|
M. M. Astrahan , M. W. Blasgen , D. D. Chamberlin , K. P. Eswaran , J. N. Gray , P. P. Griffiths , W. F. King , R. A. Lorie , P. R. McJones , J. W. Mehl , G. R. Putzolu , I. L. Traiger , B. W. Wade , V. Watson, System R: relational approach to database management, ACM Transactions on Database Systems (TODS), v.1 n.2, p.97-137, June 1976
[doi> 10.1145/320455.320457]
|
| |
Anf89
|
Ole Jirgen Anfindsen.A Study of Access Path Selection in DB2. Technical report, Norwegian Telecommunications Administration and University of Oslo, Norway, October 1989.
|
| |
BFKM85
|
|
| |
BTA90
|
Jose Blakeley, Craig Thompson, and Abdallah Alashqur. Strawman reference model for object query language. In First OODB Standardization Workshop, X3/SPARC/DBSSG/OODBTG, Atlantic City, New Jersey, May 1990.
|
| |
Day87
|
|
 |
GW87
|
|
| |
HCL+90
|
L. M. Haas , W. Chang , G. M. Lohman , J. McPherson , P. F. Wilms , G. Lapis , B. Lindsay , H. Pirahesh , M. J. Carey , E. Shekita, Starburst Mid-Flight: As the Dust Clears, IEEE Transactions on Knowledge and Data Engineering, v.2 n.1, p.143-160, March 1990
[doi> 10.1109/69.50910
]
|
| |
HH91
|
Joseph M. Hellerstein and Meichun Hsu. Determinism in Partially Ordered Production Systems. Research Report RJ 8089, IBM Almaden Research Center, March 1991.
|
| |
HP88
|
Waqar Hasan and Hamid Pirahesh. Query Rewrite Optimization in Starburst. Research Report RJ 6367, IBM Almaden Research Center, August 1988.
|
| |
HSS88
|
T. Harder , H. Schoning , A. Sikeler, Parallelism in processing queries on complex objects, Proceedings of the first international symposium on Databases in parallel and distributed systems, p.131-143, December 05-07, 1988, Austin, Texas, United States
|
| |
ISO91
|
ISO_ANSI. Database Language SQL ISOBEC 9075:1992, 1991.
|
 |
Kim82
|
|
 |
LLOW91
|
|
 |
LLPS91
|
Guy M. Lohman , Bruce Lindsay , Hamid Pirahesh , K. Bernhard Schiefer, Extensions to Starburst: objects, types, functions, and rules, Communications of the ACM, v.34 n.10, p.94-109, Oct. 1991
[doi> 10.1145/125223.125266]
|
| |
Loo86
|
Chris Loosley. Measuring IBM Database 2 Release 2 - The Benchmark Game. InfoDB, 1(2), 1986.
|
| |
MF78
|
J. McDermott and C. Forgy. Production System Conflict Resolution Strategies. In D.A. Waterman and Fredrick Hayes-Roth, editors, Pattern Directed Inference Systems, pages 177-199. Academic Press, 1978.
|
 |
MFPR90a
|
I. S. Mumick , S. J. Finkelstein , Hamid Pirahesh , Raghu Ramakrishnan, Magic is relevant, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.247-258, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
MFPR90b
|
Inderpal Singh Mumick , Sheldon J. Finkelstein , Hamid Pirahesh , Raghu Ramakrishnan, Magic conditions, Proceedings of the ninth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.314-330, April 02-04, 1990, Nashville, Tennessee, United States
[doi> 10.1145/298514.298584]
|
| |
MPR90
|
|
| |
O'N89
|
P. O'Neil. Revisiting DBMS Benchmarks. Datamation, pages 47-54, September 15, 1989.
|
| |
Pir89
|
Hamid Pirahesh. Early Experience with Rule-Based Query Rewrite Optimization. In G. Graefe, editor, Workshop on Database Query Optimization, CSE Technical Report 89-005. Oregon Graduate Center, May 1989.
|
| |
Pro90
|
Proc. ACM-SIGMOD International Conference on Management of Data, Atlantic City, May 1990.
|
| |
Ras90
|
Louiqa Raschid. Maintaining Consistency in a Stratified Production System Program. In Proc. AAAI National Conference on Artificial Intelligence, 1990.
|
 |
RGL90
|
Arnon Rosenthal , Cesar Galindo-Legaria, Query graphs, implementing trees, and freely-reorderable outerjoins, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.291-299, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
SAC+79
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
 |
SJGP90
|
Michael Stonebraker , Anant Jhingran , Jeffrey Goh , Spyros Potamianos, On rules, procedure, caching and views in data base systems, Proceedings of the 1990 ACM SIGMOD international conference on Management of data, p.281-290, May 23-26, 1990, Atlantic City, New Jersey, United States
|
 |
SWK76
|
|
| |
TOB89
|
C. Turbyfill, C. Orji, and Dina Bitton. AS3AP - A Comparative Relational Database Benchmark. In Proc. IEEE Compcon Spring '89, February 1989.
|
| |
ZH90
|
|
CITED BY 70
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Lewis Girod , Kyle Jamieson , Yuan Mei , Ryan Newton , Stanislav Rost , Arvind Thiagarajan , Hari Balakrishnan , Samuel Madden, WaveScope: a signal-oriented data stream management system, Proceedings of the 4th international conference on Embedded networked sensor systems, October 31-November 03, 2006, Boulder, Colorado, USA
|
|
H. V. Jagadish , Alberto O. Mendelzon , Inderpal Singh Mumick, Managing conflicts between rules (extended abstract), Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.192-201, June 04-06, 1996, Montreal, Quebec, Canada
|
|
|
R. Telford , R. Horman , S. Lightstone , N. Markov , S. O'Connell , G. Lohman, Usability and design considerations for an autonomic relational database management system, IBM Systems Journal, v.42 n.4, p.568-581, October 2003
|
|
|
K. Beyer , R. Cochrane , M. Hvizdos , V. Josifovski , J. Kleewein , G. Lapis , G. Lohman , R. Lyle , M. Nicola , F. Özcan , H. Pirahesh , N. Seemann , A. Singh , T. Truong , R. C. Van der Linden , B. Vickery , C. Zhang , G. Zhang, DB2 goes hybrid: integratng native XML and XQuery with relational data and SQL, IBM Systems Journal, v.45 n.2, p.271-298, January 2006
|
|
|
|
|
|
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, Simplification of outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.7, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
Tobias Kraft , Holger Schwarz , Ralf Rantzau , Bernhard Mitschang, Coarse-grained optimization: techniques for rewriting SQL statement sequences, Proceedings of the 29th international conference on Very large data bases, p.488-499, September 09-12, 2003, Berlin, Germany
|
|
Francis Chu , Joseph Y. Halpern , Praveen Seshadri, Least expected cost query optimization: an exercise in utility, Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, p.138-147, May 31-June 03, 1999, Philadelphia, Pennsylvania, United States
|
|
|
|
|
Benoit Dageville , Dinesh Das , Karl Dias , Khaled Yagoub , Mohamed Zait , Mohamed Ziauddin, Automatic SQL tuning in oracle 10g, Proceedings of the Thirtieth international conference on Very large data bases, p.1098-1109, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|
Rakesh Agrawal , Roberto Bayardo , Christos Faloutsos , Jerry Kiernan , Ralf Rantzau , Ramakrishnan Srikant, Auditing compliance with a Hippocratic database, Proceedings of the Thirtieth international conference on Very large data bases, p.516-527, August 31-September 03, 2004, Toronto, Canada
|
|
|
|
|
|
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, No regression algorithm for the enumeration of projections in SQL queries with joins and outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.8, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
|
|
Qi Cheng , Jarek Gryz , Fred Koo , T. Y. Cliff Leung , Linqi Liu , Xiaoyan Qian , K. Bernhard Schiefer, Implementation of Two Semantic Query Optimization Techniques in DB2 Universal Database, Proceedings of the 25th International Conference on Very Large Data Bases, p.687-698, September 07-10, 1999
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Marko Vrhovnik , Holger Schwarz , Oliver Suhre , Bernhard Mitschang , Volker Markl , Albert Maier , Tobias Kraft, An approach to optimize data processing in business processes, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
Gautam Bhargava , Piyush Goel , Balakrishna R. Iyer, No regression algorithm for the enumeration of projections in SQL queries with joins and outer joins, Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research, p.8, November 07-09, 1995, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
|
|
Jayavel Shanmugasundaram , Eugene Shekita , Rimon Barr , Michael Carey , Bruce Lindsay , Hamid Pirahesh , Berthold Reinwald, Efficiently publishing relational data as XML documents, The VLDB Journal — The International Journal on Very Large Data Bases, v.10 n.2-3, p.133-154, September 2001
|
|
|
Rafi Ahmed , Allison Lee , Andrew Witkowski , Dinesh Das , Hong Su , Mohamed Zait , Thierry Cruanes, Cost-based query transformation in Oracle, Proceedings of the 32nd international conference on Very large data bases, September 12-15, 2006, Seoul, Korea
|
|
|
|
|
|
|
|
|
|
|
|
|
Gene Fuh , Jyh-Herng Chow , Nelson Mattos , Brian Tran, Supporting procedural constructs in existing SQL compilers, Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research, p.11, November 12-14, 1996, Toronto, Ontario, Canada
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Andrew Witkowski , Srikanth Bellamkonda , Hua-Gang Li , Vince Liang , Lei Sheng , Wayne Smith , Sankar Subramanian , James Terry , Tsae-Feng Yu, Continuous queries in oracle, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
Lane B. Warshaw , Daniel P. Miranker, Rule-based query optimization, revisited, Proceedings of the eighth international conference on Information and knowledge management, p.267-275, November 02-06, 1999, Kansas City, Missouri, United States
|
|
|
|
|
|
|
Irina Botan , Donald Kossmann , Peter M. Fischer , Tim Kraska , Dana Florescu , Rokas Tamosevicius, Extending XQuery with window functions, Proceedings of the 33rd international conference on Very large data bases, September 23-27, 2007, Vienna, Austria
|
|
|
|
|
|
|
|
|
|
|
A. Balmin , T. Eliaz , J. Hornibrook , L. Lim , G. M. Lohman , D. Simmen , M. Wang , C. Zhang, Cost-based optimization in DB2 XML, IBM Systems Journal, v.45 n.2, p.299-319, January 2006
|
|
|
|
|
|
|
|
Kevin Beyer , Roberta J. Cochrane , Vanja Josifovski , Jim Kleewein , George Lapis , Guy Lohman , Bob Lyle , Fatma Özcan , Hamid Pirahesh , Normen Seemann , Tuong Truong , Bert Van der Linden , Brian Vickery , Chun Zhang, System RX: one part relational, one part XML, Proceedings of the 2005 ACM SIGMOD international conference on Management of data, June 14-16, 2005, Baltimore, Maryland
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|