|
ABSTRACT
Automatic physical database design tools rely on "what-if" interfaces to the query optimizer to estimate the execution time of the training query workload under different candidate physical designs. The tools use these what-if interfaces to recommend physical designs that minimize the estimated execution time of the input training workload. In this paper, we argue that minimizing estimated execution time alone can lead to designs with inherent problems. In particular, if the optimizer makes an error in estimating the execution time of some workload queries, then the recommended physical design may actually harm the workload instead of benefiting it. In this sense, the physical design is risky. Moreover, if the production queries are slightly different from the training queries, the recommended physical design may not benefit them at all. In this sense, the physical design is not general. We define Risk and Generality as two new metrics to evaluate the quality of a proposed physical database design, and we show one way of extending the objective function being optimized by a generic physical design advisor to take these measures into account. We have implemented a physical design advisor in PostgreSQL, and we use it to experimentally demonstrate the usefulness of our approach. We show that our two new metrics result in physical designs that are more robust, which means that the user can implement them with a higher degree of confidence. This is particularly important as we move towards truly zero-administration database systems in which there is not the possibility for a DBA to vet the recommendations of the physical design tool before applying them.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
PostgreSQL. http://www.postgresql.org.
|
 |
2
|
|
| |
3
|
A. Aboulnaga , P. Haas , M. Kandil , S. Lightstone , G. Lohman , V. Markl , I. Popivanov , V. Raman, Automated statistics collection in DB2 UDB, Proceedings of the Thirtieth international conference on Very large data bases, p.1158-1169, August 31-September 03, 2004, Toronto, Canada
|
| |
4
|
S. Agrawal, S. Chaudhuri, L. Kollàr, A. P. Marathe, V. R. Narasayya, and M. Syamala. Database tuning advisor for Microsoft SQL Server 2005. In Proc. VLDB, 2004.
|
| |
5
|
|
 |
6
|
|
 |
7
|
|
 |
8
|
|
 |
9
|
|
| |
10
|
N. Bruno and S. Chaudhuri. Efficient creation of statistics over query expressions. In Proc. ICDE, 2003.
|
 |
11
|
|
 |
12
|
|
| |
13
|
N. Bruno and S. Chaudhuri. Physical design refinement: The "merge-reduce" approach. In Proc. EDBT, 2006.
|
| |
14
|
|
| |
15
|
N. Bruno and S. Chaudhuri. An online approach to physical design tuning. In Proc. ICDE, 2007.
|
 |
16
|
Nicolas Bruno , Surajit Chaudhuri , Luis Gravano, STHoles: a multidimensional workload-aware histogram, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.211-222, May 21-24, 2001, Santa Barbara, California, United States
|
 |
17
|
|
 |
18
|
|
| |
19
|
|
| |
20
|
Benoit Dageville , Dinesh Das , Karl Dias , Khaled Yagoub , Mohamed Zait , Mohamed Ziauddin, Automatic SQL tuning in oracle 10g, Proceedings of the Thirtieth international conference on Very large data bases, p.1098-1109, August 31-September 03, 2004, Toronto, Canada
|
| |
21
|
K. El Gebaly. Robustness in automatic physical database design. Master's thesis, University of Waterloo, 2007. Also available as University of Waterloo Computer Science Technical Report CS-2007-29.
|
| |
22
|
|
| |
23
|
|
| |
24
|
Transaction Processing Performance Council. TPC-H, the Ad-hoc Decision Support Benchmark. http://www.tpc.org.
|
| |
25
|
|
| |
26
|
Daniel C. Zilio , Jun Rao , Sam Lightstone , Guy Lohman , Adam Storm , Christian Garcia-Arellano , Scott Fadden, DB2 design advisor: integrated automatic physical database design, Proceedings of the Thirtieth international conference on Very large data bases, p.1087-1097, August 31-September 03, 2004, Toronto, Canada
|
|