ACM Home Page
Please provide us with feedback. Feedback
Physical database design for relational databases
Full text PdfPdf (2.99 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 13 ,  Issue 1  (March 1988) table of contents
Pages: 91 - 128  
Year of Publication: 1988
ISSN:0362-5915
Authors
S. Finkelstein  IBM Almaden Research Center, San Jose, CA
M. Schkolnick  IBM Thomas J. Watson Research Center, Yorktown Heights, NY
P. Tiberio  Univ. of Bologna, Bologna, Italy
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 15,   Downloads (12 Months): 177,   Citation Count: 35
Additional Information:

abstract   references   cited by   index terms   review   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/42201.42205
What is a DOI?

ABSTRACT

This paper describes the concepts used in the implementation of DBDSGN, an experimental physical design tool for relational databases developed at the IBM San Jose Research Laboratory. Given a workload for System R (consisting of a set of SQL statements and their execution frequencies), DBDSGN suggests physical configurations for efficient performance. Each configuration consists of a set of indices and an ordering for each table. Workload statements are evaluated only for atomic configurations of indices, which have only one index per table. Costs for any configuration can be obtained from those of the atomic configurations. DBDSGN uses information supplied by the System R optimizer both to determine which columns might be worth indexing and to obtain estimates of the cost of executing statements in different configurations. The tool finds efficient solutions to the index-selection problem; if we assume the cost estimates supplied by the optimizer are the actual execution costs, it finds the optimal solution. Optionally, heuristics can be used to reduce execution time. The approach taken by DBDSGN in solving the index-selection problem for multiple-table statements significantly reduces the complexity of the problem. DBDSGN's principles were used in the Relational Design Tool (RDT), an IBM product based on DBDSGN, which performs design for SQL/DS, a relational system based on System R. System R actually uses DBDSGN's suggested solutions as the tool expects because cost estimates and other necessary information can be obtained from System R using a new SQL statement, the EXPLAIN statement. This illustrates how a system can export a model of its internal assumptions and behavior so that other systems (such as tools) can share this model.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
 
2
ASTRAHAN, M. M., KIM, W., AND SCHKOLNICK, M. Performance of the System R access path selection mechanism. In Info Processing 80: Proceedings of the IFIP Congress 80 (Tokyo, Oct. 1980), pp. 487-491.
 
3
4
 
5
BLASGEN, M. W., ASTRAHAN, M. M., CHAMBERLIN, D. D., GRAY, J. N., KING, W. F., LINDSAY, B. G., LORIE, R. A., MEHL, J. W., PRICE, T. O., PUTZOLU, G. R., SCHKOLNICK, M., SELINGER, P. G., SLUTZ, P. R., STRONG, H. R., TRAIGER, I. L., WADE, B. W., AND YOST, R.A. System R: An architectural overview. IBM Syst. J. 20, 1 (1981), 41-62.
 
6
BONANNO, R., MAIO, D., AND TIBERIO, P. Secondary index selection in relational database physical design. Comput. J. 28, 4 (Aug. 1985), 398-405.
 
7
BONFATTI, F., MAIO, D., AND TIBERIO, P. A separability-based method for secondary index selection in physical database design. In Methodology and Tools/or Data Base Design, S. Ceri, Ed. Elsevier North-Holland, New York, 1983, pp. 148-160.
8
 
9
CHAMBERLIN, D. D., ASTRAHAN, M. M., ESWARAN, K. P., GRIFFITHS, P. P., LORIE, R. A., MEHL, J. W., REISNER, P., AND WADE, B. W. SEQUEL2: A unified approach to data definition, manipulation and control. IBM J. Res. Dev. 20, 6 (Nov. 1976), 560-575.
10
11
12
13
14
15
16
 
17
HATZOPOULOUS, M., AND KOLLIAS, J.Y. On the selection of a reduced set of indexes. Comput. J. 28, 4 (Aug. 1985), 406-408.
18
 
19
IBM. Relational Design Tool--Structured Query Language/Data System. Ref. Man. SH20- 6415-1, IBM, San Jose, Calif., 1985.
 
20
IBM. 8QL-data system application programming. Man. SH24-5018-2, IBM, San Jose, Calif., Aug. 1983.
 
21
IBM. SQL-data system planning and administration. Man. SH24-5014-1, IBM, San Jose, Calif., Aug. 1983.
 
22
IBM SYSTEMS JOURNAL. Database 2. IBM Syst. d. 23, 2 (1984), 98-218.
 
23
IP, M. Y. L., SAXTON, L. V., AND RAGHAVAN, V. V. On the selection of an optimal set of
24
 
25
KING, W.F. On the selection of indices for a file. Res. Rep. RJ1641, IBM, San Jose, Calif., Jan. 1974.
 
26
KOLLIAS, J.G. A heuristic approach for determining the optimal degree of file inversion. Inf. Syst. 4, 1 (1979), 307-318.
27
 
28
PUTKONEN, A. On the selection of access paths in inverted database organizations, inf. Syst. 4, 3 {1979), 219-225.
 
29
RELATIONAL TECHNOLOGY. An Introduction to INGRES. Realtional Technology. Berkeley Calif., Jan. 1984.
 
30
SCHKOLNICK, M. The optimal selection of secondary indices for files. Inf. Syst. 1 (1975), 141-146.
 
31
SCHKOLNICK, M. A survey of physical database methodologT and techniques. In Proceedings of the Very Large Database Conference (Berlin, Sept. 1978), pp. 474-487.
 
32
SCHKOLN{CK, M., AND TIBERZO, P. Considerations in developing a design tool for a relational DBMS. In Proceedings of the IEEE COMPSAC Con{erence (Chicago, Ill., Nov. 1979). IEEE Press, New York, 1979, pp. 228-235.
33
 
34
 
35
SELINGER, P. G., ASTRAHAN, M. M., CHAMBERLIN, D. D., LORIE, R. A., AND PRICE, T.G. Access (Boston, Mass., May 1979). ACM, New York, 1979, pp. 23-34.
 
36
STONEBRAKER, M. The choice of partial inversions and combined indices. Int. J. Comput. In{. Sci. 3, 2 (June 1974), 167-188.
37
 
38
WHANG, K, J. A physical database design methodology using the property of separability. Rep. STAN-CS-83-968, Computer Science Dept., Stanford Univ., Stanford, Calif., May 1983.
 
39
WHANG, K, Y., WIEDERHOLD, G., AND SAGALOWITZ, D. Separability--An approach to physical database design. IEEE Trans. Comput. 33, 3 (Mar. 1984), 209-222.
40
 
41
YAO, S.B. Database storage structure design. In Proceedings of the Conference on Database Engineering (Amagi, Japan, Nov. 1979). IBM, San Jose, Calif., 1979, pp. 1-22.
 
42
YoussgF!, K._, AND WONG~ E. Query processing in a relational database management, system. In Proceedings of the Very Large Database Conference (Rio de Janeiro, Oct. 1979). pp. 409-417.

CITED BY  35
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 


REVIEW

"Clement R. Attanasio : Reviewer"

This paper describes DBDSGN, a research effort which led to the IBM product relational design tool (RDT). RDT accepts as input definitions of relational tables to be used in the environment of the IBM relational database product SQL/DS and a set  more...

Collaborative Colleagues:
S. Finkelstein: colleagues
M. Schkolnick: colleagues
P. Tiberio: colleagues

Peer to Peer - Readers of this Article have also read: