| Evaluating the size of queries on relational databases with non-uniform distribution and stochastic dependence |
| Full text |
Pdf
(637 KB)
|
| Source
|
International Conference on Management of Data
archive
Proceedings of the 1989 ACM SIGMOD international conference on Management of data
table of contents
Portland, Oregon, United States
Pages: 8 - 14
Year of Publication: 1989
ISBN:0-89791-317-5
Also published in ...
|
|
Authors
|
|
Silvio Salza
|
Istituto di Analisi dei Sistemi ed Informatica de1 CNR, Viale Manzoni, 30 I-00185 Roma, Italy
|
|
Mario Terranova
|
Istituto di Analisi dei Sistemi ed Informatica de1 CNR, Viale Manzoni, 30 I-00185 Roma, Italy
|
|
| Sponsor |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 0, Downloads (12 Months): 25, Citation Count: 4
|
|
|
ABSTRACT
The paper deals with the problem of evaluating how the originality of the attributes of a relation, i.e. the number of distinct values in each attribute, is affected by relational operations that reduce the cardinality of the relation. This is indeed an interesting problem in research areas such as database design and query optimization. Some authors have shown that non uniform distributions and stochastic dependence significantly affect the originality of the attributes. Therefore the models that have been proposed in the literature, based on uniformity and independence assumptions, in several situation can not be conveniently utilized. In this paper we propose a probabilistic model that overcomes the need of the uniformity and independence assumptions. The model is exact for non uniform distributions when the attributes are independent, and gives approximate results when stochastic dependence is considered. In the latter case the analytical results have been compared with a simulation, and proved to be quite accurate.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
Philip A. Bernstein , Nathan Goodman , Eugene Wong , Christopher L. Reeve , James B. Rothnie, Jr., Query processing in a system for distributed databases (SDD-1), ACM Transactions on Database Systems (TODS), v.6 n.4, p.602-625, Dec. 1981
[doi> 10.1145/319628.319650]
|
| |
2
|
|
| |
3
|
S. Christodulakis. Estimating record selectivities. Information Systems, 8(2):105-115, 1983.
|
 |
4
|
|
 |
5
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
| |
6
|
A.R. Hevner and S. B. Yao. Query processing in distributed database systems. IEEE Transactions on Software Engineering, 5(3):177-187, 1979.
|
 |
7
|
|
 |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
|
| |
12
|
S. Salza and M. Terranova. Evaluating the cardinality of the result of relational operations" a probabilistic approach. In 6-th Advanced Database Symposium, Tokyo, pages 223-231, 1986.
|
| |
13
|
|
 |
14
|
|
| |
15
|
G.K. Zipf. Human Behaviour and the Principle of Least Effort. Addison Wesley Publ. Co., Reading, Massachussetts, 1949.
|
|