|
ABSTRACT
Secondary indexes are often used in database management systems for secondary key retrieval. Although their use can improve retrieval time significantly, the cost of index maintenance and storage increases the overhead of the file processing application. The optimal set of indexed secondary keys for a particular application depends on a number of application dependent factors. In this paper a cost function is developed for the evaluation of candidate indexing choices and applied to the optimization of index selection. Factors accounted for include file size, the relative rates of retrieval and maintenance and the distribution of retrieval and maintenance over the candidate keys, index structure, and system charging rates. Among the results demonstrated are the increased effectiveness of secondary indexes for large files, the effect of the relative rates of retrieval and maintenance, the greater cost of allowing for arbitrarily formulated queries, and the impact on cost of the use of different index structures.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
BOOKMAN, P.J. Make your users pay the price. Computer Decisions 4 (Sept. 1972), 28-31.
|
 |
3
|
|
| |
4
|
CODASYL SYSTEMS COMMITTEE. A survey of generalized data base management systems. ACM, New York, May 1969.
|
 |
5
|
|
| |
6
|
DELOBEL, C. Determination of an optimal set of secondary keys for formatted files. ONLINE 72, Int. Symp. and Exhib. of Online Interactive Comptng, Brunel U., Uxbridge, England, Sept. 1972.
|
| |
7
|
IBM System/370 Model 155 Functional Characteristics. GA22-6942-1, IBM Corp., White Plains, N.Y., 1971.
|
| |
8
|
JONES, W.J. Syracuse University Computing Center charges. Document C64-0495, Syracuse U. Comptng. Ctr. Inform. SET., Syracuse U., Syracuse, N.Y., Feb. 1972.
|
| |
9
|
KING~ W.F. On the Selection of indices for a file. IBM Res. Rep. RJ 1341, IBM Res. Lab., San Jose, Calif., Jan. 1974.
|
| |
10
|
KNUTtt, D.E. The Art of Computer Programming, Vol. 3: Searching and Sorting. Addison Wesley, Reading, Mass., 1973.
|
 |
11
|
|
 |
12
|
|
 |
13
|
|
 |
14
|
|
| |
15
|
WEBB, D.A. Evaluation of hash coding systems. Ph.D. Th., SIS Dept., Syracuse U., Syracuse, N.Y., Aug. 1972.
|
INDEX TERMS
Primary Classification:
H.
Information Systems
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.1
Content Analysis and Indexing
Subjects:
Indexing methods
Additional Classification:
H.
Information Systems
H.2
DATABASE MANAGEMENT
H.2.2
Physical Design
Subjects:
Access methods
H.3
INFORMATION STORAGE AND RETRIEVAL
H.3.2
Information Storage
Subjects:
File organization
General Terms:
Design,
Management
Keywords:
Boolean query,
access methods,
access path,
cost function,
data management,
database,
file design,
file organization,
inverted file,
inverted index,
maintenance,
optimization,
retrieval,
secondary index,
secondary key,
secondary key access
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|