ACM Home Page
Please provide us with feedback. Feedback
Disk allocation for Cartesian product files on multiple-disk systems
Full text PdfPdf (1.44 MB)
Source ACM Transactions on Database Systems (TODS) archive
Volume 7 ,  Issue 1  (March 1982) table of contents
Pages: 82 - 101  
Year of Publication: 1982
ISSN:0362-5915
Authors
H. C. Du  Univ. of Minnesota, Minneapolis
J. S. Sobolewski  Univ. of Washington, Seattle
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 3,   Downloads (12 Months): 26,   Citation Count: 52
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/319682.319698
What is a DOI?

ABSTRACT

Cartesian product files have recently been shown to exhibit attractive properties for partial match queries. This paper considers the file allocation problem for Cartesian product files, which can be stated as follows: Given a k-attribute Cartesian product file and an m-disk system, allocate buckets among the m disks in such a way that, for all possible partial match queries, the concurrency of disk accesses is maximized. The Disk Modulo (DM) allocation method is described first, and it is shown to be strict optimal under many conditions commonly occurring in practice, including all possible partial match queries when the number of disks is 2 or 3. It is also shown that although it has good performance, the DM allocation method is not strict optimal for all possible partial match queries when the number of disks is greater than 3. The General Disk Modulo (GDM) allocation method is then described, and a sufficient but not necessary condition for strict optimality of the GDM method for all partial match queries and any number of disks is then derived. Simulation studies comparing the DM and random allocation methods in terms of the average number of disk accesses, in response to various classes of partial match queries, show the former to be significantly more effective even when the number of disks is greater than 3, that is, even in cases where the DM method is not strict optimal. The results that have been derived formally and shown by simulation can be used for more effective design of optimal file systems for partial match queries. When considering multiple-disk systems with independent access paths, it is important to ensure that similar records are clustered into the same or similar buckets, while similar buckets should be dispersed uniformly among the disks.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
3
 
4
LIN, W.C., LEE, R.C.T., AND DU, H.C. Common properties of some multi-attribute file systems. IEEE Trans. Software Eng. SE-5, 2 (March 1979}, 160-174.
 
5
LIou, J.H., AND YAO, S.B. Multi-dimensional clustering for data base organizations. Information Syst. 2 (1977) 187-198.
 
6
RIVEST, R.L. Partial-match retrieval algorithms. SIAM J. Comput. 15, 1 (March 1976), 19-50.
7

CITED BY  52
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Collaborative Colleagues:
H. C. Du: colleagues
J. S. Sobolewski: colleagues

Peer to Peer - Readers of this Article have also read: