|
ABSTRACT
We investigate the performance of some of the best-known object clustering algorithms on four different workloads based upon the tektronix benchmark. For all four workloads, stochastic clustering gave the best performance for a variety of performance metrics. Since stochastic clustering is computationally expensive, it is interesting that for every workload there was at least one cheaper clustering algorithm that matched or almost matched stochastic clustering. Unfortunately, for each workload, the algorithm that approximated stochastic clustering was different. Our experiments also demonstrated that even when the workload and object graph are fixed, the choice of the clustering algorithm depends upon the goals of the system. For example, if the goal is to perform well on traversals of small portions of the database starting with a cold cache, the important metric is the per-traversal expansion factor, and a well-chosen placement tree will be nearly optimal; if the goal is to achieve a high steady-state performance with a reasonably large cache, the appropriate metric is the number of pages to which the clustering algorithm maps the active portion of the database. For this metric, the PRP clustering algorithm, which only uses access probabilities achieves nearly optimal performance.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
And+90
|
T. Lousenia Anderson et al. The Tektronix HyperModel Benchmark. EDBT, 1990.
|
| |
Bai89
|
|
| |
BD90
|
Veronique Benzaken and Claude Delobel. Enhancing performance in a persistent object store: Clustering strategies in 02. Technical Report 50-90, Altair, August 1990.
|
 |
Den68
|
|
 |
Deu+91
|
|
| |
DK90
|
|
| |
HBD91
|
Gilbert Harrus, Veronique Benzaken, and Claude Delobel. Measuring performance of clustering strategies: the CluB-0 benchmark. Technical Report 66-91, Altair, January 1991.
|
 |
HK89
|
|
| |
KL70
|
B.W. Kernighan and S. Lin. An efficient heuristic procedure for partitioning graphs. Bell System Technical Journal, 49(2):291-307, February 1970.
|
| |
PS82
|
|
| |
PZ91
|
|
| |
RC89
|
|
| |
SS90
|
Karen Shannon and Richard Snodgrass. Semantic clustering. In Proceedings of the $-th Int'l Workshop in Persistent Object Systems, pages 361-374, Martha's Vineyard, MA, September 1990.
|
 |
Sta84
|
|
 |
TN91a
|
|
| |
TN91b
|
Manolis M. Tsangaris and Jeffrey F. Naughton. A stochastic approach for clustering in object stores. Technical Report 1017, University of Wisconsin, Computer Sciences Dept., April 1991.
|
| |
TN92
|
Manolis M. Tsangaris and Jeffrey F. Naughton. On the performance of object clustering techniques. Technical report, University of Wisconsin, Computer Sciences Dept., March 1992.
|
 |
YSLS85
|
|
 |
YW73
|
|
CITED BY 31
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Kien A. Hua , S. D. Lang , Wen K. Lee, A decomposition-based simulated annealing technique for data clustering, Proceedings of the thirteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems, p.117-128, May 24-27, 1994, Minneapolis, Minnesota, United States
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
B. Liskov , A. Adya , M. Castro , S. Ghemawat , R. Gruber , U. Maheshwari , A. C. Myers , M. Day , L. Shrira, Safe and efficient sharing of persistent objects in Thor, ACM SIGMOD Record, v.25 n.2, p.318-329, June 1996
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|