|
ABSTRACT
Most commercial database systems do (or should) exploit many sorting techniques that are publicly known, but not readily available in the research literature. These techniques improve both sort performance on modern computer systems and the ability to adapt gracefully to resource fluctuations in multiuser operations. This survey collects many of these techniques for easy reference by students, researchers, and product developers. It covers in-memory sorting, disk-based external sorting, and considerations that apply specifically to sorting in database systems.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
 |
1
|
|
| |
2
|
Alfred V. Aho , John E. Hopcroft , Jeffrey Ullman , J. D. Ullman , J. E. Hopcroft, Data Structures and Algorithms, Addison-Wesley Longman Publishing Co., Inc., Boston, MA, 1983
|
 |
3
|
|
| |
4
|
|
 |
5
|
Andrea C. Arpaci-Dusseau , Remzi H. Arpaci-Dusseau , David E. Culler , Joseph M. Hellerstein , David A. Patterson, High-performance sorting on networks of workstations, Proceedings of the 1997 ACM SIGMOD international conference on Management of data, p.243-254, May 11-15, 1997, Tucson, Arizona, United States
|
| |
6
|
|
 |
7
|
|
 |
8
|
|
 |
9
|
|
 |
10
|
|
 |
11
|
Peter M. Chen , Edward K. Lee , Garth A. Gibson , Randy H. Katz , David A. Patterson, RAID: high-performance, reliable secondary storage, ACM Computing Surveys (CSUR), v.26 n.2, p.145-185, June 1994
[doi> 10.1145/176979.176981]
|
 |
12
|
|
| |
13
|
Conner, W. M. 1977. Offset value coding. IBM Technical Disclosure Bulletin 20, 7, 2832--2837.
|
| |
14
|
|
 |
15
|
|
| |
16
|
Gassner, P., Lohman, G. M., Schiefer, K. B., and Wang, Y. 1993. Query optimization in the IBM DB2 family. IEEE Data Eng. Bulletin 16, 4, 4--18.
|
| |
17
|
|
 |
18
|
|
| |
19
|
Graefe, G. 2003. Sorting and indexing with partitioned B-Trees. In Proceedings of the Conference on Innovative Data Systems Research (CIDR). Asilomar, CA.
|
| |
20
|
Graefe, G. 2003b. Executing nested queries. In Proceedings of the Datenbanksysteme für Business, Technologie und Web (BTW) Conference. Leipzig, Germany, 58--77.
|
| |
21
|
|
| |
22
|
|
| |
23
|
Härder, T. 1977. A Scan-driven sort facility for a relational database system. In Proceedings of the Conference on Very Large Databases (VLDB). 236--244.
|
| |
24
|
Harizopoulos, S. and Ailamaki, A. 2003. A case for staged database systems. In Proceedings of the Conference on Innovative Data Systems Research (CIDR). Asilomar, CA.
|
| |
25
|
Hu, T. C. and Tucker, A. C. 1971. Optimal computer search trees and variable-length alphabetic codes. SIAM J. Appl. Math. 21, 4, 514--532.
|
| |
26
|
|
 |
27
|
Tom Keller , Goetz Graefe , David Maier, Efficient assembly for complex objects, Proceedings of the 1991 ACM SIGMOD international conference on Management of data, p.148-157, May 29-31, 1991, Denver, Colorado, United States
|
| |
28
|
|
| |
29
|
|
| |
30
|
|
| |
31
|
Kwan, S. C. and Baer, J.-L. 1985. The I/O performance of multiway mergesort and tag sort. IEEE Trans. Comput. 34, 4, 383--387.
|
| |
32
|
Larus, J. R. and Parkes, M. 2001. Using cohort scheduling to enhance server performance. Microsoft Research Tech. Rep. 39.
|
| |
33
|
Larson, P.-L. 2003. External sorting: Run formation revisited. IEEE Trans. Knowl. Data Eng. 15, 4, 961--972.
|
 |
34
|
|
 |
35
|
|
 |
36
|
Gurmeet Singh Manku , Sridhar Rajagopalan , Bruce G. Lindsay, Approximate medians and other quantiles in one pass and with limited memory, Proceedings of the 1998 ACM SIGMOD international conference on Management of data, p.426-435, June 01-04, 1998, Seattle, Washington, United States
|
| |
37
|
McJones, P. Ed. 1997. The 1995 SQL reunion: People, projects, and politics. SRC Tech. Note 1997-018, Digital Systems Research Center. Palo Alto, CA.
|
 |
38
|
|
| |
39
|
|
| |
40
|
|
| |
41
|
|
 |
42
|
|
 |
43
|
|
| |
44
|
|
 |
45
|
P. Griffiths Selinger , M. M. Astrahan , D. D. Chamberlin , R. A. Lorie , T. G. Price, Access path selection in a relational database management system, Proceedings of the 1979 ACM SIGMOD international conference on Management of data, May 30-June 01, 1979, Boston, Massachusetts
[doi> 10.1145/582095.582099]
|
 |
46
|
Praveen Seshadri , Joseph M. Hellerstein , Hamid Pirahesh , T. Y. Cliff Leung , Raghu Ramakrishnan , Divesh Srivastava , Peter J. Stuckey , S. Sudarshan, Cost-based optimization for magic: algebra and implementation, Proceedings of the 1996 ACM SIGMOD international conference on Management of data, p.435-446, June 04-06, 1996, Montreal, Quebec, Canada
|
| |
47
|
|
| |
48
|
Stonebraker, M. and Kumar, A. 1986. Operating system support for data management. IEEE Database Eng. Bulletin 9, 3, 43--50.
|
 |
49
|
|
| |
50
|
|
| |
51
|
|
 |
52
|
Chun Zhang , Jeffrey Naughton , David DeWitt , Qiong Luo , Guy Lohman, On supporting containment queries in relational database management systems, Proceedings of the 2001 ACM SIGMOD international conference on Management of data, p.425-436, May 21-24, 2001, Santa Barbara, California, United States
|
| |
53
|
|
|