|
ABSTRACT
We consider a distributed server system and ask which policy should be used for assigning jobs (tasks) to hosts. In our server, jobs are not preemptible. Also, the job's service demand is not known a priori. We are particularly concerned with the case where the workload is heavy-tailed, as is characteristic of many empirically measured computer workloads. We analyze several natural task assignment policies and propose a new one TAGS (Task Assignment based on Guessing Size). The TAGS algorithm is counterintuitive in many respects, including load unbalancing, non-work-conserving, and fairness. We find that under heavy-tailed workloads, TAGS can outperform all task assignment policies known to us by several orders of magnitude with respect to both mean response time and mean slowdown, provided the system load is not too high. We also introduce a new practical performance metric for distributed servers called server expansion. Under the server expansion metric, TAGS significantly outperforms all other task assignment policies, regardless of system load.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
|
| |
2
|
|
 |
3
|
Mark E. Crovella , Mor Harchol-Balter , Cristina D. Murta, Task assignment in a distributed system (extended abstract): improving performance by unbalancing load, Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, p.268-269, June 22-26, 1998, Madison, Wisconsin, United States
|
| |
4
|
|
| |
5
|
|
| |
6
|
Ephremides, A., Varaiya, P., and Walrand, J. 1980. A simple dynamic routing problem. IEEE Trans. Automat. Cont. AC-25, 4, 690--693.
|
| |
7
|
|
| |
8
|
|
| |
9
|
|
 |
10
|
|
| |
11
|
Irlam, G. 1994. Unix file size survey---1993. Available at http://www.base.com/gordoni/ ufs93.html.
|
| |
12
|
Khinchin, A. Y. 1932. Mathematical theory of stationary queues. Mat. Sbornik 39, 73--84.
|
| |
13
|
Koole, G., Sparaggis, P., and Towsley, D. 1999. Minimizing response times and queue lengths in systems of parallel queues. J. Appl. Prob. 36, 1185--1193.
|
| |
14
|
Leiserson, C. 1998a. The Pleiades alpha cluster at M.I.T. Documentation at: http://bonanza.lcs. mit.edu.
|
| |
15
|
Leiserson, C. 1998b. The Xolas supercomputing project at M.I.T. Documentation available at: http://xolas.lcs.mit.edu.
|
 |
16
|
|
 |
17
|
|
| |
18
|
|
| |
19
|
|
| |
20
|
|
| |
21
|
Peterson, D. L., and Adams, D. R. 1996. Fractal patterns in DASD I/O traffic. In CMG Proc.
|
| |
22
|
Pollaczek, F. 1930. Uber eine aufgabe dev wahrscheinlichkeitstheorie. I-II Math. Zeitschrift. 32, 64--100.
|
| |
23
|
The PSC's Cray J90's. 1998. http://www.psc.edu/machine/cray/j90/j90.html.
|
| |
24
|
. Supercomputing at the NAS facility. 1998. http://www.nas.nasa.gov/Technology/Supercomputing/.
|
| |
25
|
|
| |
26
|
|
 |
27
|
Anees Shaikh , Jennifer Rexford , Kang G. Shin, Load-sensitive routing of long-lived IP flows, Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication, p.215-226, August 30-September 03, 1999, Cambridge, Massachusetts, United States
|
| |
28
|
Sozaki, S., and Ross, R. 1978. Approximations in finite capacity multiserver queues with poisson arrivals. J. Appl. Prob. 13, 826--834.
|
| |
29
|
|
| |
30
|
Winston, W. 1977. Optimality of the shortest line discipline. J. Appl. Prob. 14, 181--189.
|
| |
31
|
Wolff, R. W. 1989. Stochastic Modeling and the Theory of Queues. Prentice-Hall, Englewood Cliffs, N.J.
|
CITED BY 11
|
|
|
|
|
|
|
|
|
|
|
Mor Harchol-Balter , Cuihong Li , Takayuki Osogami , Alan Scheller-Wolf , Mark S. Squillante, Cycle stealing under immediate dispatch task assignment, Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architectures, June 07-09, 2003, San Diego, California, USA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
INDEX TERMS
Primary Classification:
C.
Computer Systems Organization
C.1
PROCESSOR ARCHITECTURES
C.1.4
Parallel Architectures
Subjects:
Distributed architectures
Additional Classification:
C.
Computer Systems Organization
C.2
COMPUTER-COMMUNICATION NETWORKS
C.2.4
Distributed Systems
C.4
PERFORMANCE OF SYSTEMS
Subjects:
Design studies
D.
Software
D.4
OPERATING SYSTEMS
D.4.8
Performance
Subjects:
Queueing theory;
Modeling and prediction
General Terms:
Design,
Performance
Keywords:
Clusters,
contrary behavior,
distributed servers,
fairness,
heavy-tailed workloads,
high variance,
job scheduling,
load balancing,
load sharing,
supercomputing,
task assignment
REVIEW
"Seetharami R Seelam : Reviewer"
The author has done a great job in describing the “task assignment by guessing size” algorithm for high variability workloads. This work is clearly intended for researchers and practitioners in computing and advanced sciences. Readers
more...
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
|