skip to main content
10.1145/1248377.1248384acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
Article

Scheduling DAGs on asynchronous processors

Published: 09 June 2007 Publication History

Abstract

This paper addresses the problem of scheduling a DAG of unit-length tasks on asynchronous processors, that is, processors having different and changing speeds. The objective is to minimize the makespan, that is, the time to execute the entire DAG. Asynchrony is modeled by an oblivious adversary, which is assumed to determine the processor speeds at each point in time. The oblivious adversary may change processor speeds arbitrarily and arbitrarily often, but makes speed decisions independently of any random choices of the scheduling algorithm.
This paper gives bounds on the makespan of two randomized online firing-squad scheduling algorithms, All and Level. These two schedulers are shown to have good makespan even when asynchrony is arbitrarily extreme. Let W and D denote, respectively, the number of tasks and the longest path in the DAG, and let πave denote the average speed of the p processors during the execution.
In All each processor repeatedly chooses a random task to execute from among all ready tasks (tasks whose predecessors have been executed). Scheduler All is shown to have a makespan Tp=
Θ(W<over>pπave), when W<over>Dp log p
Θ((log p)α W<over>pπave + (log p) 1-α D<over>πave), when W<over>D= p(log p)1-2α, for α ∈ [0, 1]
Θ (D<over>πave, when W<over>Dp<over>log p,
both expected and with high probability. A family of DAGs is exhibited for which this analysis is tight.
In Level each of the processors repeatedly chooses a random task to execute from among all critical tasks (ready tasks at the lowest level of the DAG). This second scheduler is shown to have a makespan of.

References

[1]
D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky, and D. Werthimer. Seti@home: an experiment in public-resource computing. Commun. ACM, 45(11):56--61, 2002.
[2]
Y. Aumann, M. A. Bender, and L. Zhang. Efficient execution of nondeterministic parallel programs on asynchronous systems. In Proceedings of the 8th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), pages 270--276, 1996.
[3]
Y. Aumann, M. A. Bender, and L. Zhang. Efficient execution of nondeterministic parallel programs on asynchronous systems. Information and Computation, 139(1):1--16, 25 Nov. 1997.
[4]
Y. Aumann, Z. M. Kedem, K. V. Palem, and M. O. Rabin. Highly efficient asynchronous execution of large-grained parallel programs. In Proceedings of the 34th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 271--280, 1993.
[5]
Y. Aumann and M. O. Rabin. Clock construction in fully asynchronous parallel systems and pram simulation. Theoretical Computer Science, 128:3--30, 1994.
[6]
A. Baratloo, P. Dasgupta, V. Karamcheti, and Z. M. Kedem. Metacomputing with milan. In Proceedings of the Eighth Heterogeneous Computing Workshop (HCW), page 169, 1999.
[7]
A. Baratloo, P. Dasgupta, and Z. M. Kedem. Calypso: A novel software system for fault-tolerant parallel processing on distributed platforms. In Proceedings of the 4th International Symposium on High Performance Distributed Computing (HPDC), pages 122--129, 1995.
[8]
A. Baratloo, M. Karaul, Z. M. Kedem, and P. Wijckoff. Charlotte: metacomputing on the web. 9th International Conference on Parallel and Distributed Computing Systems (PDCS), 1996.
[9]
M. A. Bender and M. O. Rabin. Scheduling Cilk multithreaded computations on processors of different speeds. In Proceedings of the 12th Annual ACM Symposium on Parallel Algorithms and Architectures (SPAA), pages 13--21, July 2000.
[10]
M. A. Bender and M. O. Rabin. Online scheduling of parallel programs on heterogeneous systems with applications to Cilk. Theory of Computing Systems Special Issue on SPAA00, 35:289--304, 2002.
[11]
R. D. Blumofe and C. E. Leiserson. Scheduling multithreaded computations by work stealing. In Proceedings of the 35th Annual Symposium on Foundations of Computer Science (FOCS), pages 356--368, Santa Fe, New Mexico, Nov. 1994.
[12]
R. P. Brent. The parallel evaluation of general arithmetic expressions. Journal of the ACM, 21(2):201--206, April 1974.
[13]
C. Chekuri and M. A. Bender. An efficient approximation algorithm for minimizing makespan on uniformly related machines. In Proceedings of the 6th Conference on Integer Programming and Combinatorial Optimization (IPCO), volume 1412, pages 383--393, 1998.
[14]
C. Chekuri and M. A. Bender. An efficient approximation algorithm for minimizing makespan on uniformly related machines. Journal of Algorithms, 41:212--224, 2001.
[15]
F. A. Chudak and D. B. Shmoys. Approximation algorithms for precedence-constrained scheduling problems on parallel machines that run at different speeds (extended abstract). In Proceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 581--590, New Orleans, Louisiana, 5--7 Jan. 1997.
[16]
F. A. Chudak and D. B. Shmoys. Approximation algorithms for precedence-constrained scheduling problems on parallel machines that run at different speeds. Journal of Algorithms, 30(2):323--343, February 1999.
[17]
R. Cole and O. Zajicek. The expected advantage of asynchrony. In Proc. of the ACM Symposium on Parallel Architectures and Algorithms, pages 85--94, 1989.
[18]
I. Foster and C. Kesselman. Globus: A metacomputing infrastructure toolkit. The International Journal of Supercomputer Applications and High Performance Computing, 11(2):115--128, 1997.
[19]
M. Frigo, C. E. Leiserson, and K. H. Randall. The implementation of the cilk-5 multithreaded language. In Proceedings of the ACM SIGPLAN 1998 Conference on Programming Language Design and Implementation (PLDI), pages 212--223, 1998.
[20]
P. B. Gibbons. A more practical PRAM model. In Proc. of the 1st ACM Symposium on Parallel Architectures and Algorithms, pages 158--168, June 1989.
[21]
R. L. Graham. Bounds on multiprocessing timing anomalies. SIAM Journal on Applied Mathematics, 17(2):416--429, Mar. 1969.
[22]
J. M. Jaffe. An analysis of preemptive multiprocessor job scheduling. Mathematics of Operations Research, 5(3):415--421, Aug. 1980.
[23]
J. M. Jaffe. Efficient scheduling of tasks without full use of processor resources. Theoretical Computer Science, 12:1--17, Aug. 1980.
[24]
P. C. Kanellakis and A. A. Shvartsman. Efficient parallel algorithms can be made robust. Distributed Computing, 5(4):201--217, 1992.
[25]
Z. M. Kedem, K. V. Palem, M. O. Rabin, and A. Raghunathan. Efficient program transformation for resilient parallel computation via randomization. In Proceedings of the 24th Annual ACM Symposium on the Theory of Computing (STOC), pages 306--317, May 1992.
[26]
Z. M. Kedem, K. V. Palem, A. Raghunathan, and P. G. Spirakis. Combining tentative and definite executions for very fast dependable parallel computing. In Proceedings of the 23rd Annual ACM Symposium on Theory of Computing (STOC), pages 381--390, May 1991.
[27]
Z. M. Kedem, K. V. Palem, and P. G. Spirakis. Efficient robust parallel computations. In Proceedings of the 22rd Annual ACM Symposium on Theory of Computing (STOC), pages 138--148, May 1990.
[28]
S. C. Kontogiannis, G. E. Pantziou, P. G. Spirakis, and M. Yung. Robust parallel computations through randomization. Theory of Computing Systems, 33(5/6):427--464, 2000.
[29]
G. Malewicz. Parallel scheduling of complex dags under uncertainty. In Proceedings of the 17th Ann. ACM Symp. Parallelism in Algorithms and Architectures (SPAA), pages 66--75, 2005.
[30]
C. Martel, A. Park, and R. Subramonian. Asynchronous PRAMs are (almost) as good as synchronous PRAMs. In Proceedings of the 31st Annual Symposium on the Foundations of Computer Science (FOCS), pages 590--599, 1990.
[31]
M. O. Neary and P. Cappello. Advanced eager scheduling for java-based adaptive parallel computing: Research articles. Concurrency and Computation: Practice and Experience, 17(7-8):797--819, 2005.
[32]
N. Nishimura. Asynchronous shared memory parallel computation. In Proc. of the 2nd ACM Symposium on Parallel Architectures and Algorithms, pages 76--84, 1990.
[33]
A. Panconesi and A. Srinivasan. Randomized distributed edge coloring via an extension of the Chernoff-Hoeffding bounds. SIAM J. Comput., 26(2):350--368, 1997.
[34]
A. Srinivasan. Distributions on level-sets with applications to approximation algorithms. In Proceedings of the 42 Annual IEEE Symposium on Foundations of Computer Science (FOCS), pages 588--597, 2001.
[35]
J. Ullman. NP-complete scheduling problems. Journal Computing System Science, 10:384--393, 1975.

Cited By

View all
  • (2015)An AREA-Oriented Heuristic for Scheduling DAGs on Volatile Computing PlatformsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.234618926:8(2164-2177)Online publication date: 1-Aug-2015
  • (2015)On constructing DAG‐schedules with large areasConcurrency and Computation: Practice and Experience10.1002/cpe.356027:16(4107-4121)Online publication date: 25-Jun-2015
  • (2014)On Constructing DAG-Schedules with Large AREAsEuro-Par 2014 Parallel Processing10.1007/978-3-319-09873-9_52(620-631)Online publication date: 2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SPAA '07: Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
June 2007
376 pages
ISBN:9781595936677
DOI:10.1145/1248377
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. asynchronous parallel computing
  2. firing-squad scheduling
  3. online scheduling
  4. precedence-constrained scheduling

Qualifiers

  • Article

Conference

SPAA07

Acceptance Rates

Overall Acceptance Rate 447 of 1,461 submissions, 31%

Upcoming Conference

SPAA '25
37th ACM Symposium on Parallelism in Algorithms and Architectures
July 28 - August 1, 2025
Portland , OR , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2015)An AREA-Oriented Heuristic for Scheduling DAGs on Volatile Computing PlatformsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2014.234618926:8(2164-2177)Online publication date: 1-Aug-2015
  • (2015)On constructing DAG‐schedules with large areasConcurrency and Computation: Practice and Experience10.1002/cpe.356027:16(4107-4121)Online publication date: 25-Jun-2015
  • (2014)On Constructing DAG-Schedules with Large AREAsEuro-Par 2014 Parallel Processing10.1007/978-3-319-09873-9_52(620-631)Online publication date: 2014
  • (2013)On the Sublinear Processor Gap for Parallel ArchitecturesTheory and Applications of Models of Computation10.1007/978-3-642-38236-9_18(193-204)Online publication date: 2013
  • (2012)Improving performance of adaptive component-based dataflow middlewareParallel Computing10.1016/j.parco.2012.03.00538:6-7(289-309)Online publication date: 1-Jun-2012
  • (2012)On scheduling dag s for volatile computing platformsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2012.06.00772:10(1347-1360)Online publication date: 1-Oct-2012
  • (2012)A dynamic rescheduling algorithm for resource management in large scale dependable distributed systemsComputers & Mathematics with Applications10.1016/j.camwa.2012.02.06663:9(1409-1423)Online publication date: 1-May-2012
  • (2010)Automatic dataflow application tuning for heterogeneous systems2010 International Conference on High Performance Computing10.1109/HIPC.2010.5713173(1-10)Online publication date: Dec-2010
  • (2010)Extending IC-scheduling via the Sweep AlgorithmJournal of Parallel and Distributed Computing10.1016/j.jpdc.2009.11.00170:3(201-211)Online publication date: 1-Mar-2010
  • (2008)Extending IC-Scheduling via the Sweep AlgorithmProceedings of the 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008)10.1109/PDP.2008.16(366-373)Online publication date: 13-Feb-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media