skip to main content
research-article

Smart Dispatching in Heterogeneous Systems

Published:04 December 2019Publication History
Skip Abstract Section

Abstract

In multi-server systems, selecting to which server to dispatch an arriving job is a key factor influencing system response time. One of the most widely studied policies is Join-the-Shortest-Queue (JSQ), which is known to minimize mean response time in certain settings [7]. Many variants on JSQ have been proposed, including JSQ-d, under which a job is dispatched to the shortest queue among d servers selected uniformly at random [3, 5]; Join-Idle-Queue (JIQ), under which the dispatcher knows which servers are idle but not the queue lengths of non-idle servers [2]; and others.

The vast majority of work analyzing JSQ and related policies makes a key assumption: that the system is homogeneous, meaning that all servers have the same speed. This assumption is inaccurate in most modern computer systems. Server heterogeneity can arise, e.g., when a server farm consists of several generations of hardware, or when many virtual machines contend for resources on the same physical machine. Unfortunately, the wealth of results about how best to dispatch in homogeneous systems does not translate well to heterogeneous systems. Policies like JSQ-d and JIQ, which can achieve near-optimal performance in homogeneous systems, can lead to unacceptably high response times and even instability in heterogeneous systems [4, 8].

References

  1. S. Banawan and N. Zeidat. A comparative study of load sharing in heterogeneous multicomputer systems. In Proceedings. 25th Annual Simulation Symposium, pages 22--31. IEEE, 1992.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Y. Lu, Q. Xie, G. Kliot, A. Geller, J. Larus, and A. Greenberg. Join-idle-queue: A novel load balancing algorithm for dynamically scalable web services. Performance Evaluation, 68(11):1056--1071, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Mitzenmacher. The power of two choices in randomized load balancing. IEEE Transactions on Parallel and Distributed Systems, 12(10):1094--1104, 2001.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Stolyar. Pull-based load distribution in large-scale heterogeneous service systems. Queueing Systems, 80(4):341--361, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Vvedenskaya, R. Dobrushin, and F. Karpelevich. Queueing system with selection of the shortest of two queues: An asymptotic approach. Problemy Peredachi Informatsii, 32(1):20--34, 1996.Google ScholarGoogle Scholar
  6. W. Whitt. Deciding which queue to join: Some counterexamples. Operations research, 34(1):55--62, 1986.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. W. Winston. Optimality of the shortest line discipline. Journal of Applied Probability, 14(1):181--189, 1977.Google ScholarGoogle ScholarCross RefCross Ref
  8. X. Zhou, F. Wu, J. Tan, Y. Sun, and N. Shro". Designing low-complexity heavy-traffic delay-optimal load balancing schemes: Theory to algorithms. Proceedings of the ACM on Measurement and Analysis of Computing Systems, 1(2):39, 2017.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Smart Dispatching in Heterogeneous Systems

      Recommendations

      Reviews

      Mohammad Sadegh Kayhani Pirdehi

      For modern computing platforms, heterogeneous processing nodes are the reality. Moreover, most traditional job assignment algorithms, which are designed for homogeneous systems, are not efficient for heterogeneous platforms. For the sake of gaining the least response time, this paper investigates how to best dispatch the arriving jobs in heterogeneous systems, where the available servers have different processing speeds. The introduction presents well-known job dispatching policies like join-the-shortest-queue (JSQ), its variant JSQ- d (the server is chosen among d randomly selected servers), and join-idle-queue. Emphasizing their inefficiency in heterogeneous platforms, "power-of- d " policies are introduced. The paper presents techniques to smartly manage the utilization of faster processing nodes in heterogeneous systems by selecting which servers to be polled and how to assign the arrived jobs to them. In this manner, knowing the speed of polled servers is the key factor of performance. System modeling is included; k servers are divided into fast and slow servers with exponentially distributed service times. Two scenarios are examined and analyzed. In the first scenario, to evaluate response time, a heterogeneous version of JSQ is detailed. The servers are distinguished based on fast or slow, and the job assignments on idle fast, idle slow, and no idle fast orders. In order to determine the system's response time, the authors use a deterministic limiting system and a differential equation system to model behavior. The second scenario considers a set of servers without any prior knowledge of speed. After a period of random job assignments, servers with more accomplished jobs are assigned higher priority for polling and job assignments. Based on innovations from system results, JSQ- d with accomplishment sampling (JSQ-DAS) is a nice idea. While the paper is good, it could be improved by including an analytical discussion of JSQ-DAS.

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM SIGMETRICS Performance Evaluation Review
        ACM SIGMETRICS Performance Evaluation Review  Volume 47, Issue 2
        September 2019
        37 pages
        ISSN:0163-5999
        DOI:10.1145/3374888
        Issue’s Table of Contents

        Copyright © 2019 Copyright is held by the owner/author(s)

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 4 December 2019

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader