Optimal hyper-scalable load balancing with a strict queue limit

Mark van der Boor, Sem Borst, Johan van Leeuwaarden

Onderzoeksoutput: Bijdrage aan tijdschriftTijdschriftartikelAcademicpeer review

3 Citaten (Scopus)
71 Downloads (Pure)

Samenvatting

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off between delay performance and implementation overhead (e.g. communication or memory usage). This trade-off has primarily been studied so far from the angle of the amount of overhead required to achieve asymptotically optimal performance, particularly vanishing delay in large-scale systems. In contrast, in the present paper, we focus on an arbitrarily sparse communication budget, possibly well below the minimum requirement for vanishing delay, referred to as the hyper-scalable operating region. Furthermore, jobs may only be admitted when a specific limit on the queue position of the job can be guaranteed. The centerpiece of our analysis is a universal upper bound for the achievable throughput of any dispatcher-driven algorithm for a given communication budget and queue limit. We also propose a specific hyper-scalable scheme which can operate at any given message rate and enforce any given queue limit, while allowing the server states to be captured via a closed product-form network, in which servers act as customers traversing various nodes. The product-form distribution is leveraged to prove that the bound is tight and that the proposed hyper-scalable scheme is throughput-optimal in a many-server regime given the communication and queue limit constraints. Extensive simulation experiments are conducted to illustrate the results.

Originele taal-2Engels
Artikelnummer102217
Aantal pagina's20
TijdschriftPerformance Evaluation
Volume149-150
DOI's
StatusGepubliceerd - sep. 2021

Bibliografische nota

Funding Information:
This work is supported by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO), Netherlands Gravitation Networks grant 024.002.003 and an ERC Starting Grant, Netherlands . We would like to thank Céline Comte and Martin Zubeldia for several helpful discussions and suggestions.

Publisher Copyright:
© 2021 The Author(s)

Financiering

This work is supported by the Nederlandse Organisatie voor Wetenschappelijk Onderzoek (NWO), Netherlands Gravitation Networks grant 024.002.003 and an ERC Starting Grant, Netherlands . We would like to thank Céline Comte and Martin Zubeldia for several helpful discussions and suggestions.

Vingerafdruk

Duik in de onderzoeksthema's van 'Optimal hyper-scalable load balancing with a strict queue limit'. Samen vormen ze een unieke vingerafdruk.

Citeer dit