Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Modeling of Request Cloning in Cloud Server Systems using Processor Sharing

Nylander, Tommi LU orcid ; Ruuskanen, Johan LU orcid ; Årzén, Karl-Erik LU orcid and Maggio, Martina LU (2020) 11th ACM/SPEC International Conference on Performance Engineering p.24-35
Abstract
The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server. The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.

The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We... (More)
The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server. The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.

The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We show examples of server systems fulfilling this requirement. We also use our G/G/1 model to co-design traditional load-balancing algorithms together with cloning strategies, providing well-performing and provably stable designs.

Finally, we also relax the synchronized service requirement and study the effects of non-perfect synchronization. We derive bounds for how common imperfections that occur in practice, such as arrival and cancellation delays, affect the accuracy of our model. We empirically demonstrate that the bounds are tight for small imperfections, and that our co-design method for the popular Join-Shortest-Queue (JSQ) policy can be used even under relaxed synchronization assumptions with small loss in accuracy. (Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
keywords
Cloning, Cloud computing, Datacenters
host publication
Proceedings of the 2020 ACM/SPEC International Conference on Performance Engineering
pages
24 - 35
conference name
11th ACM/SPEC International Conference on Performance Engineering
conference location
Edmonton, Canada
conference dates
2020-04-20 - 2020-04-24
external identifiers
  • scopus:85085926990
ISBN
978-145036991-6
DOI
10.1145/3358960.3379128
project
Control-based resource management in the distributed cloud
Event-Based Information Fusion for the Self-Adaptive Cloud
WASP: Autonomous Cloud
language
English
LU publication?
yes
id
7b37c820-6357-441a-b8d5-2fa45ec3b20b
date added to LUP
2020-02-25 17:56:13
date last changed
2022-05-12 00:45:48
@inproceedings{7b37c820-6357-441a-b8d5-2fa45ec3b20b,
  abstract     = {{The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server.  The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.<br/><br/>The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We show examples of server systems fulfilling this requirement. We also use our G/G/1 model to co-design traditional load-balancing algorithms together with cloning strategies, providing well-performing and provably stable designs.<br/><br/>Finally, we also relax the synchronized service requirement and study the effects of non-perfect synchronization. We derive bounds for how common imperfections that occur in practice, such as arrival and cancellation delays, affect the accuracy of our model. We empirically demonstrate that the bounds are tight for small imperfections, and that our co-design method for the popular Join-Shortest-Queue (JSQ) policy can be used even under relaxed synchronization assumptions with small loss in accuracy.}},
  author       = {{Nylander, Tommi and Ruuskanen, Johan and Årzén, Karl-Erik and Maggio, Martina}},
  booktitle    = {{Proceedings of the 2020 ACM/SPEC International Conference on Performance Engineering}},
  isbn         = {{978-145036991-6}},
  keywords     = {{Cloning; Cloud computing; Datacenters}},
  language     = {{eng}},
  pages        = {{24--35}},
  title        = {{Modeling of Request Cloning in Cloud Server Systems using Processor Sharing}},
  url          = {{https://lup.lub.lu.se/search/files/76541630/ICPE_2020_tommi_final.pdf}},
  doi          = {{10.1145/3358960.3379128}},
  year         = {{2020}},
}