Modeling of Request Cloning in Cloud Server Systems using Processor Sharing
(2020) 11th ACM/SPEC International Conference on Performance Engineering p.24-35- Abstract
- The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server. The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.
The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We... (More) - The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server. The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.
The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We show examples of server systems fulfilling this requirement. We also use our G/G/1 model to co-design traditional load-balancing algorithms together with cloning strategies, providing well-performing and provably stable designs.
Finally, we also relax the synchronized service requirement and study the effects of non-perfect synchronization. We derive bounds for how common imperfections that occur in practice, such as arrival and cancellation delays, affect the accuracy of our model. We empirically demonstrate that the bounds are tight for small imperfections, and that our co-design method for the popular Join-Shortest-Queue (JSQ) policy can be used even under relaxed synchronization assumptions with small loss in accuracy. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/7b37c820-6357-441a-b8d5-2fa45ec3b20b
- author
- Nylander, Tommi LU ; Ruuskanen, Johan LU ; Årzén, Karl-Erik LU and Maggio, Martina LU
- organization
- publishing date
- 2020
- type
- Chapter in Book/Report/Conference proceeding
- publication status
- published
- subject
- keywords
- Cloning, Cloud computing, Datacenters
- host publication
- Proceedings of the 2020 ACM/SPEC International Conference on Performance Engineering
- pages
- 24 - 35
- conference name
- 11th ACM/SPEC International Conference on Performance Engineering
- conference location
- Edmonton, Canada
- conference dates
- 2020-04-20 - 2020-04-24
- external identifiers
-
- scopus:85085926990
- ISBN
- 978-145036991-6
- DOI
- 10.1145/3358960.3379128
- project
- Control-based resource management in the distributed cloud
- Event-Based Information Fusion for the Self-Adaptive Cloud
- WASP: Autonomous Cloud
- language
- English
- LU publication?
- yes
- id
- 7b37c820-6357-441a-b8d5-2fa45ec3b20b
- date added to LUP
- 2020-02-25 17:56:13
- date last changed
- 2022-05-12 00:45:48
@inproceedings{7b37c820-6357-441a-b8d5-2fa45ec3b20b, abstract = {{The interest for studying server systems subject to cloned requests has recently increased. In this paper we present a model that allows us to equivalently represent a system of servers with cloned requests, as a single server. The model is very general, and we show that no assumptions on either inter-arrival or service time distributions are required, allowing for, e.g., both heterogeneity and dependencies. Further, we show that the model holds for any queuing discipline. However, we focus our attention on Processor Sharing, as the discipline has not been studied before in this context.<br/><br/>The key requirement that enables us to use the single server G/G/1 model is that the request clones have to receive synchronized service. We show examples of server systems fulfilling this requirement. We also use our G/G/1 model to co-design traditional load-balancing algorithms together with cloning strategies, providing well-performing and provably stable designs.<br/><br/>Finally, we also relax the synchronized service requirement and study the effects of non-perfect synchronization. We derive bounds for how common imperfections that occur in practice, such as arrival and cancellation delays, affect the accuracy of our model. We empirically demonstrate that the bounds are tight for small imperfections, and that our co-design method for the popular Join-Shortest-Queue (JSQ) policy can be used even under relaxed synchronization assumptions with small loss in accuracy.}}, author = {{Nylander, Tommi and Ruuskanen, Johan and Årzén, Karl-Erik and Maggio, Martina}}, booktitle = {{Proceedings of the 2020 ACM/SPEC International Conference on Performance Engineering}}, isbn = {{978-145036991-6}}, keywords = {{Cloning; Cloud computing; Datacenters}}, language = {{eng}}, pages = {{24--35}}, title = {{Modeling of Request Cloning in Cloud Server Systems using Processor Sharing}}, url = {{https://lup.lub.lu.se/search/files/76541630/ICPE_2020_tommi_final.pdf}}, doi = {{10.1145/3358960.3379128}}, year = {{2020}}, }