Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Improved dynamic modeling for controlled server queues

Berner, Tommi LU orcid ; Nyberg Carlsson, Max LU orcid ; Ruuskanen, Johan LU orcid ; Maggio, Martina LU and Årzén, Karl Erik LU orcid (2025) In Control Engineering Practice 164.
Abstract

Resource provisioning for applications hosted in the cloud is a difficult task due to inherent performance variability in the infrastructure. Control theory has proven to be an efficient tool to increase the predictability of cloud applications. However, a prerequisite for a successful control design is an adequate model of the involved dynamics. In this paper we focus on modeling of controlled server queues that are subject to actuators, such as frequency scaling or admission control. We show that today's models are only applicable to specific server types, characterized by queuing disciplines, and propose a model structure that can be applied for more general settings. Our structure is nonlinear, yet simple enough to allow for control... (More)

Resource provisioning for applications hosted in the cloud is a difficult task due to inherent performance variability in the infrastructure. Control theory has proven to be an efficient tool to increase the predictability of cloud applications. However, a prerequisite for a successful control design is an adequate model of the involved dynamics. In this paper we focus on modeling of controlled server queues that are subject to actuators, such as frequency scaling or admission control. We show that today's models are only applicable to specific server types, characterized by queuing disciplines, and propose a model structure that can be applied for more general settings. Our structure is nonlinear, yet simple enough to allow for control design. We compare our approach to state-of-the-art models in an extensive simulation campaign, showing the superior versatility of our model. We also evaluate the model using measured data from a cloud-based face detection algorithm run in Kubernetes. Furthermore, we use our model in control design examples to show the insights that can be gained. We identify a critical frequency range where the characteristics of the involved service time distribution affect the control design, and where a more advanced controller structure might be needed. Finally, we present a feedback linearization control design based on our model that is evaluated using both simulations and a cloud-based application.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Cloud control, Modeling, Queuing theory, Resource management
in
Control Engineering Practice
volume
164
article number
106473
publisher
Elsevier
external identifiers
  • scopus:105010204244
ISSN
0967-0661
DOI
10.1016/j.conengprac.2025.106473
language
English
LU publication?
yes
id
01c5936c-6be4-4954-9c88-419353c11c26
date added to LUP
2025-10-29 10:08:10
date last changed
2025-10-29 10:09:12
@article{01c5936c-6be4-4954-9c88-419353c11c26,
  abstract     = {{<p>Resource provisioning for applications hosted in the cloud is a difficult task due to inherent performance variability in the infrastructure. Control theory has proven to be an efficient tool to increase the predictability of cloud applications. However, a prerequisite for a successful control design is an adequate model of the involved dynamics. In this paper we focus on modeling of controlled server queues that are subject to actuators, such as frequency scaling or admission control. We show that today's models are only applicable to specific server types, characterized by queuing disciplines, and propose a model structure that can be applied for more general settings. Our structure is nonlinear, yet simple enough to allow for control design. We compare our approach to state-of-the-art models in an extensive simulation campaign, showing the superior versatility of our model. We also evaluate the model using measured data from a cloud-based face detection algorithm run in Kubernetes. Furthermore, we use our model in control design examples to show the insights that can be gained. We identify a critical frequency range where the characteristics of the involved service time distribution affect the control design, and where a more advanced controller structure might be needed. Finally, we present a feedback linearization control design based on our model that is evaluated using both simulations and a cloud-based application.</p>}},
  author       = {{Berner, Tommi and Nyberg Carlsson, Max and Ruuskanen, Johan and Maggio, Martina and Årzén, Karl Erik}},
  issn         = {{0967-0661}},
  keywords     = {{Cloud control; Modeling; Queuing theory; Resource management}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Control Engineering Practice}},
  title        = {{Improved dynamic modeling for controlled server queues}},
  url          = {{http://dx.doi.org/10.1016/j.conengprac.2025.106473}},
  doi          = {{10.1016/j.conengprac.2025.106473}},
  volume       = {{164}},
  year         = {{2025}},
}