Advanced

Introducing Service-level Awareness in the Cloud

Klein, Cristian; Maggio, Martina LU ; Årzén, Karl-Erik LU and Hernández-Rodriguez, Francisco (2013) 2013 ACM Symposium on Cloud Computing
Abstract
Managing the resources of a virtualized data-center is a key issue in cloud computing. Existing research mostly assumes that applications are either allocated the required resources or fail. Combined with the fact that most cloud applications have dynamic resource requirements, this imposes a fundamental limitation to cloud computing: To guarantee on-demand resource allocations, the data-center needs large spare capacity, leading to inefficient resource utilization.



This is especially problematic when dealing with unexpected events, such as flash crowds, hardware failures

and performance inference among applications.

These phenomena are well-known and software is readily written to cope with them, as... (More)
Managing the resources of a virtualized data-center is a key issue in cloud computing. Existing research mostly assumes that applications are either allocated the required resources or fail. Combined with the fact that most cloud applications have dynamic resource requirements, this imposes a fundamental limitation to cloud computing: To guarantee on-demand resource allocations, the data-center needs large spare capacity, leading to inefficient resource utilization.



This is especially problematic when dealing with unexpected events, such as flash crowds, hardware failures

and performance inference among applications.

These phenomena are well-known and software is readily written to cope with them, as long as resource provisioning is sufficient. However, given the short duration and large magnitude of such events, provisioning enough capacity is often economically unfeasible. Hence, the data-center may become overloaded, rendering hosted applications unresponsive.



To efficiently and robustly deal with unexpected events, we introduce service-level awareness in the cloud. Application are augmented with a dynamic parameter, the service level, that monotonically affects both their resource requirements and their delivered end-user experience. For example, online shops offer end-users recommendations of similar products. No doubt, such recommender engines greatly increase usability, however, they are highly resource demanding. By selectively deactivating the corresponding code, resource requirements can be controlled at the expense of end-user experience.

In case of unexpected events, the infrastructure can simply ask applications to temporarily reduce their requirements. Consequently, end-user experience is downgraded, but the user is at least provided with partial content in a timely manner.



We built the necessary software to add service level-awareness to clouds, with contributions both on application-side as well as infrastructure-side. (Less)
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Contribution to conference
publication status
published
subject
pages
2 pages
conference name
2013 ACM Symposium on Cloud Computing
external identifiers
  • Scopus:84893297340
language
English
LU publication?
yes
id
fe90fe4a-dbba-4fa1-8f8f-d5775e07108e (old id 4249327)
date added to LUP
2014-01-17 12:04:48
date last changed
2016-10-13 04:58:33
@misc{fe90fe4a-dbba-4fa1-8f8f-d5775e07108e,
  abstract     = {Managing the resources of a virtualized data-center is a key issue in cloud computing. Existing research mostly assumes that applications are either allocated the required resources or fail. Combined with the fact that most cloud applications have dynamic resource requirements, this imposes a fundamental limitation to cloud computing: To guarantee on-demand resource allocations, the data-center needs large spare capacity, leading to inefficient resource utilization.<br/><br>
<br/><br>
This is especially problematic when dealing with unexpected events, such as flash crowds, hardware failures<br/><br>
and performance inference among applications.<br/><br>
These phenomena are well-known and software is readily written to cope with them, as long as resource provisioning is sufficient. However, given the short duration and large magnitude of such events, provisioning enough capacity is often economically unfeasible. Hence, the data-center may become overloaded, rendering hosted applications unresponsive.<br/><br>
<br/><br>
To efficiently and robustly deal with unexpected events, we introduce service-level awareness in the cloud. Application are augmented with a dynamic parameter, the service level, that monotonically affects both their resource requirements and their delivered end-user experience. For example, online shops offer end-users recommendations of similar products. No doubt, such recommender engines greatly increase usability, however, they are highly resource demanding. By selectively deactivating the corresponding code, resource requirements can be controlled at the expense of end-user experience.<br/><br>
In case of unexpected events, the infrastructure can simply ask applications to temporarily reduce their requirements. Consequently, end-user experience is downgraded, but the user is at least provided with partial content in a timely manner.<br/><br>
<br/><br>
We built the necessary software to add service level-awareness to clouds, with contributions both on application-side as well as infrastructure-side.},
  author       = {Klein, Cristian and Maggio, Martina and Årzén, Karl-Erik and Hernández-Rodriguez, Francisco},
  language     = {eng},
  pages        = {2},
  title        = {Introducing Service-level Awareness in the Cloud},
  year         = {2013},
}