Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Optimizing databases in the cloud based on performance and cost savings

Nilsson, Felix LU and Mihajlovic, Daniel LU (2022) EITM01 20221
Department of Electrical and Information Technology
Abstract
As cloud service providers becomes more prevalent, so does questions related to cost efficiency of hosted resources. Payment models for cloud hosted resources tend to be either subscription based or pay-as-you-go for computing resources, in this case for compound metrics of CPU, Data IO and Log IO. In order to find necessary resource provisioning, previous methods have tended towards observing the utilization of these compound processing units and deciding based of some utilization threshold. This report aims to find metrics which can methodically present the state of a hosted database as well as suggest whether the current demand for resources is necessary. To do this, first a collection of metrics are found and analyzed in relation to... (More)
As cloud service providers becomes more prevalent, so does questions related to cost efficiency of hosted resources. Payment models for cloud hosted resources tend to be either subscription based or pay-as-you-go for computing resources, in this case for compound metrics of CPU, Data IO and Log IO. In order to find necessary resource provisioning, previous methods have tended towards observing the utilization of these compound processing units and deciding based of some utilization threshold. This report aims to find metrics which can methodically present the state of a hosted database as well as suggest whether the current demand for resources is necessary. To do this, first a collection of metrics are found and analyzed in relation to how they present the database state. Then optimizations are done, as suggested by chosen metrics, to make the database efficient enough to require less processing power. From these experiments, it is clear that the behaviour of a less provisioned database come with indirect changes to query processing. As less memory is available, the reliance on reading data from disk becomes more prominent, leading to less efficient execution. As this occurs for many queries that run concurrently, wait times become a more dominant part of execution for overutilized resources. The execution plan for processing a query depends on the predicted impact estimation for available resources which can drastically change the nature of execution. If more resources are provided, and statistics related to previous resource provisioning is available, it is possible that some query performance degrades with more available resources. Essentially, the results of these experiments is that finding over-/under utilized resources depend on not only considering the utilization of resources but also wait times, and limitations to executions as a result of the available resources. The metrics suggested for optimizing the databases showed promising results, but the impact of the optimization remain hard to predict. This limits possibilities to choose changes that will lead to cost reductions under the limitations set by the cloud service provider. (Less)
Popular Abstract
As different amounts of computing resources are used, execution changes to match. Not only does less computing power lead to lower capabilities to handle high workloads but the actual execution of functions often tend to become more resource demanding. In order to save costs for processing power, this is a consistent behaviour that has been observed and the key to understanding this is using metrics that can properly characterize the database under different provisioning.
Please use this url to cite or link to this publication:
author
Nilsson, Felix LU and Mihajlovic, Daniel LU
supervisor
organization
alternative title
Optimering av databaser i molnet med avseende på prestanda och kostnad
course
EITM01 20221
year
type
H2 - Master's Degree (Two Years)
subject
keywords
Resource provisioning, Cloud service platform, Optimizing cost, Database, Performance, Metrics
report number
LU/LTH-EIT 2022-886
language
English
id
9097159
date added to LUP
2022-08-24 10:38:38
date last changed
2022-08-24 10:38:38
@misc{9097159,
  abstract     = {{As cloud service providers becomes more prevalent, so does questions related to cost efficiency of hosted resources. Payment models for cloud hosted resources tend to be either subscription based or pay-as-you-go for computing resources, in this case for compound metrics of CPU, Data IO and Log IO. In order to find necessary resource provisioning, previous methods have tended towards observing the utilization of these compound processing units and deciding based of some utilization threshold. This report aims to find metrics which can methodically present the state of a hosted database as well as suggest whether the current demand for resources is necessary. To do this, first a collection of metrics are found and analyzed in relation to how they present the database state. Then optimizations are done, as suggested by chosen metrics, to make the database efficient enough to require less processing power. From these experiments, it is clear that the behaviour of a less provisioned database come with indirect changes to query processing. As less memory is available, the reliance on reading data from disk becomes more prominent, leading to less efficient execution. As this occurs for many queries that run concurrently, wait times become a more dominant part of execution for overutilized resources. The execution plan for processing a query depends on the predicted impact estimation for available resources which can drastically change the nature of execution. If more resources are provided, and statistics related to previous resource provisioning is available, it is possible that some query performance degrades with more available resources. Essentially, the results of these experiments is that finding over-/under utilized resources depend on not only considering the utilization of resources but also wait times, and limitations to executions as a result of the available resources. The metrics suggested for optimizing the databases showed promising results, but the impact of the optimization remain hard to predict. This limits possibilities to choose changes that will lead to cost reductions under the limitations set by the cloud service provider.}},
  author       = {{Nilsson, Felix and Mihajlovic, Daniel}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Optimizing databases in the cloud based on performance and cost savings}},
  year         = {{2022}},
}