Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

The effects of station density in geostatistical prediction of air temperatures in Sweden : A comparison of two interpolation techniques

Njoku, Elijah Akwarandu ; Akpan, Patrick Etim ; Effiong, Augustine Edet and Babatunde, Isaac Oluwatosin (2023) In Resources, Environment and Sustainability 11.
Abstract

High fidelity gridded temperature datasets are difficult to obtain for areas with sparse coverage of meteorological stations given that sparsity of stations is known to introduce uncertainty in the interpolation of climatic variables generally. Inspired by their potential for optimal results especially for small sample datasets, we assessed and compared the accuracy of interpolation results of Empirical Bayesian Kriging (EBK) and EBK-Regression Prediction (EBKRP) spatial prediction techniques under varying sampling density scenarios, using monthly maximum temperature normals (1991–2020) for the entire area of Sweden. The objectives of the study were to understand how EBK and EBKRP interpolation techniques perform in different sampling... (More)

High fidelity gridded temperature datasets are difficult to obtain for areas with sparse coverage of meteorological stations given that sparsity of stations is known to introduce uncertainty in the interpolation of climatic variables generally. Inspired by their potential for optimal results especially for small sample datasets, we assessed and compared the accuracy of interpolation results of Empirical Bayesian Kriging (EBK) and EBK-Regression Prediction (EBKRP) spatial prediction techniques under varying sampling density scenarios, using monthly maximum temperature normals (1991–2020) for the entire area of Sweden. The objectives of the study were to understand how EBK and EBKRP interpolation techniques perform in different sampling density scenarios and particularly in a sparse data setting, and the possible difference in the prediction accuracy between the two techniques. The 708 sampled stations obtained from the historical climatology database of the Swedish Meteorological and Hydrological Institute (SMHI) were split into seven sampling density subsets, ranging from 1 sample per 63,614 km2 to 1 sample per 634,350 km2 and representing both low and high sampling density scenarios. EBK interpolation technique was implemented using temperature data while land use land cover (LULC) and digital elevation model (DEM) were used as temperature covariates for the EBKRP interpolation models. The prediction accuracy assessment was based on five robust prediction performance indicators – mean error, mean absolute error, mean square error, root mean square error and Pearson correlation (R) – obtained from independent validation/cross-validation operations. Prediction accuracy was found to be generally positively related to sampling density, and sampling density accounted for about 85%–87% of interpolation accuracy for both EBK and EBKRP techniques. Although sampling density increased linearly, the rate of change in accuracy from one sampling density step to the next was not particularly proportional. For equivalent sampling density set-ups, EBKRP consistently outperformed EBK in all the accuracy metrics and EBKRP proved to be approximately 40% better than EBK. However, the two interpolation techniques produced generally low prediction biases for all the sampling density scenarios investigated. Our study suggests that potential effects of low sampling density and non-stationarity of temperature data can be significantly reduced by applying EBK but especially EBKRP when coupled with relevant covariates. This is especially true for continuous and slowly varying phenomena such as temperature and similar variables.

(Less)
Please use this url to cite or link to this publication:
author
; ; and
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Empirical Bayesian kriging, Interpolation, Sampling density, Sweden, Temperature
in
Resources, Environment and Sustainability
volume
11
article number
100092
publisher
Elsevier
external identifiers
  • scopus:85140216517
ISSN
2666-9161
DOI
10.1016/j.resenv.2022.100092
language
English
LU publication?
no
id
b8bb465d-35df-4708-8610-bbea351ece7a
date added to LUP
2023-01-23 11:14:20
date last changed
2023-01-23 11:14:20
@article{b8bb465d-35df-4708-8610-bbea351ece7a,
  abstract     = {{<p>High fidelity gridded temperature datasets are difficult to obtain for areas with sparse coverage of meteorological stations given that sparsity of stations is known to introduce uncertainty in the interpolation of climatic variables generally. Inspired by their potential for optimal results especially for small sample datasets, we assessed and compared the accuracy of interpolation results of Empirical Bayesian Kriging (EBK) and EBK-Regression Prediction (EBKRP) spatial prediction techniques under varying sampling density scenarios, using monthly maximum temperature normals (1991–2020) for the entire area of Sweden. The objectives of the study were to understand how EBK and EBKRP interpolation techniques perform in different sampling density scenarios and particularly in a sparse data setting, and the possible difference in the prediction accuracy between the two techniques. The 708 sampled stations obtained from the historical climatology database of the Swedish Meteorological and Hydrological Institute (SMHI) were split into seven sampling density subsets, ranging from 1 sample per 63,614 km<sup>2</sup> to 1 sample per 634,350 km<sup>2</sup> and representing both low and high sampling density scenarios. EBK interpolation technique was implemented using temperature data while land use land cover (LULC) and digital elevation model (DEM) were used as temperature covariates for the EBKRP interpolation models. The prediction accuracy assessment was based on five robust prediction performance indicators – mean error, mean absolute error, mean square error, root mean square error and Pearson correlation (R) – obtained from independent validation/cross-validation operations. Prediction accuracy was found to be generally positively related to sampling density, and sampling density accounted for about 85%–87% of interpolation accuracy for both EBK and EBKRP techniques. Although sampling density increased linearly, the rate of change in accuracy from one sampling density step to the next was not particularly proportional. For equivalent sampling density set-ups, EBKRP consistently outperformed EBK in all the accuracy metrics and EBKRP proved to be approximately 40% better than EBK. However, the two interpolation techniques produced generally low prediction biases for all the sampling density scenarios investigated. Our study suggests that potential effects of low sampling density and non-stationarity of temperature data can be significantly reduced by applying EBK but especially EBKRP when coupled with relevant covariates. This is especially true for continuous and slowly varying phenomena such as temperature and similar variables.</p>}},
  author       = {{Njoku, Elijah Akwarandu and Akpan, Patrick Etim and Effiong, Augustine Edet and Babatunde, Isaac Oluwatosin}},
  issn         = {{2666-9161}},
  keywords     = {{Empirical Bayesian kriging; Interpolation; Sampling density; Sweden; Temperature}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Resources, Environment and Sustainability}},
  title        = {{The effects of station density in geostatistical prediction of air temperatures in Sweden : A comparison of two interpolation techniques}},
  url          = {{http://dx.doi.org/10.1016/j.resenv.2022.100092}},
  doi          = {{10.1016/j.resenv.2022.100092}},
  volume       = {{11}},
  year         = {{2023}},
}