Spatial self-confounding : smoothness-related estimation bias in spatial regression models
(2026) In Biometrika 113(1).- Abstract
The estimation of regression parameters in spatially referenced data plays a crucial role across various scientific domains. A common approach involves employing an additive regression model to capture the relationship between observations and covariates, accounting for spatial variability not explained by the covariates through a Gaussian random field. We study the effect of misspecified covariates, in particular when the misspecification changes the smoothness. We analyse the theoretical properties of the generalized least-squares estimator under infill asymptotics, and show that the estimator can have counter-intuitive properties. In particular, the estimated regression coefficients can converge to zero as the number of observations... (More)
The estimation of regression parameters in spatially referenced data plays a crucial role across various scientific domains. A common approach involves employing an additive regression model to capture the relationship between observations and covariates, accounting for spatial variability not explained by the covariates through a Gaussian random field. We study the effect of misspecified covariates, in particular when the misspecification changes the smoothness. We analyse the theoretical properties of the generalized least-squares estimator under infill asymptotics, and show that the estimator can have counter-intuitive properties. In particular, the estimated regression coefficients can converge to zero as the number of observations increases if the covariates are too rough, despite high correlations between observations and covariates. This has important implications for practical applications as the importance of rough covariates can be severely underestimated, leading to incorrect scientific conclusions. We also show that the estimates can diverge to infinity under certain conditions, which can also lead to incorrect conclusions in practical applications. Through an application to temperature and precipitation data, we show that both behaviours can be observed for real data. Finally, we propose adding a smoothing step in the regression and show both theoretically and practically that this can solve the problem.
(Less)
- author
- Bolin, David LU and Wallin, Jonas LU
- organization
- publishing date
- 2026
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Estimation, Generalized least squares, Maximum likelihood, Misspecification, Spatial regression
- in
- Biometrika
- volume
- 113
- issue
- 1
- article number
- asaf076
- publisher
- Oxford University Press
- external identifiers
-
- scopus:105033062223
- ISSN
- 0006-3444
- DOI
- 10.1093/biomet/asaf076
- language
- English
- LU publication?
- yes
- additional info
- Publisher Copyright: © The Author(s) 2025. Published by Oxford University Press on behalf of Biometrika Trust.
- id
- 297acc8a-2916-4e02-938d-f1a90fbbe1fc
- date added to LUP
- 2026-05-04 14:29:49
- date last changed
- 2026-05-04 14:30:37
@article{297acc8a-2916-4e02-938d-f1a90fbbe1fc,
abstract = {{<p>The estimation of regression parameters in spatially referenced data plays a crucial role across various scientific domains. A common approach involves employing an additive regression model to capture the relationship between observations and covariates, accounting for spatial variability not explained by the covariates through a Gaussian random field. We study the effect of misspecified covariates, in particular when the misspecification changes the smoothness. We analyse the theoretical properties of the generalized least-squares estimator under infill asymptotics, and show that the estimator can have counter-intuitive properties. In particular, the estimated regression coefficients can converge to zero as the number of observations increases if the covariates are too rough, despite high correlations between observations and covariates. This has important implications for practical applications as the importance of rough covariates can be severely underestimated, leading to incorrect scientific conclusions. We also show that the estimates can diverge to infinity under certain conditions, which can also lead to incorrect conclusions in practical applications. Through an application to temperature and precipitation data, we show that both behaviours can be observed for real data. Finally, we propose adding a smoothing step in the regression and show both theoretically and practically that this can solve the problem.</p>}},
author = {{Bolin, David and Wallin, Jonas}},
issn = {{0006-3444}},
keywords = {{Estimation; Generalized least squares; Maximum likelihood; Misspecification; Spatial regression}},
language = {{eng}},
number = {{1}},
publisher = {{Oxford University Press}},
series = {{Biometrika}},
title = {{Spatial self-confounding : smoothness-related estimation bias in spatial regression models}},
url = {{http://dx.doi.org/10.1093/biomet/asaf076}},
doi = {{10.1093/biomet/asaf076}},
volume = {{113}},
year = {{2026}},
}