Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Triple collocation-based error estimation and data fusion of global gridded precipitation products over the Yangtze River basin

Chen, Cheng ; He, Mengnan ; Chen, Qiuwen ; Zhang, Jianyun ; Li, Zhe ; Wang, Zhiyuan and Duan, Zheng LU (2022) In Journal of Hydrology 605.
Abstract

Error estimation and data fusion are critical to improving the accuracy of global model- and satellite-based precipitation products for practical applications. However, they face challenges over vast areas of the world due to limited ground observations. Triple collocation (TC) method can overcome this limitation and provide an efficient way for error estimation without the “ground truth” and thus also for data fusion, by leveraging multi-source observations and model outputs, which have been increasingly available in recent years. In this work, we conducted a comprehensive study on error estimation and data fusion of a number of global gridded precipitation products over the Yangtze River basin from 2015 to 2018 using TC and... (More)

Error estimation and data fusion are critical to improving the accuracy of global model- and satellite-based precipitation products for practical applications. However, they face challenges over vast areas of the world due to limited ground observations. Triple collocation (TC) method can overcome this limitation and provide an efficient way for error estimation without the “ground truth” and thus also for data fusion, by leveraging multi-source observations and model outputs, which have been increasingly available in recent years. In this work, we conducted a comprehensive study on error estimation and data fusion of a number of global gridded precipitation products over the Yangtze River basin from 2015 to 2018 using TC and multiplicative TC (MTC) methods. We use three satellite-based precipitation products such as the IMERG Final (IMERG-F), PERSIANN-CDR (PCDR) and SM2RAIN-ASCAT (SM2R), and one reanalysis dataset ERA5 which contains precipitation estimates. They were grouped into two TC triplets based on different combinations: IMERG-F + SM2R + ERA5 and PCDR + SM2R + ERA5. For performance evaluation, the TC-based error estimation methods were compared to the traditional method using rain gauge data, and the TC-based data fusion methods were compared with two widely-used data fusion methods Bayesian Model Averaging (BMA) and Random Forest based MErging Procedure (RF-MEP). Results showed that ERA5 had the best performance with the largest correlation coefficient (CC, 0.435), while PCDR had the worst accuracy with the smallest CC (0.304) and the largest absolute relative bias (RB, 0.365). TC tended to underestimate the root mean square error (RMSE) with respect to the traditional gauged-based method, but MTC showed a consistent result owing to the employment of a multiplicative error model. The performance of TC-based data fusion methods had no significant difference from BMA and RF-MEP. All data fusion results were better than the original triplets, as the mean CC value increased from 0.38 to 0.47 and the mean RMSE decreased from 15.0 to 13.5 mm/day. In addition, we found that the zero value replacement in MTC had great influence on error estimation, while had limited impacts on data fusion.

(Less)
Please use this url to cite or link to this publication:
author
; ; ; ; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Data fusion, Error estimation, Multiple gridded precipitation estimates, Triple collocation, Yangtze River basin
in
Journal of Hydrology
volume
605
article number
127307
publisher
Elsevier
external identifiers
  • scopus:85121206748
ISSN
0022-1694
DOI
10.1016/j.jhydrol.2021.127307
language
English
LU publication?
yes
additional info
Publisher Copyright: © 2021 Elsevier B.V.
id
d6850436-ca1a-451d-84de-20a83922d052
date added to LUP
2022-01-11 17:34:29
date last changed
2023-05-10 09:59:06
@article{d6850436-ca1a-451d-84de-20a83922d052,
  abstract     = {{<p>Error estimation and data fusion are critical to improving the accuracy of global model- and satellite-based precipitation products for practical applications. However, they face challenges over vast areas of the world due to limited ground observations. Triple collocation (TC) method can overcome this limitation and provide an efficient way for error estimation without the “ground truth” and thus also for data fusion, by leveraging multi-source observations and model outputs, which have been increasingly available in recent years. In this work, we conducted a comprehensive study on error estimation and data fusion of a number of global gridded precipitation products over the Yangtze River basin from 2015 to 2018 using TC and multiplicative TC (MTC) methods. We use three satellite-based precipitation products such as the IMERG Final (IMERG-F), PERSIANN-CDR (PCDR) and SM2RAIN-ASCAT (SM2R), and one reanalysis dataset ERA5 which contains precipitation estimates. They were grouped into two TC triplets based on different combinations: IMERG-F + SM2R + ERA5 and PCDR + SM2R + ERA5. For performance evaluation, the TC-based error estimation methods were compared to the traditional method using rain gauge data, and the TC-based data fusion methods were compared with two widely-used data fusion methods Bayesian Model Averaging (BMA) and Random Forest based MErging Procedure (RF-MEP). Results showed that ERA5 had the best performance with the largest correlation coefficient (CC, 0.435), while PCDR had the worst accuracy with the smallest CC (0.304) and the largest absolute relative bias (RB, 0.365). TC tended to underestimate the root mean square error (RMSE) with respect to the traditional gauged-based method, but MTC showed a consistent result owing to the employment of a multiplicative error model. The performance of TC-based data fusion methods had no significant difference from BMA and RF-MEP. All data fusion results were better than the original triplets, as the mean CC value increased from 0.38 to 0.47 and the mean RMSE decreased from 15.0 to 13.5 mm/day. In addition, we found that the zero value replacement in MTC had great influence on error estimation, while had limited impacts on data fusion.</p>}},
  author       = {{Chen, Cheng and He, Mengnan and Chen, Qiuwen and Zhang, Jianyun and Li, Zhe and Wang, Zhiyuan and Duan, Zheng}},
  issn         = {{0022-1694}},
  keywords     = {{Data fusion; Error estimation; Multiple gridded precipitation estimates; Triple collocation; Yangtze River basin}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Journal of Hydrology}},
  title        = {{Triple collocation-based error estimation and data fusion of global gridded precipitation products over the Yangtze River basin}},
  url          = {{http://dx.doi.org/10.1016/j.jhydrol.2021.127307}},
  doi          = {{10.1016/j.jhydrol.2021.127307}},
  volume       = {{605}},
  year         = {{2022}},
}