An explainable hybrid framework for estimating daily reference evapotranspiration : Combining extreme gradient boosting with Nelder-Mead method
(2024) In Journal of Hydrology 644.- Abstract
Accurate estimation of reference evapotranspiration (ETo) is essential for effective water resources management, irrigation system design, and various hydrological and agricultural applications. This study employed extreme gradient boosting (XGBoost) model, signal decomposition techniques, and XGBoost coupled with Nelder–Mead (NM) method to enhance ETo prediction across two meteorological stations in Iran. This study proposed a novel framework which lies in its comprehensive integration of advanced techniques to create a model that is both interpretable and highly accurate for ETo estimation. For this aim, sixty meteorological variables were categorized into solar and cloud-based, temperature-based, wind and humidity-based, and... (More)
Accurate estimation of reference evapotranspiration (ETo) is essential for effective water resources management, irrigation system design, and various hydrological and agricultural applications. This study employed extreme gradient boosting (XGBoost) model, signal decomposition techniques, and XGBoost coupled with Nelder–Mead (NM) method to enhance ETo prediction across two meteorological stations in Iran. This study proposed a novel framework which lies in its comprehensive integration of advanced techniques to create a model that is both interpretable and highly accurate for ETo estimation. For this aim, sixty meteorological variables were categorized into solar and cloud-based, temperature-based, wind and humidity-based, and pressure-based groups to analyze their effect on ETo estimation. Feature selection methods, including the Gradient Boosting Machine, Kendall's Tau, and Relief Algorithm, were employed to identify the most influential predictors. Variables with normalized weights equal to or greater than 0.9 were selected for model input, resulting in the top 10% of variables being utilized. The XGBoost models were then developed using these selected inputs (level 1), a wavelet-based hybrid was developed according to the most effective features (level 2), and the XGBoost model was coupled by NM algorithm (level 3) for ETo estimation. Results of estimated ETo by levels 1 to 3 were compared with four common empirical approaches. The results indicated that Kendall's Tau-based feature selection provided the most accurate predictions in Shiraz, achieving an RMSE of 0.721 (mm/day) for solar and cloud-based variables during the test phase. Additionally, the application of wavelet analysis further refined the model inputs, which enhanced ETo estimation in most variable groups. Integrating the NM algorithm with XGBoost demonstrated significant improvements in ETo estimation, where it could improve RMSE to 0.091 and 0.155 (mm/day) for testing section in Fasa and Shiraz, respectively.
(Less)
- author
- Mohammadi, Babak
LU
; Chen, Mingjie ; Reza Nikoo, Mohammad ; Cheraghalizadeh, Majid ; Yu, Yang ; Zhang, Haiyan and Yu, Ruide
- organization
- publishing date
- 2024-11
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Meteorological variables, Optimization technique, Penman–Monteith equation, Reference evapotranspiration, Wavelet transformation, XGBoost
- in
- Journal of Hydrology
- volume
- 644
- article number
- 132130
- publisher
- Elsevier
- external identifiers
-
- scopus:85205909882
- ISSN
- 0022-1694
- DOI
- 10.1016/j.jhydrol.2024.132130
- language
- English
- LU publication?
- yes
- id
- 429a84f9-b41d-44a1-9af9-0a207a83eb47
- date added to LUP
- 2024-11-27 10:48:47
- date last changed
- 2025-04-04 14:34:24
@article{429a84f9-b41d-44a1-9af9-0a207a83eb47, abstract = {{<p>Accurate estimation of reference evapotranspiration (ETo) is essential for effective water resources management, irrigation system design, and various hydrological and agricultural applications. This study employed extreme gradient boosting (XGBoost) model, signal decomposition techniques, and XGBoost coupled with Nelder–Mead (NM) method to enhance ETo prediction across two meteorological stations in Iran. This study proposed a novel framework which lies in its comprehensive integration of advanced techniques to create a model that is both interpretable and highly accurate for ETo estimation. For this aim, sixty meteorological variables were categorized into solar and cloud-based, temperature-based, wind and humidity-based, and pressure-based groups to analyze their effect on ETo estimation. Feature selection methods, including the Gradient Boosting Machine, Kendall's Tau, and Relief Algorithm, were employed to identify the most influential predictors. Variables with normalized weights equal to or greater than 0.9 were selected for model input, resulting in the top 10% of variables being utilized. The XGBoost models were then developed using these selected inputs (level 1), a wavelet-based hybrid was developed according to the most effective features (level 2), and the XGBoost model was coupled by NM algorithm (level 3) for ETo estimation. Results of estimated ETo by levels 1 to 3 were compared with four common empirical approaches. The results indicated that Kendall's Tau-based feature selection provided the most accurate predictions in Shiraz, achieving an RMSE of 0.721 (mm/day) for solar and cloud-based variables during the test phase. Additionally, the application of wavelet analysis further refined the model inputs, which enhanced ETo estimation in most variable groups. Integrating the NM algorithm with XGBoost demonstrated significant improvements in ETo estimation, where it could improve RMSE to 0.091 and 0.155 (mm/day) for testing section in Fasa and Shiraz, respectively.</p>}}, author = {{Mohammadi, Babak and Chen, Mingjie and Reza Nikoo, Mohammad and Cheraghalizadeh, Majid and Yu, Yang and Zhang, Haiyan and Yu, Ruide}}, issn = {{0022-1694}}, keywords = {{Meteorological variables; Optimization technique; Penman–Monteith equation; Reference evapotranspiration; Wavelet transformation; XGBoost}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Journal of Hydrology}}, title = {{An explainable hybrid framework for estimating daily reference evapotranspiration : Combining extreme gradient boosting with Nelder-Mead method}}, url = {{http://dx.doi.org/10.1016/j.jhydrol.2024.132130}}, doi = {{10.1016/j.jhydrol.2024.132130}}, volume = {{644}}, year = {{2024}}, }