Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Forecasting copper price using VAR and the XGBoost model: an experiment with a relatively small dataset

Hu, Juanli LU (2023) DABN01 20231
Department of Economics
Department of Statistics
Abstract
Given the importance of copper prices to investors, governments, and policymakers, this paper investigates short-term price predictability using VAR and XGBoost models. All models are trained with historical data from November 2021 to December 2022 and using MSE, RMSE and MAE for evaluating the model performance. The results show that the XGBoost model outperforms VAR models, implying that machine learning models are more robust than traditional statistical models. However, specific scenarios with a lag of 1to predict one day ahead(h=1day) show similar performance between XGBoost and VAR, indicating that traditional statistical models can still be competitive in certain situations. Therefore, it is critical not to dismiss traditional... (More)
Given the importance of copper prices to investors, governments, and policymakers, this paper investigates short-term price predictability using VAR and XGBoost models. All models are trained with historical data from November 2021 to December 2022 and using MSE, RMSE and MAE for evaluating the model performance. The results show that the XGBoost model outperforms VAR models, implying that machine learning models are more robust than traditional statistical models. However, specific scenarios with a lag of 1to predict one day ahead(h=1day) show similar performance between XGBoost and VAR, indicating that traditional statistical models can still be competitive in certain situations. Therefore, it is critical not to dismiss traditional statistical models entirely, as they provide benefits in terms of interpretability and computational simplicity. Moreover, we also find that the selection of lag values for models is demonstrated to be empirical, with different lag values resulting in varying model performance. Thus, practitioners are encouraged to experiment with different lag settings in order to find the best model for their specific tasks and dataset sizes. (Less)
Popular Abstract
Given the importance of copper prices to investors, governments, and policymakers, this paper investigates short-term price predictability using VAR and XGBoost models. All models are trained with historical data from November 2021 to December 2022 and using MSE, RMSE and MAE for evaluating the model performance. The results show that the XGBoost model outperforms VAR models, implying that machine learning models are more robust than traditional statistical models. However, specific scenarios with a lag of 1to predict one day ahead(h=1day) show similar performance between XGBoost and VAR, indicating that traditional statistical models can still be competitive in certain situations. Therefore, it is critical not to dismiss traditional... (More)
Given the importance of copper prices to investors, governments, and policymakers, this paper investigates short-term price predictability using VAR and XGBoost models. All models are trained with historical data from November 2021 to December 2022 and using MSE, RMSE and MAE for evaluating the model performance. The results show that the XGBoost model outperforms VAR models, implying that machine learning models are more robust than traditional statistical models. However, specific scenarios with a lag of 1to predict one day ahead(h=1day) show similar performance between XGBoost and VAR, indicating that traditional statistical models can still be competitive in certain situations. Therefore, it is critical not to dismiss traditional statistical models entirely, as they provide benefits in terms of interpretability and computational simplicity. Moreover, we also find that the selection of lag values for models is demonstrated to be empirical, with different lag values resulting in varying model performance. Thus, practitioners are encouraged to experiment with different lag settings in order to find the best model for their specific tasks and dataset sizes. (Less)
Please use this url to cite or link to this publication:
author
Hu, Juanli LU
supervisor
organization
course
DABN01 20231
year
type
H1 - Master's Degree (One Year)
subject
keywords
copper price, Vector autoregressive model, XGBoost, Time series
language
English
id
9136057
date added to LUP
2023-11-21 12:54:01
date last changed
2023-11-21 12:54:01
@misc{9136057,
  abstract     = {{Given the importance of copper prices to investors, governments, and policymakers, this paper investigates short-term price predictability using VAR and XGBoost models. All models are trained with historical data from November 2021 to December 2022 and using MSE, RMSE and MAE for evaluating the model performance. The results show that the XGBoost model outperforms VAR models, implying that machine learning models are more robust than traditional statistical models. However, specific scenarios with a lag of 1to predict one day ahead(h=1day) show similar performance between XGBoost and VAR, indicating that traditional statistical models can still be competitive in certain situations. Therefore, it is critical not to dismiss traditional statistical models entirely, as they provide benefits in terms of interpretability and computational simplicity. Moreover, we also find that the selection of lag values for models is demonstrated to be empirical, with different lag values resulting in varying model performance. Thus, practitioners are encouraged to experiment with different lag settings in order to find the best model for their specific tasks and dataset sizes.}},
  author       = {{Hu, Juanli}},
  language     = {{eng}},
  note         = {{Student Paper}},
  title        = {{Forecasting copper price using VAR and the XGBoost model: an experiment with a relatively small dataset}},
  year         = {{2023}},
}