Skip to main content

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Processing, Modeling, and Forecasting: A Time Series Analysis of Sick Leave Absences in Sweden and Evaluating the Impact of Macro Factors

Lindell, Marcus LU and Schwerin, Casper LU (2023) In Master's Theses in Mathematical Sciences FMSM01 20231
Mathematical Statistics
Abstract
Sick absence affects companies both operationally and economically and the
matter has become increasingly prominent following the COVID-19 pandemic.
MedHelp Care is an e-health company working towards increasing workplace
wellness by offering insights into absence and rehabilitation matters. Forecasting future absence levels could help with matters such as staffing and garner more insight into which factors that impact absenteeism. Moreover, gauging how external factors affect the absence could help deepen the knowledge further. With absence data now being digital, it is able to be modeled using statistical tools. The dataset was aggregated to a daily basis and preprocessed to better represent the Swedish labour force. Then, by applying... (More)
Sick absence affects companies both operationally and economically and the
matter has become increasingly prominent following the COVID-19 pandemic.
MedHelp Care is an e-health company working towards increasing workplace
wellness by offering insights into absence and rehabilitation matters. Forecasting future absence levels could help with matters such as staffing and garner more insight into which factors that impact absenteeism. Moreover, gauging how external factors affect the absence could help deepen the knowledge further. With absence data now being digital, it is able to be modeled using statistical tools. The dataset was aggregated to a daily basis and preprocessed to better represent the Swedish labour force. Then, by applying time series analysis and machine learning methods, predictions of sick absence were formed. Datasets of the Swedish stock market and the discourse around COVID-19 on Twitter, respectively, were processed to time series and their relation to the sick absence was explored using crosscorrelation. The sets were then individually incorporated into exogenous models using autoregression, and then their impact was evaluated. Results show that the SARIMA model is better for predicting short and medium length horizons while the machine learning methods used were more apt for long horizons. Results from the exogenous models were mixed; the OMX set improved short term prediction accuracy while other horizons were largely unaffected, and the Twitter set worsened performance for all horizon lengths. While the model results are promising, their current complexity hinder large-scale deployment and further research is needed to further verify the effect of exogenous datasets. (Less)
Please use this url to cite or link to this publication:
author
Lindell, Marcus LU and Schwerin, Casper LU
supervisor
organization
course
FMSM01 20231
year
type
H2 - Master's Degree (Two Years)
subject
keywords
AI, Machine Learning, Time series analysis, Modeling, Sick absence, Forecasting, Macro factors, Stochastic processes
publication/series
Master's Theses in Mathematical Sciences
report number
LUTFMS-3475-2023
ISSN
1404-6342
other publication id
2023:E31
language
English
id
9118339
date added to LUP
2023-06-07 15:22:35
date last changed
2023-06-19 14:00:42
@misc{9118339,
  abstract     = {{Sick absence affects companies both operationally and economically and the
matter has become increasingly prominent following the COVID-19 pandemic.
MedHelp Care is an e-health company working towards increasing workplace
wellness by offering insights into absence and rehabilitation matters. Forecasting future absence levels could help with matters such as staffing and garner more insight into which factors that impact absenteeism. Moreover, gauging how external factors affect the absence could help deepen the knowledge further. With absence data now being digital, it is able to be modeled using statistical tools. The dataset was aggregated to a daily basis and preprocessed to better represent the Swedish labour force. Then, by applying time series analysis and machine learning methods, predictions of sick absence were formed. Datasets of the Swedish stock market and the discourse around COVID-19 on Twitter, respectively, were processed to time series and their relation to the sick absence was explored using crosscorrelation. The sets were then individually incorporated into exogenous models using autoregression, and then their impact was evaluated. Results show that the SARIMA model is better for predicting short and medium length horizons while the machine learning methods used were more apt for long horizons. Results from the exogenous models were mixed; the OMX set improved short term prediction accuracy while other horizons were largely unaffected, and the Twitter set worsened performance for all horizon lengths. While the model results are promising, their current complexity hinder large-scale deployment and further research is needed to further verify the effect of exogenous datasets.}},
  author       = {{Lindell, Marcus and Schwerin, Casper}},
  issn         = {{1404-6342}},
  language     = {{eng}},
  note         = {{Student Paper}},
  series       = {{Master's Theses in Mathematical Sciences}},
  title        = {{Processing, Modeling, and Forecasting: A Time Series Analysis of Sick Leave Absences in Sweden and Evaluating the Impact of Macro Factors}},
  year         = {{2023}},
}