Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

The Application of Tree-Based Algorithms on Classifying Shunting Yard Departure Status

Minbashi, Niloofar ; Bohlin, Markus ; Palmqvist, Carl William LU orcid and Kordnejad, Behzad (2021) In Journal of Advanced Transportation 2021.
Abstract

Shunting yards are one of the main areas impacting the reliability of rail freight networks, and delayed departures from shunting yards can further also affect the punctuality of mixed-traffic networks. Methods for automatic detection of departures, which are likely to be delayed, can therefore contribute towards increasing the reliability and punctuality of both freight and passenger services. In this paper, we compare the performance of tree-based methods (decision trees and random forests), which have been highly successful in a wide range of generic applications, in classifying the status of (delayed, early, and on-time) departing trains from shunting yards, focusing on the delayed departures as the minority class. We use a total... (More)

Shunting yards are one of the main areas impacting the reliability of rail freight networks, and delayed departures from shunting yards can further also affect the punctuality of mixed-traffic networks. Methods for automatic detection of departures, which are likely to be delayed, can therefore contribute towards increasing the reliability and punctuality of both freight and passenger services. In this paper, we compare the performance of tree-based methods (decision trees and random forests), which have been highly successful in a wide range of generic applications, in classifying the status of (delayed, early, and on-time) departing trains from shunting yards, focusing on the delayed departures as the minority class. We use a total number of 6,243 train connections (representing over 21,000 individual wagon connections) for a one-month period from the Hallsberg yard in Sweden, which is the largest shunting yard in Scandinavia. Considering our dataset, our results show a slight difference between the application of decision trees and random forests in detecting delayed departures as the minority class. To remedy this, enhanced sampling for minority classes is applied by the synthetic minority oversampling technique (SMOTE) to improve detecting and assigning delayed departures. Applying SMOTE improved the sensitivity, precision, and F-measure of delayed departures by 20% for decision trees and by 30% for random forests. Overall, random forests show a relative better performance in detecting all three departure classes before and after applying SMOTE. Although the preliminary results presented in this paper are encouraging, future studies are needed to investigate the computational performance of tree-based algorithms using larger datasets and considering additional predictors.

(Less)
Please use this url to cite or link to this publication:
author
; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
in
Journal of Advanced Transportation
volume
2021
article number
3538462
publisher
Hindawi Limited
external identifiers
  • scopus:85115798799
ISSN
0197-6729
DOI
10.1155/2021/3538462
language
English
LU publication?
yes
id
fc8be9cf-0e46-4ec9-9f03-bd5f1b2184d9
date added to LUP
2021-10-04 08:31:22
date last changed
2023-11-08 20:19:39
@article{fc8be9cf-0e46-4ec9-9f03-bd5f1b2184d9,
  abstract     = {{<p>Shunting yards are one of the main areas impacting the reliability of rail freight networks, and delayed departures from shunting yards can further also affect the punctuality of mixed-traffic networks. Methods for automatic detection of departures, which are likely to be delayed, can therefore contribute towards increasing the reliability and punctuality of both freight and passenger services. In this paper, we compare the performance of tree-based methods (decision trees and random forests), which have been highly successful in a wide range of generic applications, in classifying the status of (delayed, early, and on-time) departing trains from shunting yards, focusing on the delayed departures as the minority class. We use a total number of 6,243 train connections (representing over 21,000 individual wagon connections) for a one-month period from the Hallsberg yard in Sweden, which is the largest shunting yard in Scandinavia. Considering our dataset, our results show a slight difference between the application of decision trees and random forests in detecting delayed departures as the minority class. To remedy this, enhanced sampling for minority classes is applied by the synthetic minority oversampling technique (SMOTE) to improve detecting and assigning delayed departures. Applying SMOTE improved the sensitivity, precision, and F-measure of delayed departures by 20% for decision trees and by 30% for random forests. Overall, random forests show a relative better performance in detecting all three departure classes before and after applying SMOTE. Although the preliminary results presented in this paper are encouraging, future studies are needed to investigate the computational performance of tree-based algorithms using larger datasets and considering additional predictors. </p>}},
  author       = {{Minbashi, Niloofar and Bohlin, Markus and Palmqvist, Carl William and Kordnejad, Behzad}},
  issn         = {{0197-6729}},
  language     = {{eng}},
  publisher    = {{Hindawi Limited}},
  series       = {{Journal of Advanced Transportation}},
  title        = {{The Application of Tree-Based Algorithms on Classifying Shunting Yard Departure Status}},
  url          = {{http://dx.doi.org/10.1155/2021/3538462}},
  doi          = {{10.1155/2021/3538462}},
  volume       = {{2021}},
  year         = {{2021}},
}