How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches

Zheng, Lai; Li, Jiayi; Wei, Wei; D'Agostino, Carmelo; Laureshyn, Aliaksei

How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches

Mark

Zheng, Lai ; Li, Jiayi ; Wei, Wei ; D'Agostino, Carmelo ^LU

and Laureshyn, Aliaksei ^LU

(2025) In Accident Analysis and Prevention 221.

Abstract: Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types,... (More); Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types, intersection approaches, and sample sizes. Two safety indicators, the expected annual number of crashes and the crash return level, both derived from the GPD distribution, were employed to assess sample adequacy based on their respective rates of convergence. Results show that the crash return level, unlike traditional crash frequency metrics, remains non-zero and sensitive to observation duration even in low-risk scenarios, facilitating a more robust identification of adequate sample sizes. Notably, the BH_GPD model generally reduced required observation durations compared to standalone GPD models, particularly for low-conflict scenarios (i.e., less than 10 conflicts/hour), while yielding narrower credible intervals due to its ability to pool data across sites. A key finding reveals that adequate sample sizes range from 15 to over 45 hours for different scenarios, inversely correlated with conflict rates. This study establishes a quantitative framework to determine the adequate sample size of traffic conflicts, which has the potential to contribute significantly to the standardization of traffic conflict techniques in road safety research.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/e8aa64cf-dc32-447b-bf99-65c0ab2d75d0

author

Zheng, Lai ; Li, Jiayi ; Wei, Wei ; D'Agostino, Carmelo ^LU

and Laureshyn, Aliaksei ^LU

organization

publishing date

2025-10

type

Contribution to journal

publication status

published

subject

Infrastructure Engineering

keywords

Bayesian hierarchical modeling, Crash return level, Extreme value theory, Sample size determination, Traffic conflict

in

Accident Analysis and Prevention

volume

221

article number

108215

publisher

Elsevier

external identifiers

pmid:40882301
scopus:105014251642

ISSN

0001-4575

DOI

10.1016/j.aap.2025.108215

language

English

LU publication?

yes

id

e8aa64cf-dc32-447b-bf99-65c0ab2d75d0

date added to LUP

2025-10-10 15:17:25

date last changed

2026-01-07 05:59:40

@article{e8aa64cf-dc32-447b-bf99-65c0ab2d75d0,
  abstract     = {{<p>Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types, intersection approaches, and sample sizes. Two safety indicators, the expected annual number of crashes and the crash return level, both derived from the GPD distribution, were employed to assess sample adequacy based on their respective rates of convergence. Results show that the crash return level, unlike traditional crash frequency metrics, remains non-zero and sensitive to observation duration even in low-risk scenarios, facilitating a more robust identification of adequate sample sizes. Notably, the BH_GPD model generally reduced required observation durations compared to standalone GPD models, particularly for low-conflict scenarios (i.e., less than 10 conflicts/hour), while yielding narrower credible intervals due to its ability to pool data across sites. A key finding reveals that adequate sample sizes range from 15 to over 45 hours for different scenarios, inversely correlated with conflict rates. This study establishes a quantitative framework to determine the adequate sample size of traffic conflicts, which has the potential to contribute significantly to the standardization of traffic conflict techniques in road safety research.</p>}},
  author       = {{Zheng, Lai and Li, Jiayi and Wei, Wei and D'Agostino, Carmelo and Laureshyn, Aliaksei}},
  issn         = {{0001-4575}},
  keywords     = {{Bayesian hierarchical modeling; Crash return level; Extreme value theory; Sample size determination; Traffic conflict}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Accident Analysis and Prevention}},
  title        = {{How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches}},
  url          = {{http://dx.doi.org/10.1016/j.aap.2025.108215}},
  doi          = {{10.1016/j.aap.2025.108215}},
  volume       = {{221}},
  year         = {{2025}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches