How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches
(2025) In Accident Analysis and Prevention 221.- Abstract
Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types,... (More)
Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types, intersection approaches, and sample sizes. Two safety indicators, the expected annual number of crashes and the crash return level, both derived from the GPD distribution, were employed to assess sample adequacy based on their respective rates of convergence. Results show that the crash return level, unlike traditional crash frequency metrics, remains non-zero and sensitive to observation duration even in low-risk scenarios, facilitating a more robust identification of adequate sample sizes. Notably, the BH_GPD model generally reduced required observation durations compared to standalone GPD models, particularly for low-conflict scenarios (i.e., less than 10 conflicts/hour), while yielding narrower credible intervals due to its ability to pool data across sites. A key finding reveals that adequate sample sizes range from 15 to over 45 hours for different scenarios, inversely correlated with conflict rates. This study establishes a quantitative framework to determine the adequate sample size of traffic conflicts, which has the potential to contribute significantly to the standardization of traffic conflict techniques in road safety research.
(Less)
- author
- Zheng, Lai
; Li, Jiayi
; Wei, Wei
; D'Agostino, Carmelo
LU
and Laureshyn, Aliaksei LU
- organization
- publishing date
- 2025-10
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Bayesian hierarchical modeling, Crash return level, Extreme value theory, Sample size determination, Traffic conflict
- in
- Accident Analysis and Prevention
- volume
- 221
- article number
- 108215
- publisher
- Elsevier
- external identifiers
-
- scopus:105014251642
- pmid:40882301
- ISSN
- 0001-4575
- DOI
- 10.1016/j.aap.2025.108215
- language
- English
- LU publication?
- yes
- id
- e8aa64cf-dc32-447b-bf99-65c0ab2d75d0
- date added to LUP
- 2025-10-10 15:17:25
- date last changed
- 2025-10-11 04:01:16
@article{e8aa64cf-dc32-447b-bf99-65c0ab2d75d0, abstract = {{<p>Determining the appropriate duration for traffic conflict observation remains a critical yet unresolved challenge in road safety analysis. Existing approaches lack a quantitative approach to determine adequate sample sizes for general conflict-based applications. This study addresses this gap by proposing an extreme value theory based framework to determine appropriate observation durations, based on the inherent stability of road entity safety for a specified period. Using approximately 50 hours of high-resolution LiDAR data from a signalized intersection in Harbin, China, conventional generalized Pareto distribution (GPD) models and Bayesian hierarchical GPD (BH_GPD) models were developed, considering variations in conflict types, intersection approaches, and sample sizes. Two safety indicators, the expected annual number of crashes and the crash return level, both derived from the GPD distribution, were employed to assess sample adequacy based on their respective rates of convergence. Results show that the crash return level, unlike traditional crash frequency metrics, remains non-zero and sensitive to observation duration even in low-risk scenarios, facilitating a more robust identification of adequate sample sizes. Notably, the BH_GPD model generally reduced required observation durations compared to standalone GPD models, particularly for low-conflict scenarios (i.e., less than 10 conflicts/hour), while yielding narrower credible intervals due to its ability to pool data across sites. A key finding reveals that adequate sample sizes range from 15 to over 45 hours for different scenarios, inversely correlated with conflict rates. This study establishes a quantitative framework to determine the adequate sample size of traffic conflicts, which has the potential to contribute significantly to the standardization of traffic conflict techniques in road safety research.</p>}}, author = {{Zheng, Lai and Li, Jiayi and Wei, Wei and D'Agostino, Carmelo and Laureshyn, Aliaksei}}, issn = {{0001-4575}}, keywords = {{Bayesian hierarchical modeling; Crash return level; Extreme value theory; Sample size determination; Traffic conflict}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Accident Analysis and Prevention}}, title = {{How long is long enough for traffic conflict observation : An investigation using extreme value theory approaches}}, url = {{http://dx.doi.org/10.1016/j.aap.2025.108215}}, doi = {{10.1016/j.aap.2025.108215}}, volume = {{221}}, year = {{2025}}, }