Generation and Validation of Representative Pre-Crash Scenarios
Doctoral thesis, 2026

Driving automation systems (DAS), including Advanced Driver Assistance Systems (ADAS) and Automated Driving Systems (ADS), are expected to substantially improve traffic safety. Virtual safety assessment is the primary approach for quantitatively evaluating the prospective safety impacts of these systems, but its validity depends critically on the availability of comprehensive and representative pre-crash scenarios. Existing real-world data are limited in quantity and coverage and often suffer from sampling bias, making the generation of synthetic pre-crash scenarios necessary. However, current generation approaches face challenges such as biased or incomplete data and difficulties in validation. In particular, the absence of systematic methods for validating the representativeness of the synthetic scenarios remains a critical knowledge gap.

To address these challenges, this thesis develops an integrated methodological framework for generating and validating representative synthetic pre‑crash scenarios for (prospective) safety impact assessment (SIA) of DAS. The framework consists of two complementary components: 1) a novel approach for generating representative synthetic pre-crash scenarios, and 2) an assessment‑oriented framework for validating their representativeness.

Papers I and II present the proposed scenario generation approach that combines heterogeneous empirical data through model-based parameterization and weighting to construct reference pre-crash datasets. Synthetic scenarios are generated using parametric multivariate models and reweighted to match the reference distributions. The underlying generation logic can, in principle, be applied to conflict-based scenarios with or without collision, but the empirical implementation focuses on rear-end pre-crash scenarios with purely longitudinal dynamics, reflecting current limitations in available datasets.

To address the validation gap, Papers III and IV introduce a Bayesian Region of Practical Equivalence (ROPE)-based framework to assess whether synthetic pre-crash scenarios are practically equivalent to their real-world counterparts for SIA purposes. The framework emphasizes assessment-relevant metric selection, interpretable statistics, and explicitly defined equivalence criteria, and provides diagnostic insight into the sources and implications of non-equivalence.

Overall, the thesis contributes a transparent, reproducible methodology for generating representative synthetic rear-end pre-crash scenarios and a general, assessment-oriented framework for validating scenario representativeness, supporting more accurate and credible SIAs of DAS.

Safety Impact Assessment

Equivalence Testing

Synthetic Pre-Crash Scenario Generation

Data Combination

Virtual Safety Assessment

Driving Automation Systems

Virtual Development Laboratory (VDL), Tvärgata 4C, Chalmers
Opponent: Feng Guo, Virginia Tech, USA

Author

Jian Wu

Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Safety

Modeling Lead-Vehicle Kinematics for Rear-End Crash Scenario Generation

IEEE Transactions on Intelligent Transportation Systems,;Vol. 25(2024)p. 10866-10884

Journal article

Model-Based Generation of Representative Rear-End Crash Scenarios Across the Full Severity Range Using Pre-Crash Data

IEEE Transactions on Intelligent Transportation Systems,;Vol. 26(2025)p. 15932-15950

Journal article

Practical Equivalence Testing and its Application in Synthetic Pre-Crash Scenario Validation

Iavvc 2025 IEEE International Automated Vehicle Validation Conference Proceedings,;(2025)

Paper in proceeding

Driving automation systems, such as automatic emergency braking, are developing rapidly and are expected to help prevent or reduce the severity of traffic crashes. The safety performance of these systems is mainly evaluated using virtual simulations. For such virtual safety studies to be trustworthy, the simulated traffic situations must closely reflect what actually happens on real roads.

This thesis investigates how realistic and representative pre‑crash scenarios can be generated and validated for evaluating the safety impact of driving automation systems. It develops methods to combine different types of real‑world pre‑crash data to describe how rear‑end crashes typically develop, from minor contacts to severe injuries. Based on these data, large numbers of synthetic pre‑crash scenarios can be generated to reflect real‑world conditions across the full severity range.

A key contribution of this work is a validation framework that goes beyond assessing physical plausibility. Instead, it evaluates whether synthetic scenarios are similar enough to real‑world data for safety assessment, using transparent and interpretable statistical criteria.

Together, these methods improve the credibility of virtual safety impact assessments and help ensure that conclusions about the safety benefits of driving automation systems are based on scenarios that accurately reflect real traffic conditions, supporting more reliable system development, regulation, and deployment.

Improved quantitative driver behavior models and safety assessment methods for ADAS and AD (QUADRIS)

VINNOVA (2020-05156), 2021-04-01 -- 2024-03-31.

Areas of Advance

Transport

Subject Categories (SSIF 2025)

Probability Theory and Statistics

Computer Systems

Other Computer and Information Science

DOI

10.63959/chalmers.dt/5844

ISBN

978-91-8103-387-8

Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 5844

Publisher

Chalmers

Virtual Development Laboratory (VDL), Tvärgata 4C, Chalmers

Opponent: Feng Guo, Virginia Tech, USA

More information

Latest update

5/11/2026