Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples
Preprint, 2022

Data subsampling has become widely recognized as a tool to overcome computational and economic bottlenecks in analyzing massive datasets and measurement-constrained experiments. However, traditional subsampling methods often suffer from the lack of information available at the design stage. We propose an active sampling strategy that iterates between estimation and data collection with optimal subsamples, guided by machine learning predictions on yet unseen data. The method is illustrated on virtual simulation-based safety assessment of advanced driver assistance systems. Substantial performance improvements were observed compared to traditional sampling methods.

adaptive importance sampling

inverse probability weighting

active learning

survey sampling

optimal design

Author

Henrik Imberg

Chalmers, Mathematical Sciences, Applied Mathematics and Statistics

Xiaomi Yang

Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Safety

Carol Ann Cook Flannagan

Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Safety

Jonas Bärgman

Chalmers, Mechanics and Maritime Sciences (M2), Vehicle Safety

Supporting the interaction of Humans and Automated vehicles: Preparing for the Environment of Tomorrow (Shape-IT)

European Commission (EC) (EC/H2020/860410), 2019-10-01 -- 2023-09-30.

Improved quantitative driver behavior models and safety assessment methods for ADAS and AD (QUADRIS)

VINNOVA (2020-05156), 2021-04-01 -- 2024-03-31.

Areas of Advance

Transport

Subject Categories

Probability Theory and Statistics

DOI

10.48550/arXiv.2212.10024

More information

Latest update

10/27/2023