Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples
Preprint, 2022

Data subsampling has become widely recognized as a tool to overcome computational and economic bottlenecks in analyzing massive datasets and measurement-constrained experiments. However, traditional subsampling methods often suffer from the lack of information available at the design stage. We propose an active sampling strategy that iterates between estimation and data collection with optimal subsamples, guided by machine learning predictions on yet unseen data. The method is illustrated on virtual simulation-based safety assessment of advanced driver assistance systems. Substantial performance improvements were observed compared to traditional sampling methods.

adaptive importance sampling

inverse probability weighting

active learning

survey sampling

optimal design

Författare

Henrik Imberg

Chalmers, Matematiska vetenskaper, Tillämpad matematik och statistik

Xiaomi Yang

Chalmers, Mekanik och maritima vetenskaper, Fordonssäkerhet

Carol Ann Cook Flannagan

Chalmers, Mekanik och maritima vetenskaper, Fordonssäkerhet

Jonas Bärgman

Chalmers, Mekanik och maritima vetenskaper, Fordonssäkerhet

Supporting the interaction of Humans and Automated vehicles: Preparing for the Environment of Tomorrow (Shape-IT)

Europeiska kommissionen (EU) (EC/H2020/860410), 2019-10-01 -- 2023-09-30.

Improved quantitative driver behavior models and safety assessment methods for ADAS and AD (QUADRIS)

VINNOVA (2020-05156), 2021-04-01 -- 2024-03-31.

Styrkeområden

Transport

Ämneskategorier

Sannolikhetsteori och statistik

DOI

10.48550/arXiv.2212.10024

Mer information

Senast uppdaterat

2023-10-27