Off-Policy Evaluation with Out-of-Sample Guarantees
Journal article, 2023

We consider the problem of evaluating the performance of a decision policy using past observational data. The outcome of a policy is measured in terms of a loss (a.k.a. disutility or negative reward), and the main problem is to make valid inferences about its out-of-sample loss when the past data were observed under a different, and possibly unknown, policy. Using a sample-splitting method, we show that it is possible to draw such inferences with finite-sample coverage guarantees for the entire loss distribution, rather than just its mean. Importantly, the method takes into account model misspecifications of the past policy, including unmeasured confounding. The evaluation method can be used to certify the performance of a policy using observational data under a specified range of credible model assumptions.
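To make the sample-splitting idea concrete, the following is a minimal, hypothetical sketch of a weighted split-conformal upper bound on a target policy's out-of-sample loss, using importance weights between the target and logging policies. All data, policies, and variable names are illustrative inventions, the logging policy is assumed known here, and the sketch omits the paper's robustness to misspecification and unmeasured confounding.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic observational data: contexts X, binary actions A drawn from a
# known logging policy, and nonnegative losses L. Purely illustrative.
n = 2000
X = rng.normal(size=n)
p_log = 1.0 / (1.0 + np.exp(-X))             # P(A=1 | X) under the past policy
A = rng.binomial(1, p_log)
L = (A - (X > 0)) ** 2 + 0.1 * rng.normal(size=n) ** 2

# Target policy to evaluate: a deterministic threshold rule.
a_target = (X > 0).astype(int)

# Importance weights pi_target(A|X) / pi_log(A|X); zero where the observed
# action disagrees with the deterministic target policy.
p_obs = np.where(A == 1, p_log, 1.0 - p_log)
w = (A == a_target).astype(float) / p_obs

# Sample splitting: hold out a calibration half to form the quantile bound.
idx = rng.permutation(n)
cal = idx[: n // 2]

alpha = 0.1
# Weighted empirical (1 - alpha) quantile of the calibration losses: an
# upper bound on the target policy's loss, valid in finite samples when
# the weights are correct (no confounding correction in this sketch).
order = np.argsort(L[cal])
w_sorted = w[cal][order]
L_sorted = L[cal][order]
cum = np.cumsum(w_sorted) / (np.sum(w_sorted) + np.max(w))
k = np.searchsorted(cum, 1.0 - alpha)
bound = L_sorted[min(k, len(L_sorted) - 1)]
print(f"90% upper bound on target-policy loss: {bound:.3f}")
```

The denominator adds the largest weight to the normalizer, mimicking the conformal correction for the as-yet-unseen test point; with uniform weights this reduces to ordinary split conformal prediction.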

Authors

Sofia Ek

Uppsala University

Dave Zachariah

Uppsala University

Fredrik Johansson

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

University of Gothenburg

Petre Stoica

Uppsala University

Transactions on Machine Learning Research

2835-8856 (eISSN)

Vol. 2023

Subject Categories (SSIF 2025)

Probability Theory and Statistics

More information

Latest update

11/17/2025