Active preference learning for ordering items in- and out-of-sample

Herman Bergström; Emil Carlsson; Devdatt Dubhashi; Fredrik Johansson

Active preference learning for ordering items in- and out-of-sample
Paper in proceeding, 2024

Learning an ordering of items based on pairwise comparisons is useful when items are difficult to rate consistently on an absolute scale, for example, when annotators have to make subjective assessments. When exhaustive comparison is infeasible, actively sampling item pairs can reduce the number of annotations necessary for learning an accurate ordering. However, many algorithms ignore shared structure between items, limiting their sample efficiency and precluding generalization to new items. It is also common to disregard how noise in comparisons varies between item pairs, despite it being informative of item similarity. In this work, we study active preference learning for ordering items with contextual attributes, both in- and out-of-sample. We give an upper bound on the expected ordering error of a logistic preference model as a function of which items have been compared. Next, we propose an active learning strategy that samples items to minimize this bound by accounting for aleatoric and epistemic uncertainty in comparisons. We evaluate the resulting algorithm, and a variant aimed at reducing model misspecification, in multiple realistic ordering tasks with comparisons made by human annotators. Our results demonstrate superior sample efficiency and generalization compared to non-contextual ranking approaches and active preference learning baselines.

Author

Herman Bergström

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Other publications Research

Emil Carlsson

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Other publications Research

Devdatt Dubhashi

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Other publications Research

Fredrik Johansson

Chalmers, Computer Science and Engineering (Chalmers), Data Science and AI

Other publications Research

Advances in Neural Information Processing Systems

10495258 (ISSN)

Vol. 37

38th Conference on Neural Information Processing Systems, NeurIPS 2024
Vancouver, Canada,

Causality and auxiliary information for efficient machine learning

Swedish Research Council (VR) (2022-04748), 2023-01-01 -- 2026-12-31.

Show Project

Subject Categories (SSIF 2025)

Computer and Information Sciences

More information

Latest update

4/1/2025 1

Active preference learning for ordering items in- and out-of-sample Paper in proceeding, 2024

Author

Herman Bergström

Emil Carlsson

Devdatt Dubhashi

Fredrik Johansson

Advances in Neural Information Processing Systems

Causality and auxiliary information for efficient machine learning

Subject Categories (SSIF 2025)

More information

Latest update

Active preference learning for ordering items in- and out-of-sample
Paper in proceeding, 2024