Runtime Monitoring and Enforcement of Conditional Fairness in Generative AIs
Paper i proceeding, 2026

The deployment of generative AI (GenAI) models raises significant fairness concerns, addressed in this paper through novel characterization and enforcement techniques specific to GenAI. Unlike standard AI performing specific tasks, GenAI’s broad functionality requires “conditional fairness” tailored to the context being generated, such as demographic fairness in generating images of poor people versus successful business leaders. We define two fairness levels: the first evaluates fairness in generated outputs, independent of prompts and models; the second assesses inherent fairness with neutral prompts. Given the complexity of GenAI and challenges in fairness specifications, we focus on bounding the worst case, considering a GenAI system unfair if the distance between appearances of a specific group exceeds preset thresholds. We also explore combinatorial testing for assessing relative completeness in intersectional fairness. By bounding the worst case, we develop a prompt injection scheme within an agent-based framework to enforce conditional fairness with minimal intervention, validated on state-of-the-art GenAI systems.

conditional fairness

monitoring and enforcement

generative AI

Författare

Chih-Hong Cheng

Chalmers, Data- och informationsteknik, Interaktionsdesign och Software Engineering

Carl von Ossietzky Universität Oldenburg

Changshun Wu

Université Grenoble Alpes

Xingyu Zhao

The University of Warwick

Saddek Bensalem

CSX-AI

Harald Ruess

SRI International

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

03029743 (ISSN) 16113349 (eISSN)

Vol. 16087 LNCS 73-91
9783032054340 (ISBN)

25th International Conference on Runtime Verification, RV 2025
Graz, Austria,

Ämneskategorier (SSIF 2025)

Datavetenskap (datalogi)

Systemvetenskap, informationssystem och informatik

DOI

10.1007/978-3-032-05435-7_5

Mer information

Senast uppdaterat

2025-10-10