Adaptive Testing for LLM-Based Applications: A Diversity-Based Approach
Paper i proceeding, 2025

The recent surge of building software systems powered by Large Language Models (LLMs) has led to the development of various testing frameworks, primarily focused on treating prompt templates as the unit of testing. Despite the significant costs associated with test input execution and output assessment, the curation of optimized test suites is yet overlooked in these tools, which calls for tailored test selection or prioritization strategies. In this paper, we show that diversity-based testing techniques, such as Adaptive Random Testing (ART) with appropriate string distance metrics, can be effectively applied to the testing of prompt templates. Our proposed adaptive testing approach adjusts the conventional ART process to this context by selecting new test inputs based on scores derived from existing test suite and their labelling results. Our results, obtained using various implementations that explore several string-based distances, confirm that our approach enables the discovery of failures with reduced testing budgets and promotes the generation of more varied outputs.

LLM Testing

Test Prioritization

Test Selection

LLM Applications

Adaptive Random Testing

Författare

Juyeon Yoon

Korea Advanced Institute of Science and Technology (KAIST)

Robert Feldt

Chalmers, Data- och informationsteknik, Software Engineering

Shin Yoo

Korea Advanced Institute of Science and Technology (KAIST)

2025 IEEE International Conference on Software Testing, Verification and Validation Workshops, ICSTW 2025

375-382
9798331534677 (ISBN)

18th IEEE International Conference on Software Testing, Verification and Validation Workshops, ICSTW 2025
Naples, Italy,

Automatiserad testning av gränser för kvalitet på AI/ML modeller (AQUAS)

Vetenskapsrådet (VR) (2020-05272), 2021-01-01 -- 2024-12-31.

Ämneskategorier (SSIF 2025)

Datavetenskap (datalogi)

DOI

10.1109/ICSTW64639.2025.10962467

Mer information

Senast uppdaterat

2025-05-20