An empirical investigation of challenges of specifying training data and runtime monitors for critical software with machine learning and their relation to architectural decisions
Artikel i vetenskaplig tidskrift, 2024

The development and operation of critical software that contains machine learning (ML) models requires diligence and established processes. Especially the training data used during the development of ML models have major influences on the later behaviour of the system. Runtime monitors are used to provide guarantees for that behaviour. Runtime monitors for example check that the data at runtime is compatible with the data used to train the model. In a first step towards identifying challenges when specifying requirements for training data and runtime monitors, we conducted and thematically analysed ten interviews with practitioners who develop ML models for critical applications in the automotive industry. We identified 17 themes describing the challenges and classified them in six challenge groups. In a second step, we found interconnection between the challenge themes through an additional semantic analysis of the interviews. We explored how the identified challenge themes and their interconnections can be mapped to different architecture views. This step involved identifying relevant architecture views such as data, context, hardware, AI model, and functional safety views that can address the identified challenges. The article presents a list of the identified underlying challenges, identified relations between the challenges and a mapping to architecture views. The intention of this work is to highlight once more that requirement specifications and system architecture are interlinked, even for AI-specific specification challenges such as specifying requirements for training data and runtime monitoring.

Författare

Hans-Martin Heyn

Göteborgs universitet

Software Engineering 1

Eric Knauss

Chalmers, Data- och informationsteknik, Interaktionsdesign och Software Engineering

Göteborgs universitet

Iswarya Malleswaran

Student vid Chalmers

Shruthi Dinakaran

Student vid Chalmers

Requirements Engineering

0947-3602 (ISSN) 1432-010X (eISSN)

Vol. 29 1 97-117

Very Efficient Deep Learning in IOT (VEDLIoT)

Europeiska kommissionen (EU) (EC/H2020/957197), 2020-11-01 -- 2023-10-31.

Styrkeområden

Informations- och kommunikationsteknik

Ämneskategorier (SSIF 2011)

Data- och informationsvetenskap

DOI

10.1007/s00766-024-00415-4

Relaterade dataset

Replication Data for: An investigation of challenges encountered when specifying training data and runtime monitors for safety critical ML applications [dataset]

DOI: 10.7910/DVN/WJ8TKY URI: https://doi.org/10.7910/DVN/WJ8TKY

Mer information

Senast uppdaterat

2025-01-10