A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development

Jesper Knapp; Klas Moberg; Yuchuan Jin; Simin Sun; Miroslaw Staron

doi:10.1007/978-3-031-78392-0_3

A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development
Paper i proceeding, 2025

Autonomous driving software generates enormous amounts of data every second, which software development organizations save for future analysis and testing in the form of logs. However, given the vast size of this data, locating specific scenarios within a collection of vehicle logs can be challenging. Writing the correct SQL queries to find these scenarios requires engineers to have a strong background in SQL and the specific databases in question, further complicating the search process. This paper presents and evaluates a pipeline that allows searching for specific scenarios in log collections using natural language descriptions instead of SQL. The generated descriptions were evaluated by engineers working with vehicle logs at the Zenseact on a scale from 1 to 5. Our approach achieved a mean score of 3.3, demonstrating the potential of using a multi-model architecture to improve the software development workflow. We also present an interface that can visualize the query process and visualize the results.

Large Language Models (LLMs)

Data Retrieval

Autonomous Vehicles

Multi-Modal Models

Författare

Jesper Knapp

Student vid Chalmers

Klas Moberg

Student vid Chalmers

Forskning Andra publikationer

Yuchuan Jin

Zenseact AB

Simin Sun

Chalmers, Data- och informationsteknik, Interaktionsdesign och Software Engineering

Forskning Andra publikationer

Miroslaw Staron

Chalmers, Data- och informationsteknik, Software Engineering

Forskning Andra publikationer

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

03029743 (ISSN) 16113349 (eISSN)

Vol. 15453 LNCS 35-49
9783031783913 (ISBN)

25th International Conference on Product-Focused Software Process Improvement, PROFES 2024
Tartu, Estonia,

Ämneskategorier (SSIF 2011)

Programvaruteknik

DOI

10.1007/978-3-031-78392-0_3

Publikationsdata kopplat till DOI

Mer information

Senast uppdaterat

2024-12-16

A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development Paper i proceeding, 2025

Författare

Jesper Knapp

Klas Moberg

Yuchuan Jin

Simin Sun

Miroslaw Staron

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Ämneskategorier (SSIF 2011)

DOI

Mer information

Senast uppdaterat

A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development
Paper i proceeding, 2025