A Multi-model Approach for Video Data Retrieval in Autonomous Vehicle Development
Paper i proceeding, 2025

Autonomous driving software generates enormous amounts of data every second, which software development organizations save for future analysis and testing in the form of logs. However, given the vast size of this data, locating specific scenarios within a collection of vehicle logs can be challenging. Writing the correct SQL queries to find these scenarios requires engineers to have a strong background in SQL and the specific databases in question, further complicating the search process. This paper presents and evaluates a pipeline that allows searching for specific scenarios in log collections using natural language descriptions instead of SQL. The generated descriptions were evaluated by engineers working with vehicle logs at the Zenseact on a scale from 1 to 5. Our approach achieved a mean score of 3.3, demonstrating the potential of using a multi-model architecture to improve the software development workflow. We also present an interface that can visualize the query process and visualize the results.

Large Language Models (LLMs)

Data Retrieval

Autonomous Vehicles

Multi-Modal Models

Författare

Jesper Knapp

Student vid Chalmers

Klas Moberg

Student vid Chalmers

Yuchuan Jin

Zenseact AB

Simin Sun

Chalmers, Data- och informationsteknik, Interaktionsdesign och Software Engineering

Miroslaw Staron

Chalmers, Data- och informationsteknik, Software Engineering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

03029743 (ISSN) 16113349 (eISSN)

Vol. 15453 LNCS 35-49
9783031783913 (ISBN)

25th International Conference on Product-Focused Software Process Improvement, PROFES 2024
Tartu, Estonia,

Ämneskategorier

Programvaruteknik

DOI

10.1007/978-3-031-78392-0_3

Mer information

Senast uppdaterat

2024-12-16