A Multi-model Approach forĀ Video Data Retrieval inĀ Autonomous Vehicle Development
Paper in proceeding, 2025

Autonomous driving software generates enormous amounts of data every second, which software development organizations save for future analysis and testing in the form of logs. However, given the vast size of this data, locating specific scenarios within a collection of vehicle logs can be challenging. Writing the correct SQL queries to find these scenarios requires engineers to have a strong background in SQL and the specific databases in question, further complicating the search process. This paper presents and evaluates a pipeline that allows searching for specific scenarios in log collections using natural language descriptions instead of SQL. The generated descriptions were evaluated by engineers working with vehicle logs at the Zenseact on a scale from 1 to 5. Our approach achieved a mean score of 3.3, demonstrating the potential of using a multi-model architecture to improve the software development workflow. We also present an interface that can visualize the query process and visualize the results.

Large Language Models (LLMs)

Data Retrieval

Autonomous Vehicles

Multi-Modal Models

Author

Jesper Knapp

Student at Chalmers

Klas Moberg

Student at Chalmers

Yuchuan Jin

Zenseact AB

Simin Sun

Chalmers, Computer Science and Engineering (Chalmers), Interaction Design and Software Engineering

Miroslaw Staron

Chalmers, Computer Science and Engineering (Chalmers), Software Engineering (Chalmers)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

03029743 (ISSN) 16113349 (eISSN)

Vol. 15453 LNCS 35-49
9783031783913 (ISBN)

25th International Conference on Product-Focused Software Process Improvement, PROFES 2024
Tartu, Estonia,

Subject Categories

Software Engineering

DOI

10.1007/978-3-031-78392-0_3

More information

Latest update

12/16/2024