V-Mon: Scalable and Fault-Tolerant Stream Processing Pipeline for Monitoring Vehicular Data Validity
Paper i proceeding, 2025

With constant increases in edge devices in industry settings, increases in data rates naturally follow. However, with high, unbounded data rates, traditional (store-then-process) database procedures and batch-based processing are struggling to remain performant. To this end, processing streams of data continuously is an increasingly appealing approach, targeting low latency, high scalability and real-time data processing. This work examines design considerations as well as performance trade-offs for a stream processing pipeline targeting stateful analysis. The pipeline implementation employs Apache Kafka, Apache Flink and Apache Druid, and is studied through an example use case at Volvo Trucks, focusing on signal data set validity analysis. Performance evaluation of the pipeline reveals that the throughput requirements of the use case are satisfied, while also achieving sub-second latencies and offering a degree of fault tolerance. The pipeline also shows promise of adapting well to different levels of scale, providing enough headroom for a tenfold increase in data volumes over current demands. Further, the extensible nature of the pipeline enables the support of various feature extraction methods, e.g., data synopsis and sketching, and alternative data representations, e.g., knowledge graphs.

latency & throughput

data completeness

stream processing

scalability

data validation

fault tolerance

Data pipelines

Författare

Carl Magnus Wall

Volvo Group

Måns Josefsson

Student vid Chalmers

Martin Hilgendorf

Chalmers, Data- och informationsteknik, Dator- och nätverkssystem

Marina Papatriantafilou

Chalmers, Data- och informationsteknik, Dator- och nätverkssystem

Binay Mishra

Volvo Group

Debs 2025 Proceedings of the 19th ACM International Conference on Distributed and Event Based Systems

249-250
9798400713323 (ISBN)

19th ACM International Conference on Distributed and Event-Based Systems, DEBS 2025
Gothenburg, Sweden,

VR EPITOME - Sammanfattning och strukturering av kontinuerlig data i pipelines för samtidig behandling

Vetenskapsrådet (VR) (2021-05424), 2022-01-01 -- 2025-12-31.

Ämneskategorier (SSIF 2025)

Datavetenskap (datalogi)

Datorteknik

Datorsystem

DOI

10.1145/3701717.3733228

Mer information

Senast uppdaterat

2025-08-25