GeneaLog: Fine-Grained Data Streaming Provenance at the Edge

Dimitrios Palyvos-Giannas; Vincenzo Massimiliano Gulisano; Marina Papatriantafilou

doi:10.1145/3274808.3274826

Paper in proceedings, 2018

Fine-grained data provenance in data streaming allows linking each result tuple back to the source data that contributed to it, something beneficial for many applications (e.g., to find the conditions triggering a security- or safety-related alert). Further, when data transmission or storage has to be minimized, as in edge computing and cyber-physical systems, it can help in identifying the source data to be prioritized.
The memory and processing costs of fine-grained data provenance, possibly afforded by high-end servers, can be prohibitive for the resource-constrained devices deployed in edge computing and cyber-physical systems. Motivated by this challenge, we present GeneaLog, a novel fine-grained data provenance technique for data streaming applications. Leveraging the logical dependencies of the data, GeneaLog takes advantage of cross-layer properties of the software stack and incurs a minimal, constant-size per-tuple overhead. Furthermore, it allows for a modular and efficient algorithmic implementation using only standard data streaming operators. This is particularly useful for distributed streaming applications, since the provenance processing can be executed at separate nodes, orthogonal to the data processing. We evaluate an implementation of GeneaLog using vehicular and smart grid applications, confirming that it efficiently captures fine-grained provenance data with minimal overhead.
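
To make the idea of constant-size per-tuple provenance metadata more concrete, the following is a minimal sketch in Python. It is not taken from the paper or this record. It assumes a simplified model in which every tuple carries a fixed set of references (`u1`, `u2`, `nxt`) and a `kind` tag, and in which an aggregate points only to the first and last tuple of its window instead of listing all of them. The class name, field names, and helper functions are all hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional


@dataclass(eq=False)  # identity comparison; avoids deep recursive equality
class StreamTuple:
    """A stream tuple with a fixed number of provenance fields (assumed layout)."""
    value: Any
    kind: str = "SOURCE"                   # "SOURCE", "MAP", or "AGGREGATE"
    u1: Optional["StreamTuple"] = None     # first contributing tuple
    u2: Optional["StreamTuple"] = None     # last contributing tuple (aggregates)
    nxt: Optional["StreamTuple"] = None    # next tuple in the same input stream


def map_op(t: StreamTuple, fn: Callable[[Any], Any]) -> StreamTuple:
    """One-to-one operator: the output references its single input."""
    return StreamTuple(fn(t.value), kind="MAP", u1=t)


def aggregate(window: List[StreamTuple],
              fn: Callable[[List[Any]], Any]) -> StreamTuple:
    """Window operator: chain the inputs, keep only the two end references."""
    for a, b in zip(window, window[1:]):
        a.nxt = b
    return StreamTuple(fn([t.value for t in window]),
                       kind="AGGREGATE", u1=window[0], u2=window[-1])


def provenance(t: StreamTuple) -> List[StreamTuple]:
    """Walk the references back to the source tuples that contributed to t."""
    if t.kind == "SOURCE":
        return [t]
    if t.kind == "MAP":
        return provenance(t.u1)
    # AGGREGATE: follow the chain from u1 until u2 has been visited.
    sources: List[StreamTuple] = []
    cur = t.u1
    while cur is not None:
        sources.extend(provenance(cur))
        if cur is t.u2:
            break
        cur = cur.nxt
    return sources


if __name__ == "__main__":
    readings = [StreamTuple(v) for v in (10, 20, 30)]
    doubled = [map_op(r, lambda x: x * 2) for r in readings]
    total = aggregate(doubled, sum)
    print(total.value, [s.value for s in provenance(total)])
```

In this sketch the metadata attached to each tuple does not grow with the window size, which is the property the abstract highlights. How GeneaLog actually represents and traverses these dependencies is described in the paper itself.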

Data streaming

Edge architectures

Fine-grained data provenance

Authors

Dimitrios Palyvos-Giannas

Chalmers, Computer Science and Engineering, Networks and Systems

Vincenzo Massimiliano Gulisano

Chalmers, Computer Science and Engineering, Networks and Systems

Marina Papatriantafilou

Chalmers, Computer Science and Engineering, Networks and Systems

Middleware '18 Proceedings of the 19th International Middleware Conference

227-238
978-1-4503-5702-9 (ISBN)

19th ACM/IFIP/USENIX International Middleware Conference, Middleware 2018
Rennes, France

Cloud-based products and production (FiC)

Swedish Foundation for Strategic Research (SSF) (GMT14-0032), 2016-01-01 -- 2020-12-31.

HAREN: Self-deploying and adaptive data streaming analytics in the fog

Swedish Research Council (VR) (2016-03800), 2017-01-01 -- 2020-12-31.

STAMINA - GE

Göteborg Energi AB, 2017-01-01 -- 2021-12-31.

INDEED

Chalmers, 2016-01-01 -- 2020-12-31.

Subject categories (SSIF 2011)

Computer Engineering

Computer Science

Computer Systems

Areas of Advance

Information and Communication Technology

Energy

Subject categories (SSIF 2025)

Security, Privacy and Cryptography

DOI

10.1145/3274808.3274826

More information

Last updated

2025-06-27
