TinTiN: Travelling in time (if necessary) to deal with out-of-order data in streaming aggregation
Paper i proceeding, 2020
The trade-offs in processing performance and results' accuracy are nonetheless application-dependent. While some applications need strict deterministic execution, others can value fast (but possibly approximated) answers.
Despite the existing literature on how to relax and trade strict determinism for efficiency or deadlines, we lack a formal characterization of levels of determinism, needed by industries to assess whether or not such trade-offs are acceptable.
To bridge the gap, we introduce the notion of D-bounded eventual determinism, where D is the maximum out-of-order delay of the input data.
We design and implement TinTiN, a streaming middleware that can be used in combination with user-defined streaming applications, to provably enforce D-bounded eventual determinism.
We evaluate TinTiN with a real-world streaming application for Advanced Metering Infrastructure (AMI) monitoring, showing it provides an order of magnitude improvement in processing performance, while minimizing delays in output generation, compared to a state-of-the-art strictly deterministic solution that waits for time proportional to D, for each input tuple, before generating output that depends on it.
Advanced Metering Infrastructure
Data streaming
Smart Meter
Författare
Joris Van Rooij
Chalmers, Data- och informationsteknik, Nätverk och system
Göteborg Energi AB
Vincenzo Massimiliano Gulisano
Chalmers, Data- och informationsteknik, Nätverk och system
Marina Papatriantafilou
Chalmers, Data- och informationsteknik, Nätverk och system
DEBS 2020 - Proceedings of the 14th ACM International Conference on Distributed and Event-Based Systems
141-152
Montreal, Canada,
STAMINA - GE
Göteborg Energi, Forskningsstiftelsen, 2017-01-01 -- 2021-12-31.
INDEED: Information and Data-processing in Focus for Energy Efficiency
Chalmers, 2020-01-01 -- .
Molnbaserade produkter och produktion (FiC)
Stiftelsen för Strategisk forskning (SSF) (GMT14-0032), 2016-01-01 -- 2020-12-31.
HAREN: Självdistribuerad och anpassningsbar dataströmningsanalys i dimman
Vetenskapsrådet (VR) (2016-03800), 2017-01-01 -- 2020-12-31.
STAMINA -WASP
Wallenberg AI, Autonomous Systems and Software Program, 2016-04-04 -- 2020-04-06.
Ämneskategorier
Datorteknik
Datavetenskap (datalogi)
Datorsystem
Styrkeområden
Energi
DOI
10.1145/3401025.3401769