GeoRep - Resilient Storage for Wide Area Networks
Journal article, 2022
intermittently be powered on. After sending data from its sensors upstream, the system must therefore be able
to trust that the data, once acknowledged, is not lost. The purpose of this work is to propose a novel solution
for replicating data between the upstream nodes in such systems, with a minimal effect on the software
architecture. On the assumption that there is no relative order between replicated data tuples, we designed
a new replication protocol based on partial replication. Our protocol uses only 2 communication steps per
data tuple, instead of the 3 to 12 used by other solutions. We verified its failover mechanism in a proof-of-
concept implementation of the protocol using simulated network failures, and evaluated the implementation
on throughput and latency in several controlled experiments using up to 7 nodes in up to 5 geographically
separated areas, with up to 1000 data producers per node. The recorded system throughput increased linearly
relative to both the number of nodes and the number of data producers. For comparison, Paxos showed a
performance similar to our protocol when using 3 nodes, but got slower as more nodes were added. The lack
of a relative order, in combination with partial replication, enables our system to continue working during
network partitions, not only in the part containing the majority of the nodes, but also in any sufficiently large
minority partitions.
resilience
replication
distributed computing
Store-and-forward
availability
Author
Daniel Brahneborg
Braxo AB
Romaric Duvignau
Network and Systems
Wasif Afzal
Mälardalens högskola
Saad Mubeen
Mälardalens högskola
IEEE Access
2169-3536 (ISSN) 21693536 (eISSN)
Vol. 10 75772-75788Subject Categories
Computer Systems
DOI
10.1109/ACCESS.2022.3191686