Resource-Efficient Sensor Fusion at the Edge via System-wide Dynamic Gated Neural Networks
Artikel i vetenskaplig tidskrift, 2025

Next-generation mobile systems will support multiple AI-based applications, each leveraging heterogeneous sensors and data sources through deep neural network (DNN) architectures collaboratively executed within the network. In this context, to minimize the cost of the AI inference task subject to requirements on latency, quality, and – crucially – reliability of the inference process, it is vital to optimize (i) the set of sensors/data sources and (ii) the DNN architecture, (iii) the network nodes executing sections of the DNN, and (iv) the resources to use. To achieve these goals, we leverage dynamic gated neural networks with branches, and propose a novel algorithmic strategy called Quantile-constrained Inference (QIC), based upon quantile-Constrained policy optimization. QIC makes joint, high-quality, swift decisions on all the above aspects of the system, with the aim to minimize inference energy cost. We remark that this is the first contribution connecting gated dynamic DNNs with infrastructure-level decision making. We evaluate QIC using a dynamic gated DNN with stems and branches for optimal sensor fusion and inference, trained on the RADIATE dataset offering Radar, LiDAR, and Camera data, and real-world wireless measurements. Our results confirm that QIC closely matches the optimum and outperforms existing approaches in reducing energy consumption (compute, communication, and total) and application requirements failure by over 70%.

Inference in the mobile-edge continuum

Network support to machine learning

Energy efficiency

Dynamic deep neural networks

Författare

C. Singhal

Institut National de Recherche en Informatique et en Automatique (INRIA)

Y. Wu

University of California

Francesco Malandrino

Consiglo Nazionale Delle Richerche

Sharon G.L. Contreras

University of California

Marco Levorato

University of California

Carla Fabiana Chiasserini

Consiglo Nazionale Delle Richerche

Politecnico di Torino

Nätverk och System

IEEE Transactions on Mobile Computing

1536-1233 (ISSN) 15580660 (eISSN)

Vol. In Press

Styrkeområden

Informations- och kommunikationsteknik

Ämneskategorier (SSIF 2025)

Datavetenskap (datalogi)

Datorsystem

DOI

10.1109/TMC.2025.3586882

Mer information

Senast uppdaterat

2025-07-21