Tensor Compression and Reconstruction in Split DNN for Real-time Object Detection at the Edge
Paper i proceeding, 2024

Computer vision applications for UAVs often rely on deep neural networks (DNNs) to increase prediction accuracy. As such DNNs crave for computational resources that can be hardly matched by those available at the UAVs, an emerging solution is to split the DNN into a low-complexity head model run at the UAV and a heavier tail run at the edge. This approach, however, comes at the cost of transmitting a large tensor data, hence of high bandwidth consumption, on the UAV-edge radio link. We tackle this problem by proposing the Compressed Tensor-based DNN split (CoTeD) framework, which executes tensor compression at the UAV and reconstruction at the edge, while conveniently trading off tensor compression with quality of the computer vision task output. When compared with the no-split case, CoTeD reduces the UAV computational burden by 50% w.r.t. performing inference at the UAV only, and the amount of transmitted data by over one order of magnitude w.r.t. running inference at the edge only. When compared to compressive sensing, JPEG-100, and the whole DNN run at the edge, CoTeD decreases the overall latency by (resp.) 95%, 75%, and 80%.

UAVs

Edge computing

Bandwidth utilization

Författare

Yenchia Yu

Politecnico di Torino

Marco Levorato

University of California at Irvine (UCI)

Carla Fabiana Chiasserini

Politecnico di Torino

Nätverk och System

2024 IEEE International Mediterranean Conference on Communications and Networking, MeditCom 2024

149-154
9798350309485 (ISBN)

2024 IEEE International Mediterranean Conference on Communications and Networking, MeditCom 2024
Madrid, Spain,

Ämneskategorier (SSIF 2025)

Datorgrafik och datorseende

Signalbehandling

Datorsystem

DOI

10.1109/MeditCom61057.2024.10621382

Mer information

Senast uppdaterat

2025-05-20