ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators
Paper in proceedings, 2023
This paper proposes ARADA, an adaptive resource allocation scheme for deep learning applications, with the goal of improving the energy efficiency of deep learning accelerators. This is achieved through layer-by-layer resource allocation. The rationale is that each layer in a DL model has unique compute and memory bandwidth requirements, so allocating fixed resources to all layers leads to inefficiencies. Allocating resources per layer (e.g., voltage-frequency and memory bandwidth) saves energy without sacrificing performance. Experimental results show that applying ARADA to the execution of 9 state-of-the-art CNN models yields energy savings of 38% on average compared to race-to-idle for an Edge TPU coupled with LPDDR4 off-chip memory.
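The per-layer allocation idea can be sketched as follows. This is a minimal illustration of choosing, for each layer, the lowest-power voltage-frequency (V-F) point that still meets that layer's latency budget; all names, V-F points, and layer profiles are hypothetical and not the paper's actual algorithm or data.

```python
# Hedged sketch of per-layer V-F selection in the spirit of ARADA.
# V-F points and layer profiles below are invented for illustration.

# Available V-F operating points: (frequency in MHz, relative power),
# sorted from lowest to highest power.
VF_POINTS = [(250, 0.2), (500, 0.45), (750, 0.7), (1000, 1.0)]

def pick_vf(layer_cycles, deadline_ms):
    """Return the lowest-power V-F point whose latency meets the deadline."""
    for freq_mhz, power in VF_POINTS:
        latency_ms = layer_cycles / (freq_mhz * 1e3)  # cycles / (cycles per ms)
        if latency_ms <= deadline_ms:
            return freq_mhz, power
    return VF_POINTS[-1]  # no point meets the deadline; run as fast as possible

# A compute-heavy layer needs a high frequency; a lighter (e.g. memory-bound)
# layer can meet the same deadline at a lower, more energy-efficient point.
layers = [("conv1", 2_000_000, 4.0), ("dwconv2", 300_000, 4.0)]
plan = {name: pick_vf(cycles, deadline) for name, cycles, deadline in layers}
```

A fixed allocation would run both layers at the fastest point; the per-layer plan drops `dwconv2` to a lower V-F point with no deadline violation, which is the source of the energy savings the abstract describes.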
Keywords
Energy Efficiency
Resource Allocation
CNNs
Accelerators
Author
Muhammad Waqar Azhar
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
Stavroula Zouzoula
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
Pedro Petersen Moura Trancoso
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
Proceedings of the 20th ACM International Conference on Computing Frontiers 2023, CF 2023
63-72
979-8-4007-0140-5 (ISBN)
Bologna, Spain
Very Efficient Deep Learning in IoT (VEDLIoT)
European Commission (EC) (EC/H2020/957197), 2020-11-01 -- 2023-10-31.
Subject Categories
Energy Systems
Computer Science
Computer Systems
DOI
10.1145/3587135.3592207