Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs
Paper in proceeding, 2024
layer fusion
depthwise convolution
CNN
vision transformer
pointwise convolution
GPU
Author
Fareed Mohammad Qararyah
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
Muhammad Waqar Azhar
ZEROPOINT TECHNOLOGIES AB
Mohammad Ali Maleki
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
Pedro Petersen Moura Trancoso
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
ACM International Conference Proceeding Series
58-67
9798400718021 (ISBN)
Gotland, Sweden,
Very Efficient Deep Learning in IOT (VEDLIoT)
European Commission (EC) (EC/H2020/957197), 2020-11-01 -- 2023-10-31.
EPI SGA2
European Commission (EC) (101036168), 2022-01-01 -- 2024-12-31.
Subject Categories
Computer Science
DOI
10.1145/3677333.3678153