Pedro Petersen Moura Trancoso
Showing 35 publications
An Efficient Hybrid Deep Learning Accelerator for Compact and Heterogeneous CNNs
Scratchpad Memory Management for Deep Learning Accelerators
Fusing Depthwise and Pointwise Convolutions for Efficient Inference on GPUs
VEDLIoT: Next generation accelerated AIoT systems and applications
eProcessor: European, Extendable, Energy-Efficient, Extreme-Scale, Extensible, Processor Ecosystem
Evaluation of heterogeneous AIoT Accelerators within VEDLIoT
A Scalable, Heterogeneous Hardware Platform for Accelerated AIoT based on Microservers
RAINBOW: Multi-Dimensional Hardware-Software Co-Design for DL Accelerator On-Chip Memory
ARADA: Adaptive Resource Allocation for Improving Energy Efficiency in Deep Learning Accelerators
Exploiting the Potential of Flexible Processing Units
Introduction to the Special Section on FPL 2020
VEDLIoT: Very Efficient Deep Learning in IoT
FiBHA: Fixed Budget Hybrid CNN Accelerator
VSA: A Hybrid Vector-Systolic Architecture
Reliability Analysis of Compressed CNNs
LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing
Hybrid2: Combining Caching and Migration in Hybrid Memory Systems
Mapping Multiple LSTM models on FPGAs
LLC-guided data migration in hybrid memory systems
Energy-efficient Runtime Management of Heterogeneous Multicores using Online Projection
AVR: Reducing Memory Traffic with Approximate Value Reconstruction
Time-SWAD: A dataflow engine for time-based single window stream aggregation
Decoupled fused cache: Fusing a decoupled LLC with a DRAM cache
FusionCache: Using LLC tags for DRAM cache
LEGaTO: First Steps Towards Energy-Efficient Toolset for Heterogeneous Computing
LEGaTO: Towards Energy-Efficient, Secure, Fault-tolerant Toolset for Heterogeneous Computing
PHOENIX: Efficient computation in memory
Single Window Stream Aggregation using Reconfigurable Hardware
Auto-tuning Static Schedules for Task Data-flow Applications
Using personality metrics to improve cache interference management in multicore processors
Heterogeneous- and NUMA-aware scheduling for many-core architectures
SWAS: Stealing Work Using Approximate System-Load Information
SWITCHES: A Lightweight Runtime for Dataflow Execution of Tasks on Many-Cores
Odd-ECC: On-demand DRAM error correcting codes
Low-Cost Sub-5W Processors for Edge HPC
Download publication list
You can download this list to your computer.
Filter and download publication list
As logged in user (Chalmers employee) you find more export functions in MyResearch.
You may also import these directly to Zotero or Mendeley by using a browser plugin. These are found herer:
Zotero Connector
Mendeley Web Importer
The service SwePub offers export of contents from Research in other formats, such as Harvard and Oxford in .RIS, BibTex and RefWorks format.
Showing 8 research projects
AutoPIM,: Effektiv accelerator för autonoma fordon
Pilot using Independent Local & Open Technologies (The European PILOT)
Principer för beräknande minnesenheter (PRIDE)
Very Efficient Deep Learning in IOT (VEDLIoT)
PRIME: Principled Designs of Processing-in-Memory Parallel Systems