Miquel Pericas
Showing 47 publications
SWEEP: Adaptive Task Scheduling for Exploring Energy Performance Trade-offs
Accelerating CNN inference on long vector architectures via co-design
Analysis and Characterization of Performance Variability for OpenMP Runtime
ODIN: Overcoming Dynamic Interference in iNference Pipelines
Challenges and Opportunities in the Co-design of Convolutions and RISC-V Vector Processors
JOSS: Joint Exploration of CPU-Memory DVFS and Task Scheduling for Energy Efficiency
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads
Shisha: Online Scheduling of CNN Pipelines on Heterogeneous Architectures
STEER: Asymmetry-aware Energy Efficient Task Scheduler for Cluster-based Multicore Architectures
ERASE: Energy Efficient Task Mapping and Resource Management for Work Stealing Runtimes
An online guided tuning approach to run CNN pipelines on edge devices
CBP: Coordinated management of cache partitioning, bandwidth partitioning and prefetch throttling
Vectorized Barrier and Reduction in LLVM OpenMP Runtime
LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing
Enhancing thread-level parallelism in asymmetric multicores using transparent instruction offloading
DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Enhancing Multithreaded Performance of Asymmetric Multicores with SIMD Offloading
QoS-driven coordinated management of resources to save energy in multi-core systems
SaC: Exploiting execution-time slack to save energy in heterogeneous multicore systems
Message from the general co-chairs - 2018 ACM International Conference on Computing Frontiers
LEGaTO: First Steps Towards Energy-Efficient Toolset for Heterogeneous Computing
LEGaTO: Towards Energy-Efficient, Secure, Fault-tolerant Toolset for Heterogeneous Computing
Global dead-block management for task-parallel programs
Elastic Places: An Adaptive Resource Manager for Scalable and Portable Performance
Trends in Data Locality Abstractions for HPC Systems
Runtime-Assisted Global Cache Management for Task-based Parallel Programs
SWAS: Stealing Work Using Approximate System-Load Information
RADAR: Run-time assisted Dead-Region Management for Last-Level Caches
Scaling FMM with data-driven OpenMP tasks on multicore architectures
Task Assembly Objects: a Cache-centric Execution Model and its Prototype Runtime Implementation
A Cache-centric Execution Model and Runtime for Deep Parallel Multicore Topologies
Self-Tuned Software-Managed Energy Reduction in Infiniband Links
RADAR: Runtime-assisted dead region management for last-level caches
A Case for Runtime-Assisted Global Cache Management
Scalable and Locality-aware Resource Management with Task Assembly Objects
RADAR: Runtime-Assisted Dead Region Management for Last-Level Caches
DAGViz: A DAG Visualization Tool for Analyzing Task Parallel Program Traces
Scalable analysis of multicore data reuse and sharing
Download publication list
You can download this list to your computer.
Filter and download publication list
As logged in user (Chalmers employee) you find more export functions in MyResearch.
You may also import these directly to Zotero or Mendeley by using a browser plugin. These are found herer:
Zotero Connector
Mendeley Web Importer
The service SwePub offers export of contents from Research in other formats, such as Harvard and Oxford in .RIS, BibTex and RefWorks format.
Showing 8 research projects
Pilot using Independent Local & Open Technologies (The European PILOT)
Principer för beräknande minnesenheter (PRIDE)
P4PIM: Principles of power-constrained HPC programming for PIM networks
Very Efficient Deep Learning in IOT (VEDLIoT)
Low-energy toolset for heterogeneous computing (LEGaTO)
ACE: Approximate Algorithms and Computing Systems