Miquel Pericas
Showing 47 publications
SWEEP: Adaptive Task Scheduling for Exploring Energy Performance Trade-offs
JOSS: Joint Exploration of CPU-Memory DVFS and Task Scheduling for Energy Efficiency
Analysis and Characterization of Performance Variability for OpenMP Runtime
Challenges and Opportunities in the Co-design of Convolutions and RISC-V Vector Processors
Accelerating CNN inference on long vector architectures via co-design
ODIN: Overcoming Dynamic Interference in iNference Pipelines
At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads
Shisha: Online Scheduling of CNN Pipelines on Heterogeneous Architectures
STEER: Asymmetry-aware Energy Efficient Task Scheduler for Cluster-based Multicore Architectures
ERASE: Energy Efficient Task Mapping and Resource Management for Work Stealing Runtimes
CBP: Coordinated management of cache partitioning, bandwidth partitioning and prefetch throttling
Vectorized Barrier and Reduction in LLVM OpenMP Runtime
An online guided tuning approach to run CNN pipelines on edge devices
Enhancing thread-level parallelism in asymmetric multicores using transparent instruction offloading
LEGaTO: Low-Energy, Secure, and Resilient Toolset for Heterogeneous Computing
DELTA: Distributed Locality-Aware Cache Partitioning for Tile-based Chip Multiprocessors
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Enhancing Multithreaded Performance of Asymmetric Multicores with SIMD Offloading
QoS-driven coordinated management of resources to save energy in multi-core systems
SaC: Exploiting execution-time slack to save energy in heterogeneous multicore systems
LEGaTO: First Steps Towards Energy-Efficient Toolset for Heterogeneous Computing
LEGaTO: Towards Energy-Efficient, Secure, Fault-tolerant Toolset for Heterogeneous Computing
Global dead-block management for task-parallel programs
Message from the general co-chairs - 2018 ACM International Conference on Computing Frontiers
Elastic Places: An Adaptive Resource Manager for Scalable and Portable Performance
Runtime-Assisted Global Cache Management for Task-based Parallel Programs
Trends in Data Locality Abstractions for HPC Systems
SWAS: Stealing Work Using Approximate System-Load Information
RADAR: Runtime-assisted dead region management for last-level caches
Task Assembly Objects: a Cache-centric Execution Model and its Prototype Runtime Implementation
Scaling FMM with data-driven OpenMP tasks on multicore architectures
Self-Tuned Software-Managed Energy Reduction in Infiniband Links
RADAR: Run-time assisted Dead-Region Management for Last-Level Caches
A Cache-centric Execution Model and Runtime for Deep Parallel Multicore Topologies
A Case for Runtime-Assisted Global Cache Management
RADAR: Runtime-Assisted Dead Region Management for Last-Level Caches
Scalable and Locality-aware Resource Management with Task Assembly Objects
DAGViz: A DAG Visualization Tool for Analyzing Task Parallel Program Traces
Scalable analysis of multicore data reuse and sharing
Download publication list
You can download this list to your computer.
Filter and download publication list
As logged in user (Chalmers employee) you find more export functions in MyResearch.
You may also import these directly to Zotero or Mendeley by using a browser plugin. These are found herer:
Zotero Connector
Mendeley Web Importer
The service SwePub offers export of contents from Research in other formats, such as Harvard and Oxford in .RIS, BibTex and RefWorks format.
Showing 8 research projects
Pilot using Independent Local & Open Technologies (The European PILOT)
Principer för beräknande minnesenheter (PRIDE)
P4PIM: Principles of power-constrained HPC programming for PIM networks
Very Efficient Deep Learning in IOT (VEDLIoT)
Low-energy toolset for heterogeneous computing (LEGaTO)
ACE: Approximate Algorithms and Computing Systems