Transactional prefetching: Narrowing the window of contention in hardware transactional memory

Anurag Negi; A. Armejach; A. Cristal; O.S. Unsal; Per Stenström

doi:10.1145/2370816.2370844

Paper in proceedings, 2012

Memory access latency is the primary performance bottleneck in modern computer systems. Prefetching data before it is needed by a processing core allows substantial performance gains by overlapping significant portions of memory latency with useful work. Prior work has investigated this technique and measured potential benefits in a variety of scenarios. However, its use in speeding up Hardware Transactional Memory (HTM) has so far remained unexplored. In several HTM designs, transactions invalidate speculatively updated cache lines when they abort. Such cache lines tend to have high locality and are likely to be accessed again when the transaction re-executes. Coarse-grained transactions that update several cache lines are particularly susceptible to performance degradation even under moderate contention. However, such transactions show strong locality of reference, especially when contention is high. Prefetching cache lines with high locality can, therefore, improve overall concurrency by speeding up transactions and thereby narrowing the window of time in which such transactions persist and can cause contention. Such transactions are important since they are likely to form a common TM use case. We note that traditional prefetch techniques may not be able to track such lines adequately or issue prefetches quickly enough. This paper investigates the use of prefetching in HTMs, proposes a simple design to identify and request prefetch candidates, and measures the performance gains achievable for several representative TM workloads.
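The effect described in the abstract can be illustrated with a toy model. The sketch below is not the design proposed in the paper; it is a minimal illustration under stated assumptions: a cache modelled as a set of line addresses, hypothetical hit and miss latencies of 1 and 100 cycles, and prefetches that complete fully before the transaction retries. The names `ToyCache` and `run_transaction` are invented for this example.

```python
class ToyCache:
    """Hypothetical cache model: a set of resident line addresses."""

    def __init__(self, miss_latency=100, hit_latency=1):
        # Latency values are illustrative assumptions, not measurements.
        self.lines = set()
        self.miss_latency = miss_latency
        self.hit_latency = hit_latency

    def access(self, line):
        """Return the latency of one access and fill the line on a miss."""
        if line in self.lines:
            return self.hit_latency
        self.lines.add(line)
        return self.miss_latency

    def invalidate(self, lines):
        """Drop speculatively updated lines, as happens on abort in some HTMs."""
        self.lines.difference_update(lines)

    def prefetch(self, lines):
        """Bring lines back in; assumed to finish before the retry begins."""
        self.lines.update(lines)


def run_transaction(cache, write_set, aborts, use_prefetch):
    """Return memory cycles spent in the final, committing attempt."""
    for _ in range(aborts):
        for line in write_set:
            cache.access(line)
        cache.invalidate(write_set)      # abort discards speculative lines
        if use_prefetch:
            cache.prefetch(write_set)    # re-request lines likely to be reused
    return sum(cache.access(line) for line in write_set)


write_set = [0x40 * i for i in range(8)]  # eight distinct cache lines
baseline = run_transaction(ToyCache(), write_set, aborts=1, use_prefetch=False)
prefetched = run_transaction(ToyCache(), write_set, aborts=1, use_prefetch=True)
print(baseline, prefetched)
```

In this idealized model the retried transaction misses on every line without prefetching and hits on every line with it, so the re-execution is shorter and the window in which it can conflict with other transactions shrinks. Real hardware would overlap only part of the miss latency.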

Hardware transactional memory

Prefetching

Multicores

Authors

Anurag Negi

Chalmers, Computer Science and Engineering, Computer Engineering

A. Armejach

Centro Nacional de Supercomputacion

Universitat Politecnica de Catalunya

A. Cristal

Centro Nacional de Supercomputacion

Consejo Superior de Investigaciones Científicas (CSIC)

O.S. Unsal

Centro Nacional de Supercomputacion

Per Stenström

Chalmers, Computer Science and Engineering, Computer Engineering

Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

1089795X (ISSN)

181-190

Subject categories (SSIF 2011)

Computer and Information Sciences

DOI

10.1145/2370816.2370844

More information

Last updated

2025-03-09
