PHOENIX: Efficient computation in memory
Paper in proceeding, 2017

Parallelism is inherent in most problems but due to current programming models and architectures which have evolved from a sequential paradigm, the parallelism exploited is restricted. We believe that the most efficient parallel execution is achieved when applications are represented as graphs of operations and data, which can then be mapped for execution on a modular and scalable processing-in-memory architecture. In this paper, we present PHOENIX, a general-purpose architecture composed of many Processing Elements (PEs) with memory storage and efficient computational logic units interconnected with a mesh network-on-chip. A preliminary design of PHOENIX shows it is possible to include 10,000 PEs with a storage capacity of 0.6GByte on a 1.5cm2 chip using 14nm technology. PHOENIX may achieve 6TFLOPS with a power consumption of up to 42W, which results in a peak energy efficiency of at least 143GFLOPS/W. A simple estimate shows that for a 4K FFT, PHOENIX achieves 117GFLOPS/W which is more than double of what is achieved by state-of-the-art systems.

Parallel architectures

Energy-efficient

Processing-in-memory (PIM)

High-performance computing (HPC)

Author

Mats Rimborg

Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)

Pedro Petersen Moura Trancoso

Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)

Gunnar Carlstedt

Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)

ACM International Conference Proceeding Series

Vol. Part F131197 15-25
978-145035335-9 (ISBN)

Subject Categories (SSIF 2011)

Computer and Information Science

DOI

10.1145/3132402.3132430

ISBN

978-145035335-9

More information

Latest update

3/21/2023