Runtime Management of Multiprocessor Systems for Fault Tolerance, Energy Efficiency and Load Balancing
Doctoral thesis, 2019
Fault Tolerance
Algorithms
Runtime Management
Multiprocessors
Performance
Adaptive Systems
Load Balancing
Energy Efficiency
Author
Stavros Tzilis
Chalmers, Computer Science and Engineering (Chalmers), Computer Engineering (Chalmers)
A runtime manager for gracefully degrading SoCs
Proceedings - IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems,;(2014)p. 216-221
Paper in proceeding
A Probabilistic Analysis of Resilient Reconfigurable Designs
27th IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, DFT 2014, Amsterdam, Netherlands, 1-3 October 2014,;(2014)p. 141-146
Paper in proceeding
A dependable coarse-grain reconfigurable multicore array
Proceedings of the International Parallel and Distributed Processing Symposium, IPDPS,;(2014)p. 141-150
Paper in proceeding
The DeSyRe runtime support for fault-tolerant embedded MPSoCs
Proceedings - 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2014,;(2014)p. 197-204
Paper in proceeding
Reducing the performance overhead of resilient CMPs with substitutable resources
Proceedings of the 2015 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, DFTS 2015,;(2015)p. 191-196
Paper in proceeding
Resilient chip multiprocessors with mixed-grained reconfigurability
IEEE Micro,;Vol. 36(2016)p. 35-45
Journal article
Runtime Management of Adaptive MPSoCs for Graceful Degradation
2016 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (Cases),;(2016)p. Article number 2968517-
Paper in proceeding
Energy-efficient Runtime Management of Heterogeneous Multicores using Online Projection
Transactions on Architecture and Code Optimization,;Vol. 15(2019)
Journal article
SWAS: Stealing Work Using Approximate System-Load Information
46th International Conference on Parallel Processing Workshops, ICPPW 2017, Bristol, United Kingdom, 14 August 2017,;(2017)p. 309-318
Paper in proceeding
The thesis describes strategies to achieve the runtime management described above for three different types of systems: First, a system like the one described in the previous paragraph, the processors of which can fail at unpredictable times. The objective in this case is to maintain acceptable system operation for as long as possible, despite the failures. Second, a system that executes an unpredictable combination of applications, such as a handheld portable device. The objective in this case is to adapt to the user starting and terminating various applications and to maintain proper function, at the same time using as little of the device's battery as possible. Lastly, a system consisting of many processors, running an application that does not always make use of all of them, wasting computational power. The objective in this last case is to redistribute the various parts of the application on all available processors, allowing them to share the workload in a more balanced manner so that the application runs faster.
The above runtime managers have been evaluated with simulations as well as experiments on real systems, conforming to the described specifications. They have been demonstrated to improve system efficiency compared to prior work.
Embedded Multi-Core Systems for Mixed Criticality Applications in Dynamic and Changeable Real-Time Environments (EMC2)
VINNOVA (2014-00607), 2014-04-01 -- 2017-03-31.
European Commission (EC) (EC/FP7/621429), 2014-04-01 -- 2017-03-31.
Energy-efficient Heterogeneous COmputing at exaSCALE (ECOSCALE)
European Commission (EC) (EC/H2020/671632), 2015-10-01 -- 2018-12-31.
Meeting Challenges in Computer Architecture (MECCA)
European Commission (EC) (EC/FP7/340328), 2014-02-01 -- 2019-01-31.
on-Demand System Reliability (DeSyRe)
European Commission (EC) (EC/FP7/287611), 2011-10-01 -- 2015-01-31.
Subject Categories
Computer Engineering
Embedded Systems
Computer Systems
Areas of Advance
Information and Communication Technology
Energy
ISBN
978-91-7597-878-9
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie: 4559
Publisher
Chalmers
Room EA, Rännvägen 4, Chalmers
Opponent: Professor Luigi Carro, Universidade Federal do Rio Grande do Sul, Brazil