Characterizing Piecewise Linear Neural Networks

Anton Johansson

Characterizing Piecewise Linear Neural Networks
Licentiatavhandling, 2022

Neural networks utilizing piecewise linear transformations between layers have in many regards become the default network type to use across a wide range of applications. Their superior training dynamics and generalization performance irrespective of the nature of the problem has resulted in these networks achieving state of the art results on a diverse set of tasks. Even though the efficacy of these networks have been established, there is a poor understanding of their intrinsic behaviour and properties. Little is known regarding how these functions evolve during training, how they behave at initialization and how all of this is related to the architecture of the network. Exploring and detailing these properties is not only of theoretical interest, it can also aid in developing new schemes and algorithms to further improve the performance of the networks. In this thesis we thus seek to further explore and characterize these properties. We theoretically prove how the local properties of piecewise linear networks vary at initialization and explore empirically how more complex properties behave during training. We use these results to reason about which intrinsic properties are associated with the generalization performance and develop new regularization schemes. We further substantiate the empirical success of piecewise linear networks by showcasing how their application can solve two tasks relevant to the safety and effectiveness of processes related to the automotive industry.

automotive applications.

neural network

machine learning

piecewise linear

Room: Pascal. Zoom pw: 416891

Opponent: Associate Professor, Raazesh Sainudiin, Uppsala University

Online disputation

Författare

Anton Johansson

Chalmers, Matematiska vetenskaper, Tillämpad matematik och statistik

Forskning Andra publikationer

Does the dataset meet your expectations? Explaining sample representation in image data

Belgian/Netherlands Artificial Intelligence Conference,;(2020)p. 194-208

Paper i proceeding

Slope and generalization properties of neural networks

SilGAN: Generating driving maneuvers for scenario-based software-in-The-loop testing

Proceedings - 3rd IEEE International Conference on Artificial Intelligence Testing, AITest 2021,;(2021)p. 65-72

Paper i proceeding

Improved Spectral Norm Regularization for Neural Networks

Ämneskategorier (SSIF 2011)

Datorteknik

Annan matematik

Datavetenskap (datalogi)

Utgivare

Chalmers