On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity
Paper in proceedings, 2022

Most complex machine learning and modelling techniques are prone to overfitting and may subsequently generalise poorly to future data. Artificial neural networks are no different in this regard and, despite having a level of implicit regularisation when trained with gradient descent, often require the aid of explicit regularisers. We introduce a new framework, Model Gradient Similarity (MGS), that (1) serves as a metric of regularisation, which can be used to monitor neural network training, (2) adds insight into how explicit regularisers, while derived from widely different principles, operate via the same underlying mechanism of increasing MGS, and (3) provides the basis for a new regularisation scheme which exhibits excellent performance, especially in challenging settings such as high levels of label noise or limited sample sizes.
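The core quantity behind the abstract can be illustrated numerically. The sketch below is not the paper's implementation; it merely assumes that "model gradient similarity" is measured as pairwise cosine similarity between per-sample model gradients ∇_θ f(x_i; θ), and uses a toy one-layer model with analytically known gradients.

```python
import numpy as np

# Illustrative model: f(x; w) = tanh(w @ x), so the per-sample model
# gradient with respect to the parameters w is (1 - tanh(w @ x)^2) * x.
rng = np.random.default_rng(0)
w = rng.normal(size=3)       # toy parameter vector
X = rng.normal(size=(4, 3))  # four illustrative samples

def model_gradient(x, w):
    """Gradient of f(x; w) = tanh(w @ x) with respect to w."""
    return (1.0 - np.tanh(w @ x) ** 2) * x

grads = np.stack([model_gradient(x, w) for x in X])

# Pairwise cosine similarity between per-sample model gradients.
# High average off-diagonal similarity means the gradients point in
# similar directions -- the regime the abstract associates with
# stronger regularisation.
unit = grads / np.linalg.norm(grads, axis=1, keepdims=True)
mgs_matrix = unit @ unit.T

print(np.round(mgs_matrix, 3))
```

In a real network the per-sample gradients would come from automatic differentiation rather than a closed form, but the similarity computation is unchanged: a matrix with unit diagonal whose off-diagonal entries can be monitored over the course of training.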

Authors

Vincent Szolnoky

Chalmers, Mathematical Sciences, Applied Mathematics and Statistics

Viktor Andersson

Chalmers, Electrical Engineering, Systems and Control

Balázs Adam Kulcsár

Chalmers, Electrical Engineering, Systems and Control

Rebecka Jörnsten

Chalmers, Mathematical Sciences, Applied Mathematics and Statistics

Advances in Neural Information Processing Systems

1049-5258 (ISSN)

Vol. 35
9781713871088 (ISBN)

36th Conference on Neural Information Processing Systems, NeurIPS 2022
New Orleans, USA

Subject Categories

Communication Systems

Bioinformatics (Computational Biology)

Computer Systems


More information

Latest update

1/16/2024