X
Mnesys - Pubblications

 

January 19, 2024

Predictive power of a Bayesian effective action for fully-connected one hidden layer neural networks in the proportional limit

Progetto: Architecture and dynamics of the social brain in the monkey

We perform accurate numerical experiments with fully-connected (FC) one-hidden layer neural networks trained with a discretized Langevin dynamics on the MNIST and CIFAR10 datasets. Our goal is to empirically determine the regimes of validity of a recently-derived Bayesian effective action for shallow architectures in the proportional limit. We explore the predictive power of the theory as a function of the parameters (the temperature , the magnitude of the Gaussian priors , , the size of the hidden layer and the size of the training set ) by comparing the experimental and predicted generalization error. The very good agreement between the effective theory and the experiments represents an indication that global rescaling of the infinite-width kernel is a main physical mechanism for kernel renormalization in FC Bayesian standard-scaled shallow networks.

Authors

Raffaella Burioni

Raffaella Burioni

Pietro Rotondo

Pietro Rotondo

Other Authors

Baglioni P., R. Pacelli, R. Aiudi, F. di Renzo, , A. Vezzani