Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

On the Thermodynamic Interpretation of Deep Learning Systems

Contributo in Atti di convegno
Data di Pubblicazione:
2021
Citazione:
On the Thermodynamic Interpretation of Deep Learning Systems / Fioresi, R.; Faglioni, F.; Morri, F.; Squadrani, L.. - 12829:(2021), pp. 909-917. ( 5th International Conference on Geometric Science of Information, GSI 2021 fra 2021) [10.1007/978-3-030-80209-7_97].
Abstract:
In the study of time evolution of the parameters in Deep Learning systems, subject to optimization via SGD (stochastic gradient descent), temperature, entropy and other thermodynamic notions are commonly employed to exploit the Boltzmann formalism. We show that, in simulations on popular databases (CIFAR10, MNIST), such simplified models appear inadequate: different regions in the parameter space exhibit significantly different temperatures and no elementary function expresses the temperature in terms of learning rate and batch size, as commonly assumed. This suggests a more conceptual approach involving contact dynamics and Lie Group Thermodynamics.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Deep Learning; Lie groups machine learning; Statistical mechanics
Elenco autori:
Fioresi, R.; Faglioni, F.; Morri, F.; Squadrani, L.
Autori di Ateneo:
FAGLIONI Francesco
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1254442
Titolo del libro:
GEOMETRIC SCIENCE OF INFORMATION (GSI 2021)
Pubblicato in:
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Journal
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.4.5.0