Skip to Main Content (Press Enter)

Logo UNIMORE
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze

UNI-FIND
Logo UNIMORE

|

UNI-FIND

unimore.it
  • ×
  • Home
  • Corsi
  • Insegnamenti
  • Professioni
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Attività
  • Competenze
  1. Pubblicazioni

On the Steplength Selection in Stochastic Gradient Methods

Contributo in Atti di convegno
Data di Pubblicazione:
2020
Citazione:
On the Steplength Selection in Stochastic Gradient Methods / Franchini, G.; Ruggiero, V.; Zanni, L.. - 11973:(2020), pp. 186-197. ( 3rd Triennial International Conference and Summer School on Numerical Computations: Theory and Algorithms, NUMTA 2019 ita 2019) [10.1007/978-3-030-39081-5_17].
Abstract:
This paper deals with the steplength selection in stochastic gradient methods for large scale optimization problems arising in machine learning. We introduce an adaptive steplength selection derived by tailoring a limited memory steplength rule, recently developed in the deterministic context, to the stochastic gradient approach. The proposed steplength rule provides values within an interval, whose bounds need to be prefixed by the user. A suitable choice of the interval bounds allows to perform similarly to the standard stochastic gradient method equipped with the best-tuned steplength. Since the setting of the bounds slightly affects the performance, the new rule makes the tuning of the parameters less expensive with respect to the choice of the optimal prefixed steplength in the standard stochastic gradient method. We evaluate the behaviour of the proposed steplength selection in training binary classifiers on well known data sets and by using different loss functions.
Tipologia CRIS:
Relazione in Atti di Convegno
Keywords:
Machine learning; Ritz-like values; Steplength selection rule; Stochastic gradient methods
Elenco autori:
Franchini, G.; Ruggiero, V.; Zanni, L.
Autori di Ateneo:
FRANCHINI Giorgia
ZANNI Luca
Link alla scheda completa:
https://iris.unimore.it/handle/11380/1199144
Titolo del libro:
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pubblicato in:
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Journal
LECTURE NOTES IN ARTIFICIAL INTELLIGENCE
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.4.5.0