Empirical study on label smoothing in neural networks

Mauro Mezzini
2018

Abstract

Neural networks are nowadays routinely employed to classify sets of objects, that is, to predict the class label of each object. The softmax function is a popular choice for the output function of a neural network. It yields a probability distribution over the class labels, and the label with maximum probability is the network's prediction for the object being classified. The softmax output is also used to compute the loss function, which measures the error made by the network in the classification task. In this paper we consider a simple modification to the loss function, called label smoothing. We tested this modification by training a neural network on 12 data sets containing a total of about 1 × 10^6 images. We show that this modification allows a neural network to achieve better accuracy in the classification task.
ISBN: 978-80-86943-41-1
Mezzini, M. (2018). Empirical study on label smoothing in neural networks. In 26. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, WSCG 2018, Plzen, Czech Republic, May 28 – June 1, 2018 (pp. 200-205).
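
The ideas summarized in the abstract (softmax output, prediction by maximum probability, and a loss computed against smoothed labels) can be illustrated with a minimal sketch. It assumes the common uniform formulation of label smoothing, in which the one-hot target is mixed with a uniform distribution over the K classes; the abstract does not state the exact variant or the smoothing value used in the paper, so the function names, the example logits, and epsilon = 0.1 are illustrative assumptions only.

```python
import math


def softmax(logits):
    """Turn raw network outputs into a probability distribution."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def smooth_labels(true_class, num_classes, epsilon=0.1):
    """Assumed uniform smoothing: (1 - epsilon) * one_hot + epsilon / K."""
    off = epsilon / num_classes
    return [(1.0 - epsilon) + off if k == true_class else off
            for k in range(num_classes)]


def cross_entropy(target, probs):
    """Cross-entropy between a target distribution and predicted probabilities."""
    return -sum(t * math.log(p) for t, p in zip(target, probs) if t > 0.0)


# Hypothetical logits for one object and three classes; the true class is 0.
logits = [2.0, 1.0, 0.1]
probs = softmax(logits)

# The predicted label is the one with maximum probability.
prediction = max(range(len(probs)), key=probs.__getitem__)

# epsilon = 0 recovers the usual one-hot cross-entropy loss.
hard_loss = cross_entropy(smooth_labels(0, 3, epsilon=0.0), probs)
smooth_loss = cross_entropy(smooth_labels(0, 3, epsilon=0.1), probs)
```

In this example the smoothed loss is larger than the one-hot loss even though the prediction is correct, because part of the target mass sits on the other classes. This is the mechanism by which smoothing discourages overconfident outputs.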

Use this identifier to cite or link to this document: https://hdl.handle.net/11590/338499