Countering Adversarial Examples by Means of Steganographic Attacks

Colangelo, F.; Neri, A.; Battisti, F.
2019-01-01

Abstract

Deep learning models are now used in multiple contexts, including safety-critical applications. However, it has been shown that small adversarial alterations to the input can undermine the performance of the model, leading to unreliable results, while being hardly visible to a human observer. Image watermarking shares similarities with this field: a small amount of information is embedded inside the media, with the aim of being imperceptible yet robust. Many attacks have been developed to remove watermarks. In this paper, we evaluate the effectiveness of multiple image transformations in removing adversarial perturbations from images. Our experiments on the MNIST dataset against a Projected Gradient Descent-based adversary demonstrate that many transformations yield a significant gain in accuracy when classifying adversarial examples, while not degrading the quality of the images when the adversary is absent or not significant.
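
As an illustration of the pipeline the abstract describes, the sketch below (not taken from the paper) crafts Projected Gradient Descent adversarial examples for a toy MNIST classifier and then applies a simple input transformation before classification. The model architecture, the epsilon and step-size values, and the median filter used as the transformation are illustrative assumptions; the paper evaluates transformations inspired by watermark-removal attacks.

```python
# Minimal sketch, assuming a PyTorch setup; not the authors' code.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SmallCNN(nn.Module):
    """Toy MNIST classifier, a stand-in for the model evaluated in the paper."""
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 16, 3, padding=1)
        self.conv2 = nn.Conv2d(16, 32, 3, padding=1)
        self.fc = nn.Linear(32 * 7 * 7, 10)

    def forward(self, x):
        x = F.max_pool2d(F.relu(self.conv1(x)), 2)   # 28x28 -> 14x14
        x = F.max_pool2d(F.relu(self.conv2(x)), 2)   # 14x14 -> 7x7
        return self.fc(x.flatten(1))

def pgd_attack(model, x, y, eps=0.3, alpha=0.01, steps=40):
    """Projected Gradient Descent inside the L-infinity ball of radius eps."""
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()        # ascend the loss
            x_adv = x + (x_adv - x).clamp(-eps, eps)   # project onto the eps-ball
            x_adv = x_adv.clamp(0.0, 1.0)              # stay in valid pixel range
    return x_adv.detach()

def median_filter_defense(x, k=3):
    """Illustrative input transformation: per-pixel median over a k x k window."""
    pad = k // 2
    padded = F.pad(x, (pad, pad, pad, pad), mode='reflect')
    patches = F.unfold(padded, kernel_size=k)          # (N, k*k, H*W) for 1 channel
    return patches.median(dim=1).values.reshape(x.shape)

# Usage sketch: compare predictions on adversarial vs. transformed inputs.
model = SmallCNN().eval()                    # assume weights trained on MNIST
x = torch.rand(8, 1, 28, 28)                 # stand-in for an MNIST batch in [0, 1]
y = torch.randint(0, 10, (8,))
x_adv = pgd_attack(model, x, y)
pred_adv = model(x_adv).argmax(dim=1)                           # typically fooled
pred_def = model(median_filter_defense(x_adv)).argmax(dim=1)    # after transformation
```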
ISBN: 978-1-7281-4496-2
Colangelo, F., Neri, A., Battisti, F. (2019). Countering Adversarial Examples by Means of Steganographic Attacks. In Proceedings - European Workshop on Visual Information Processing, EUVIP (pp. 193-198). Institute of Electrical and Electronics Engineers Inc. DOI: 10.1109/EUVIP47703.2019.8946254.

Use this identifier to cite or link to this document: https://hdl.handle.net/11590/364030