A domain adaptive deep learning solution for scanpath prediction of paintings

IRIS

Cultural heritage understanding and preservation is an important issue for society as it represents a fundamental aspect of its identity. Paintings represent a significant part of cultural heritage, and are the subject of study continuously. However, the way viewers perceive paintings is strictly related to the so-called HVS (Human Vision System) behaviour. This paper focuses on the eye-movement analysis of viewers during the visual experience of a certain number of paintings. In further details, we introduce a new approach to predicting human visual attention, which impacts several cognitive functions for humans, including the fundamental understanding of a scene, and then extend it to painting images. The proposed new architecture ingests images and returns scanpaths, a sequence of points featuring a high likelihood of catching viewers’ attention. We use an FCNN (Fully Convolutional Neural Network), in which we exploit a differentiable channel-wise selection and Soft-Argmax modules. We also incorporate learnable Gaussian distributions onto the network bottleneck to simulate visual attention process bias in natural scene images. Furthermore, to reduce the effect of shifts between different domains (i.e. natural images, painting), we urge the model to learn unsupervised general features from other domains using a gradient reversal classifier. The results obtained by our model outperform existing state-of-the-art ones in terms of accuracy and efficiency.

A domain adaptive deep learning solution for scanpath prediction of paintings, 2022.

A domain adaptive deep learning solution for scanpath prediction of paintings

Kerkouri, Mohamed Amine;Tliba, Marouane;Chetouani, Aladine;Bruno, Alessandro

2022-01-01

Abstract

Cultural heritage understanding and preservation is an important issue for society as it represents a fundamental aspect of its identity. Paintings represent a significant part of cultural heritage, and are the subject of study continuously. However, the way viewers perceive paintings is strictly related to the so-called HVS (Human Vision System) behaviour. This paper focuses on the eye-movement analysis of viewers during the visual experience of a certain number of paintings. In further details, we introduce a new approach to predicting human visual attention, which impacts several cognitive functions for humans, including the fundamental understanding of a scene, and then extend it to painting images. The proposed new architecture ingests images and returns scanpaths, a sequence of points featuring a high likelihood of catching viewers’ attention. We use an FCNN (Fully Convolutional Neural Network), in which we exploit a differentiable channel-wise selection and Soft-Argmax modules. We also incorporate learnable Gaussian distributions onto the network bottleneck to simulate visual attention process bias in natural scene images. Furthermore, to reduce the effect of shifts between different domains (i.e. natural images, painting), we urge the model to learn unsupervised general features from other domains using a gradient reversal classifier. The results obtained by our model outperform existing state-of-the-art ones in terms of accuracy and efficiency.

Scheda breve

Scheda completa

Scheda completa (DC)

	Lingua/e
	
				Inglese
			
	Data di pubblicazione degli atti o dell'intervento
	
				2022
			
	DOI
	
				https://dx.doi.org/10.1145/3549555.3549597
			
	URL
	
				https://dl.acm.org/doi/abs/10.1145/3549555.3549597
			
	Nome del convegno
	
				19th International Conference on Content-based Multimedia Indexing
			
	Rilevanza del convegno
	
				internazionale
			
	Relazione
	
				contributo
			
	Titolo degli Atti
	
				Proceedings of the 19th International Conference on Content-Based Multimedia Indexing
			
	Pagina iniziale del contributo
	
				57
			
	Pagina finale del contributo
	
				63
			
	ISBN degli Atti
	
				9781450397209
			
	Paese di pubblicazione
	
				United States
			
	Editore
	
				ACM Association for Computing Machinery
			
	Referee
	
				esperti anonimi
			
	Formato
	
				Online
			
	Settori scientifico-disciplinari (validi fino a 24/06/2024)
	
				Settore INF/01 - Informatica
			
	Numero autori
	
				4
			
	Appare nelle tipologie:
	
				4.01 Contributo in atti di convegno (pubblicato)

File in questo prodotto:

File	Dimensione	Formato
2209.11338.pdf Accessibile solo dagli utenti con account Apeiron Tipologia: Documento in Pre-print Dimensione 1.34 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.34 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10808/50886

Citazioni

ND

4

3

social impact