This paper proposes using social robots to enhance children's experiences in museums. Specifically, we aim to equip these social robots with multimodal large language models (MLLMs) to generate questions that engage children interactively. To achieve this, we evaluate the capabilities of LLaVA models in generating diverse and relevant questions about artworks, comparing their performance on visual questions with contextual questions. We utilize a subset of the AQUA dataset to assess both quantitative metrics and qualitative aspects of the generated questions. Additionally, we examine the models' ability to create engaging questions tailored specifically for children. We emphasize how MLLMs can generate questions that may increase enjoyment during visits, promote active observation, and enhance children's cognitive and emotional engagement with artworks. This approach aims to contribute to more inclusive and effective learning experiences in museum settings.

Ferrato, A., Gena, C., Limongelli, C., Sansonetti, G. (2025). Multimodal LLM Question Generation for Children's Art Engagement via Museum Social Robots. In UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization (pp.144-150). NEW YORK, NY : Association for Computing Machinery, Inc [10.1145/3708319.3733663].

Multimodal LLM Question Generation for Children's Art Engagement via Museum Social Robots

Ferrato, Alessio;Gena, Cristina;Limongelli, Carla;Sansonetti, Giuseppe
2025-01-01

Abstract

This paper proposes using social robots to enhance children's experiences in museums. Specifically, we aim to equip these social robots with multimodal large language models (MLLMs) to generate questions that engage children interactively. To achieve this, we evaluate the capabilities of LLaVA models in generating diverse and relevant questions about artworks, comparing their performance on visual questions with contextual questions. We utilize a subset of the AQUA dataset to assess both quantitative metrics and qualitative aspects of the generated questions. Additionally, we examine the models' ability to create engaging questions tailored specifically for children. We emphasize how MLLMs can generate questions that may increase enjoyment during visits, promote active observation, and enhance children's cognitive and emotional engagement with artworks. This approach aims to contribute to more inclusive and effective learning experiences in museum settings.
2025
Ferrato, A., Gena, C., Limongelli, C., Sansonetti, G. (2025). Multimodal LLM Question Generation for Children's Art Engagement via Museum Social Robots. In UMAP 2025 - Adjunct Proceedings of the 33rd ACM Conference on User Modeling, Adaptation and Personalization (pp.144-150). NEW YORK, NY : Association for Computing Machinery, Inc [10.1145/3708319.3733663].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/521176
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact