Benvenuti nell'Anagrafe della Ricerca d'Ateneo

The use of large language models (LLMs) is increasingly explored in the field of automatic text scoring, particularly in tasks such as automatic essay scoring (AES) and abstract screening for systematic reviews. While prior research has focused on evaluating the accuracy of these models, their robustness and potential biases remain underexplored. To address this gap, we investigated robustness to novel author information and authorship bias of four LLMs in scientific abstracts scoring on 10 evaluation criteria. We conducted three controlled experiments on abstracts from five arXiv categories, comparing baseline scores against conditions where author information was introduced as a perturbation. These perturbations included: associating abstracts with fake authoritative CVs, associating them with fake non-authoritative CVs (both generated by another LLM), and associating abstracts with famous, well-known authors. The results of our controlled analyses illustrate that LLMs lack robustness and exhibit systematic authorship bias when author context is provided. These findings highlight the need for further research to ensure fairness and transparency in automated scoring systems.

Sajeva, A., Merialdo, P. (2026). Robustness and authorship bias of large language models in scientific abstracts scoring. DISCOVER ARTIFICIAL INTELLIGENCE, 6(1) [10.1007/s44163-026-01295-z].

Robustness and authorship bias of large language models in scientific abstracts scoring

Sajeva, Alessandro;Merialdo, Paolo

2026-01-01

Abstract

The use of large language models (LLMs) is increasingly explored in the field of automatic text scoring, particularly in tasks such as automatic essay scoring (AES) and abstract screening for systematic reviews. While prior research has focused on evaluating the accuracy of these models, their robustness and potential biases remain underexplored. To address this gap, we investigated robustness to novel author information and authorship bias of four LLMs in scientific abstracts scoring on 10 evaluation criteria. We conducted three controlled experiments on abstracts from five arXiv categories, comparing baseline scores against conditions where author information was introduced as a perturbation. These perturbations included: associating abstracts with fake authoritative CVs, associating them with fake non-authoritative CVs (both generated by another LLM), and associating abstracts with famous, well-known authors. The results of our controlled analyses illustrate that LLMs lack robustness and exhibit systematic authorship bias when author context is provided. These findings highlight the need for further research to ensure fairness and transparency in automated scoring systems.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2026
			
	Citazione
	
				Sajeva, A., Merialdo, P. (2026). Robustness and authorship bias of large language models in scientific abstracts scoring. DISCOVER ARTIFICIAL INTELLIGENCE, 6(1) [10.1007/s44163-026-01295-z].
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
s44163-026-01295-z.pdf accesso aperto Tipologia: Versione Editoriale (PDF) Licenza: Copyright dell'editore Dimensione 1.78 MB Formato Adobe PDF Visualizza/Apri	1.78 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/549376

Citazioni

ND

ND

ND

social impact