The use of large language models (LLMs) is increasingly explored in the field of automatic text scoring, particularly in tasks such as automatic essay scoring (AES) and abstract screening for systematic reviews. While prior research has focused on evaluating the accuracy of these models, their robustness and potential biases remain underexplored. To address this gap, we investigated robustness to novel author information and authorship bias of four LLMs in scientific abstracts scoring on 10 evaluation criteria. We conducted three controlled experiments on abstracts from five arXiv categories, comparing baseline scores against conditions where author information was introduced as a perturbation. These perturbations included: associating abstracts with fake authoritative CVs, associating them with fake non-authoritative CVs (both generated by another LLM), and associating abstracts with famous, well-known authors. The results of our controlled analyses illustrate that LLMs lack robustness and exhibit systematic authorship bias when author context is provided. These findings highlight the need for further research to ensure fairness and transparency in automated scoring systems.

Sajeva, A., Merialdo, P. (2026). Robustness and authorship bias of large language models in scientific abstracts scoring. DISCOVER ARTIFICIAL INTELLIGENCE, 6(1) [10.1007/s44163-026-01295-z].

Robustness and authorship bias of large language models in scientific abstracts scoring

Sajeva, Alessandro
;
Merialdo, Paolo
2026-01-01

Abstract

The use of large language models (LLMs) is increasingly explored in the field of automatic text scoring, particularly in tasks such as automatic essay scoring (AES) and abstract screening for systematic reviews. While prior research has focused on evaluating the accuracy of these models, their robustness and potential biases remain underexplored. To address this gap, we investigated robustness to novel author information and authorship bias of four LLMs in scientific abstracts scoring on 10 evaluation criteria. We conducted three controlled experiments on abstracts from five arXiv categories, comparing baseline scores against conditions where author information was introduced as a perturbation. These perturbations included: associating abstracts with fake authoritative CVs, associating them with fake non-authoritative CVs (both generated by another LLM), and associating abstracts with famous, well-known authors. The results of our controlled analyses illustrate that LLMs lack robustness and exhibit systematic authorship bias when author context is provided. These findings highlight the need for further research to ensure fairness and transparency in automated scoring systems.
2026
Sajeva, A., Merialdo, P. (2026). Robustness and authorship bias of large language models in scientific abstracts scoring. DISCOVER ARTIFICIAL INTELLIGENCE, 6(1) [10.1007/s44163-026-01295-z].
File in questo prodotto:
File Dimensione Formato  
s44163-026-01295-z.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 1.78 MB
Formato Adobe PDF
1.78 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/549376
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact