The In Codice Ratio project is developing tools to extract the knowledge contained in the ancient manuscripts of the Vatican Archives. Because datasets suitable for this setting are scarce, we rely on crowdsourcing in all phases of the project. In this paper we discuss our approaches for using inexpensive non-expert workers to perform labelling operations on challenging manuscripts. We describe the range of tasks we are devising and the priority and redundancy policies we employ for them, and we present the datasets collected thus far and the corresponding results.
Firmani, D., Merialdo, P., Nieddu, E., Rossi, A., Torlone, R. (2020). Crowdsourcing for Building Knowledge Graphs at Scale from the Vatican Archives. In CEUR Workshop Proceedings (pp. 242-249). CEUR-WS.
Crowdsourcing for Building Knowledge Graphs at Scale from the Vatican Archives
Firmani D.;Merialdo P.;Nieddu E.;Rossi A.;Torlone R.
2020-01-01