We present OpenTriage, a system for extracting structured entities from detail Web pages of several sites and finding linkages between the extracted data. The system builds an integrated knowledge base by leveraging the redundancy of information with an Open Information Extraction approach: it incrementally processes all the available pages while discovering new attributes. It is based on a hybrid human-machine learning technique that targets a desired quality level. After two preliminary tasks, i.e., blocking and extraction, OpenTriage interleaves two integration tasks, i.e., linkage, and matching, while managing the uncertainty by means of very simple questions that are posed to an external oracle.

Voyat, R., Crescenzi, V., Merialdo, P. (2022). OpenTRIAGE: Entity Linkage for DetailWebpages. In CEUR Workshop Proceedings (pp.1-12). CEUR-WS.

OpenTRIAGE: Entity Linkage for DetailWebpages

Voyat R.;Crescenzi V.;Merialdo P.
2022

Abstract

We present OpenTriage, a system for extracting structured entities from detail Web pages of several sites and finding linkages between the extracted data. The system builds an integrated knowledge base by leveraging the redundancy of information with an Open Information Extraction approach: it incrementally processes all the available pages while discovering new attributes. It is based on a hybrid human-machine learning technique that targets a desired quality level. After two preliminary tasks, i.e., blocking and extraction, OpenTriage interleaves two integration tasks, i.e., linkage, and matching, while managing the uncertainty by means of very simple questions that are posed to an external oracle.
Voyat, R., Crescenzi, V., Merialdo, P. (2022). OpenTRIAGE: Entity Linkage for DetailWebpages. In CEUR Workshop Proceedings (pp.1-12). CEUR-WS.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/418222
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact