Characterizing the uncertainty of web data: Models and experiences

BLANCO, LORENZO;CRESCENZI, VALTER;MERIALDO, PAOLO;PAPOTTI, PAOLO
2011-01-01

Abstract

An increasing number of web sites offer structured information about recognizable concepts relevant to many application domains, such as finance, sports, and commercial products. However, web data is inherently imprecise and uncertain, and different web sources can provide conflicting values. Characterizing the uncertainty of web data is an important issue, and several models have recently been proposed in the literature. The paper illustrates state-of-the-art Bayesian models for evaluating the quality of data extracted from the Web and reports the results of an extensive application of the models to real-life web data. Our experimental results show that for some applications even simple approaches can provide effective results, while sophisticated solutions are needed to obtain a more precise characterization of the uncertainty. © 2011 ACM.
ISBN: 9781450307062
Blanco, L., Crescenzi, V., Merialdo, P., Papotti, P. (2011). Characterizing the uncertainty of web data: Models and experiences. In ACM International Conference Proceeding Series (pp. 1-8) [10.1145/1964114.1964116].

Use this identifier to cite or link to this document: https://hdl.handle.net/11590/310867