The growing availability of data over the last decades has given rise to a number of successful technologies, ranging from data collection and storage infrastructures to hardware and software tools for efficient computation of analytics. This context, in principle, places a great demand on data quality. As a matter of fact, experience has shown that the open Web and other platforms hosting user-generated content or real-time data can provide little quality control at content production time. To address these challenges, our aim is to provide a general and configurable model for assessing data quality supporting task composition. In particular, we introduce a model characterized along the notion of matching, illustrating the issues that can be addressed by this approach with a concrete case study. We also identify and discuss challenges to be addressed in future research to strengthen this idea.
Ceravolo, P., Bellini, E. (2019). Towards Configurable Composite Data Quality Assessment. In Proceedings - 21st IEEE Conference on Business Informatics, CBI 2019 (pp.249-257). 345 E 47TH ST, NEW YORK, NY 10017 USA : IEEE [10.1109/cbi.2019.00035].
Towards Configurable Composite Data Quality Assessment
Bellini, Emanuele
2019-01-01
Abstract
The growing availability of data over the last decades has given rise to a number of successful technologies, ranging from data collection and storage infrastructures to hardware and software tools for efficient computation of analytics. This context, in principle, places a great demand on data quality. As a matter of fact, experience has shown that the open Web and other platforms hosting user-generated content or real-time data can provide little quality control at content production time. To address these challenges, our aim is to provide a general and configurable model for assessing data quality supporting task composition. In particular, we introduce a model characterized along the notion of matching, illustrating the issues that can be addressed by this approach with a concrete case study. We also identify and discuss challenges to be addressed in future research to strengthen this idea.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


