Bayesian networks are multivariate statistical models using a di- rected acyclic graph to represent statistical dependencies among vari- ables. When dealing with Bayesian Networks it is common to assume that all the variables are discrete. This is not often the case in many real contexts where also continuous variables are observed. A common solution consists in discretizing the continuous variables. In this paper we propose a discretization algorithm based on the Kullback-Leibler divergence measure. Formally, we deal with the problem of discretiz- ing a continuous variable Y conditionally on its parents. We show that such a problem is polynomially solvable. A simulation study is finally performed.
Marella, D., Mezzini, M., Vicard, P. (2015). Improving Discretization Exploiting Dependence Structure. In WORKING PAPERS.
Improving Discretization Exploiting Dependence Structure
MARELLA, Daniela;MEZZINI, MAURO;VICARD, Paola
2015-01-01
Abstract
Bayesian networks are multivariate statistical models using a di- rected acyclic graph to represent statistical dependencies among vari- ables. When dealing with Bayesian Networks it is common to assume that all the variables are discrete. This is not often the case in many real contexts where also continuous variables are observed. A common solution consists in discretizing the continuous variables. In this paper we propose a discretization algorithm based on the Kullback-Leibler divergence measure. Formally, we deal with the problem of discretiz- ing a continuous variable Y conditionally on its parents. We show that such a problem is polynomially solvable. A simulation study is finally performed.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.