Outlier detection has gained more relevance throughout the years, and, as of now, its elds of application range from medicine and engineering to - nance. As for the latter, outliers can be the consequence of human error or fraudolent activities; similarly, nancial crises can be viewed as anomalies since markets experience atypical behaviors in those periods. Because of this widespread practical relevance, many authors tackled this topic. Hence, the theory behind anomaly detection has unsurprisingly evolved, from the rst studies which dealt with more simple instances, i.e., univariate Gaussian data, to more complex cases, such as multivariate data following nonparametric distributions. Especially when dealing with high dimensional multivariate data, many techniques aim at nding outliers in univariate projections of such data, in order to reduce the computational e ort. For this reason, a number of studies have been devoted to determining the directions onto which the data must be projected, so as to exploit as much information as possible from the distribution. Therefore, for instance, [4] project the data onto the directions that maximize and minimize the kurtosis coe cient of the projection, while [3] chooses the direction that maximizes the fourth cumulant of the projection. Following this stream of literature, our work aims at detecting outliers, represented by nancial crises, by projecting the data onto the direction that maximizes the cumulant generating function (CGF). In our paper, we re ne some theoretical results of [1] and [2]. More precisely, we prove that CGF is a convex function, and, then, we characterize the CGF maximization problem on the unit n-circle as a concave minimization problem. Then, we extend the PCA technique with the CGF maximization procedure for the outlier detection. Finally, we perform an extensive empirical analysis both on simulated and on historical data, and we compare our method with the aforementioned ones, along with a machine learning approach.

Cesarone, F., Giacometti, R., Ricci, J.M. (2023). Non‐parametric cumulants approach for outlier detection of multivariate financial data. In XXVI Workshop on Quantitative Finance.

Non‐parametric cumulants approach for outlier detection of multivariate financial data

Francesco Cesarone;Rosella Giacometti;Jacopo Maria Ricci
2023-01-01

Abstract

Outlier detection has gained more relevance throughout the years, and, as of now, its elds of application range from medicine and engineering to - nance. As for the latter, outliers can be the consequence of human error or fraudolent activities; similarly, nancial crises can be viewed as anomalies since markets experience atypical behaviors in those periods. Because of this widespread practical relevance, many authors tackled this topic. Hence, the theory behind anomaly detection has unsurprisingly evolved, from the rst studies which dealt with more simple instances, i.e., univariate Gaussian data, to more complex cases, such as multivariate data following nonparametric distributions. Especially when dealing with high dimensional multivariate data, many techniques aim at nding outliers in univariate projections of such data, in order to reduce the computational e ort. For this reason, a number of studies have been devoted to determining the directions onto which the data must be projected, so as to exploit as much information as possible from the distribution. Therefore, for instance, [4] project the data onto the directions that maximize and minimize the kurtosis coe cient of the projection, while [3] chooses the direction that maximizes the fourth cumulant of the projection. Following this stream of literature, our work aims at detecting outliers, represented by nancial crises, by projecting the data onto the direction that maximizes the cumulant generating function (CGF). In our paper, we re ne some theoretical results of [1] and [2]. More precisely, we prove that CGF is a convex function, and, then, we characterize the CGF maximization problem on the unit n-circle as a concave minimization problem. Then, we extend the PCA technique with the CGF maximization procedure for the outlier detection. Finally, we perform an extensive empirical analysis both on simulated and on historical data, and we compare our method with the aforementioned ones, along with a machine learning approach.
2023
Cesarone, F., Giacometti, R., Ricci, J.M. (2023). Non‐parametric cumulants approach for outlier detection of multivariate financial data. In XXVI Workshop on Quantitative Finance.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11590/459610
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact