TS-GNPR Clustering Random Walk Time Series - Frank Nielsen, Gautier Marti, Philippe Donnat, Philippe Very
Author(s): Frank Nielsen, Gautier Marti, Philippe Donnat, Philippe Very
DOI URL: http://dx.doi.org/10.1007/978-3-319-25040-3_72
Creative Commons Attribution-ShareAlike 4.0 International
We present in this paper a novel non-parametric approach useful for clustering independent identically distributed stochastic processes. We introduce a pre-processing step consisting in mapping multivariate independent and identically distributed samples from random variables to a generic non-parametric representation which factorizes dependency and marginal distribution apart without losing any information. An associated metric is defined where the balance between random variables dependency and distribution information is controlled by a single parameter. This mixing parameter can be learned or played with by a practitioner, such use is illustrated on the case of clustering financial time series. Experiments, implementation and results obtained on public financial time series are online on a web portal http://www.datagrapple.com .