INFOTOPO  Topological Information Data Analysis. Deep statistical unsupervised and supervised learning.

INFOTOPO  Topological Information Data Analysis. Deep statistical unsupervised and supervised learning.
The INFOTOPO library is a generic open source suite of Python Programs (compatible with Python 3.4.x, on Linux, windows, or mac) for Information Topological Data Analysis. It is available on Github depository. The library offers stateoftheart statistical high dimensional data structures analysis and algorithms to detect covarying patterns and clusters, multiscale data analysis.
New release (easy to use: scikit and sklearn compatible and format, and with pip install august 2020):
INFOTOPO version 0.1You can find the software on github.
InfoTopo is a Machine Learning method based on Information Cohomology, a cohomology of statistical systems [0,1,8,9]. It allows to estimate higher order statistical structures, dependences and (refined) independences or generalised (possibly nonlinear) correlations and to uncover their structure as simplicial complex. It provides estimations of the basic information functions, entropy, joint and condtional, multivariate MutualInformations (MI) and conditional MI, Total Correlations…
InfoTopo is at the crossroad of Topological Data Analysis, Deep Neural Network learning, statistical physics and complex systems:
 With respect to Topological Data Analysis (TDA), it provides intrinsically probabilistic methods that does not assume metric (Random Variable’s alphabets are not necessarilly ordinal) [2,3,6]. It also provide a quantification of higher order statistical interactions that cannot be detected by pairwise relations or methods based on VietorisRips complexes.
 With respect to Deep Neural Networks (DNN), it provides a simplical complex constrained DNN structure with topologically derived unsupervised and supervised learning rules (forward propagation, differential statistical operators), the PoincaréShannon machine. The neurons are random Variables, the depth of the layers corresponds to the dimensions of the complex [3,4,5].
 With respect to statistical physics, it provides generalized correlation functions, free and internal energy functions, estimations of the nbody interactions contributions to energy functional, that holds in nonhomogeous and finitediscrete case, without meanfield assumptions. Cohomological Complex implements the minimum freeenergy principle. Information Topology is rooted in cognitive sciences and computational neurosciences, and generalizesunifies some consciousness theories [5].
 With respect to complex systems studies, it generalizes complex networks and Probabilistic graphical models to higher degreedimension interactions [2,3].
It assumes basically:
 a classical probability space (here a discrete finite sample space), geometrically formalized as a probability simplex with basic conditionning and Bayes rule and implementing
 a complex (here simplicial) of random variable with a joint operators
 a quite generic coboundary operator (Hochschild, Homological algebra with a (left) action of conditional expectation)
The details for the underlying mathematics and methods can be found in the papers:
[0] Manin, Y., Marcolli, M., Homotopy Theoretic and Categorical Models of Neural Information Networks, 2020, arXiv:2006.15136, PDF0
[1] Vigneaux J., Topology of Statistical Systems. A Cohomological Approach to Information Theory. Ph.D. Thesis, Paris 7 Diderot University, Paris, France, June 2019. PDF1
[2] Baudot P., Tapia M., Bennequin, D. , Goaillard J.M., Topological Information Data Analysis. 2019, Entropy, 21(9), 869 PDF2
[3] Baudot P., The PoincaréShannon Machine: Statistical Physics and Machine Learning aspects of Information Cohomology. 2019, Entropy , 21(9), PDF3
[4] Baudot P. , Bernardi M., The PoincaréBoltzmann Machine: passing the information between disciplines, ENAC Toulouse France. 2019 PDF4
[5] Baudot P. , Bernardi M., Information Cohomology methods for learning the statistical structures of data. DS3 Data Science, Ecole Polytechnique 2019 PDF5
[6] Tapia M., Baudot P., Dufour M., FormizanoTreziny C., Temporal S., Lasserre M., Kobayashi K., Goaillard J.M.. Neurotransmitter identity and electrophysiological phenotype are genetically coupled in midbrain dopaminergic neurons. Scientific Reports. 2018. PDF6
[7] Baudot P., Elements of qualitative cognition: an Information Topology Perspective. Physics of Life Reviews. 2019. extended version on Arxiv. PDF7
[8] Baudot P., Bennequin D., The homological nature of entropy. Entropy, 2015, 17, 166; doi:10.3390. PDF8
[9] Baudot P., Bennequin D., Topological forms of information. AIP conf. Proc., 2015. 1641, 213. PDF9
The previous version of the software INFOTOPO : the 20132017 scripts are available at Github infotopo
The INFOTOPO library is developed as part of the Channelomics project supported by the European Research Council, developped at UNIS Inserm 1072, and thanks previously to supports and hostings since 2007 of Max Planck Institute for Mathematic in the Sciences (MPIMIS) and Complex System Instititute ParisIledeFrance (ISCPIF) and Institut de Mathématiques de Jussieu  Paris Rive Gauche (IMJPRG)