Please use this identifier to cite or link to this item:
Title: Robust principal graphs for data approximation
Authors: Gorban, Alexander N.
Mirkes, Evgeny M.
Zinovyev, Andrei
First Published: 2017
Presented at: ECDA2015 (European Conference on Data Analysis), University of Essex, Colchester, UK
Start Date: 2-Sep-2015
End Date: 4-Sep-2015
Publisher: Institut für Informationswirtschaft und Marketing (IISM)
Citation: Archives of Data Science, Series A, 2017, 2(1) 16 S. online
Abstract: Revealing hidden geometry and topology in noisy data sets is a challenging task. Elastic principal graph is a computationally efficient and flexible data approximator based on embedding a graph into the data space and minimizing the energy functional penalizing the deviation of graph nodes both from data points and from pluri-harmonic configuration (generalization of linearity). The structure of principal graph is learned from data by application of a topological grammar which in the simplest case leads to the construction of principal curves or trees. In order to more efficiently cope with noise and outliers, here we suggest using a trimmed data approximation term to increase the robustness of the method. The modification of the method that we suggest does not affect either computational efficiency or general convergence properties of the original elastic graph method. The trimmed elastic energy functional remains a Lyapunov function for the optimization algorithm. On several examples of complex data distributions we demonstrate how the robust principal graphs learn the global data structure and show the advantage of using the trimmed data approximation term for the construction of principal graphs and other popular data approximators.
DOI Link: 10.5445/KSP/1000058749/11
ISSN: 2363-9881
Version: Publisher Version
Status: Peer-reviewed
Type: Journal Article
Rights: Copyright © the authors, 2017. This is an open-access article distributed under the terms of the Creative Commons Attribution-ShareAlike License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited, and that any contributions are distributed under the same license as the original.
Description: The Parameters of the methods used are provided together with the code from the corresponding GitHub
Appears in Collections:Conference Papers & Presentations, Dept. of Mathematics

Files in This Item:
File Description SizeFormat 
adsa_2-1_article11.pdfPublished (publisher PDF)2.97 MBAdobe PDFView/Open

Items in LRA are protected by copyright, with all rights reserved, unless otherwise indicated.