Visual Growing Neural Gas for Exploratory Data Analysis
2019 (English)In: Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, SciTePress, 2019, Vol. 3, p. 58-71Conference paper, Published paper (Refereed)
Abstract [en]
This paper argues for the use of a topology learning algorithm, the Growing Neural Gas (GNG), for providing an overview of the structure of large and multidimensional datasets that can be used in exploratory data analysis. We introduce a generic, off-the-shelf library, Visual GNG, developed using the Big Data framework Apache Spark, which provides an incremental visualization of the GNG training process, and enables user-in-the-loop interactions where users can pause, resume or steer the computation by changing optimization parameters. Nine case studies were conducted with domain experts from different areas, each working on unique real-world datasets. The results show that Visual GNG contributes to understanding the distribution of multidimensional data; finding which features are relevant in such distribution; estimating the number of k clusters to be used in traditional clustering algorithms, such as K-means; and finding outliers.
Place, publisher, year, edition, pages
SciTePress, 2019. Vol. 3, p. 58-71
Keywords [en]
Growing Neural Gas, Dimensionality Reduction, Multidimensional Data, Visual Analytics, Exploratory Data Analysis
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:hj:diva-43350DOI: 10.5220/0007364000580071Local ID: 0;0;miljJAILISBN: 978-989-758-354-4 (electronic)OAI: oai:DiVA.org:hj-43350DiVA, id: diva2:1297153
Conference
14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, 25 - 27 February, 2019, Prague, Czech Republic
2019-03-192019-03-192021-03-15Bibliographically approved