Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A comparative user study of visualization techniques for cluster analysis of multidimensional data sets
School of Informatics, University of Skövde, Skövde, Sweden.
Jönköping University, School of Engineering, JTH, Department of Computer Science and Informatics. School of Informatics, University of Skövde, Skövde, Sweden.ORCID iD: 0000-0003-2900-9335
2020 (English)In: Information Visualization, ISSN 1473-8716, E-ISSN 1473-8724, Vol. 19, no 4, p. 318-338Article in journal (Refereed) Published
Abstract [en]

This article presents an empirical user study that compares eight multidimensional projection techniques for supporting the estimation of the number of clusters, 𝑘, embedded in six multidimensional data sets. The selection of the techniques was based on their intended design, or use, for visually encoding data structures, that is, neighborhood relations between data points or groups of data points in a data set. Concretely, we study: the difference between the estimates of 𝑘 as given by participants when using different multidimensional projections; the accuracy of user estimations with respect to the number of labels in the data sets; the perceived usability of each multidimensional projection; whether user estimates disagree with 𝑘 values given by a set of cluster quality measures; and whether there is a difference between experienced and novice users in terms of estimates and perceived usability. The results show that: dendrograms (from Ward’s hierarchical clustering) are likely to lead to estimates of 𝑘 that are different from those given with other multidimensional projections, while Star Coordinates and Radial Visualizations are likely to lead to similar estimates; t-Stochastic Neighbor Embedding is likely to lead to estimates which are closer to the number of labels in a data set; cluster quality measures are likely to produce estimates which are different from those given by users using Ward and t-Stochastic Neighbor Embedding; U-Matrices and reachability plots will likely have a low perceived usability; and there is no statistically significant difference between the answers of experienced and novice users. Moreover, as data dimensionality increases, cluster quality measures are likely to produce estimates which are different from those perceived by users using any of the assessed multidimensional projections. It is also apparent that the inherent complexity of a data set, as well as the capability of each visual technique to disclose such complexity, has an influence on the perceived usability. 

Place, publisher, year, edition, pages
Sage Publications, 2020. Vol. 19, no 4, p. 318-338
Keywords [en]
Cluster patterns, data structure, multidimensional data, user study, visualization, Embeddings, Hierarchical clustering, Stochastic systems, Usability engineering, Data dimensionality, Multi-dimensional datasets, Neighborhood relation, Projection techniques, Radial visualization, Statistically significant difference, Stochastic neighbor embedding, Visualization technique, Data visualization
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:hj:diva-50071DOI: 10.1177/1473871620922166ISI: 000545375800001Scopus ID: 2-s2.0-85087437127Local ID: HOA JTH 2020OAI: oai:DiVA.org:hj-50071DiVA, id: diva2:1454214
Available from: 2020-07-15 Created: 2020-07-15 Last updated: 2024-07-16Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Riveiro, Maria

Search in DiVA

By author/editor
Riveiro, Maria
By organisation
JTH, Department of Computer Science and Informatics
In the same journal
Information Visualization
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 125 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf