SOMFlow: Guided Exploratory Cluster Analysis with Self-Organizing Maps and Analytic Provenance

D. Sacha, Matthias Kraus, J. Bernard, M. Behrisch, T. Schreck, Y. Asano, D. Keim

Research output: Contribution to journalArticlepeer-review

Abstract

Clustering is a core building block for data analysis, aiming to extract otherwise hidden structures and relations from raw datasets, such as particular groups that can be effectively related, compared, and interpreted. A plethora of visual-interactive cluster analysis techniques has been proposed to date, however, arriving at useful clusterings often requires several rounds of user interactions to fine-tune the data preprocessing and algorithms. We present a multi-stage Visual Analytics (VA) approach for iterative cluster refinement together with an implementation (SOMFlow) that uses Self-Organizing Maps (SOM) to analyze time series data. It supports exploration by offering the analyst a visual platform to analyze intermediate results, adapt the underlying computations, iteratively partition the data, and to reflect previous analytical activities. The history of previous decisions is explicitly visualized within a flow graph, allowing to compare earlier cluster refinements and to explore relations. We further leverage quality and interestingness measures to guide the analyst in the discovery of useful patterns, relations, and data partitions. We conducted two pair analytics experiments together with a subject matter expert in speech intonation research to demonstrate that the approach is effective for interactive data analysis, supporting enhanced understanding of clustering results as well as the interactive process itself.
Original languageEnglish
Pages (from-to)120-130
JournalIEEE Transactions on Visualization and Computer Graphics
Volume24
Issue number1
DOIs
Publication statusPublished - 2018

Keywords

  • Data analysis
  • Data visualization
  • Density measurement
  • Feature extraction
  • Layout
  • Symmetric matrices
  • Visualization
  • Feature Detection/Selection
  • Matrix Visualization
  • Quality Metrics
  • Relational Data
  • Visual Quality Measures

Fields of Expertise

  • Information, Communication & Computing

Fingerprint

Dive into the research topics of 'SOMFlow: Guided Exploratory Cluster Analysis with Self-Organizing Maps and Analytic Provenance'. Together they form a unique fingerprint.

Cite this