visual analytics - introduction › courses › visualisierung1 › 2015... · 2015-12-13 · data...

46
Visual Analytics - Introduction Eduard Gröller Institute of Computer Graphics and Algorithms Vienna University of Technology

Upload: others

Post on 23-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Visual Analytics -Introduction

Eduard Gröller

Institute of Computer Graphics and Algorithms

Vienna University of Technology

Page 2: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Goals of VA

Creation of tools and techniques to enable people to:

Synthesize information and derive insight from massive, dynamic, ambiguous, and often conflicting data

Detect the expected and discover the unexpected

Provide timely, defensible, and understandable assessments

Communicate these assessment effectively for action2

[VisMaster, 2010]

Page 3: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

What is Visual Analytics?

“Visual Analytics is the science of analytical reasoning supported by a highly interactive visual interface.” [Wong and Thomas 2004]

“Visual Analytics combines automated analysistechniques with interactive visualisations for an effective understanding, reasoning and decision making on the basis of very large and complexdatasets” [Keim 2010]

Detect the expected and discover the unexpected

Page 4: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Visual Analytics Process

First step: preprocess and transform dataData cleaning, normalization, grouping, data fusion

Automated methods+ Scale well‐ Get stuck in local optima‐ Run in a black box fashion

Visualization+ Interactive data analysis‐ Scalability

Visual Analytics integrates bothTied together by the userAlternating between visual and automatic methods

[Keim 2006]

Page 5: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Interdisciplinary!

Page 6: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Challenges

DataDealing with very large, diverse, variable quality datasets

Users Meeting the needs of the users

DesignAssisting designers of visual analytic systems

TechnologyProviding the necessary infrastructure

6

Page 7: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Data Mining Definition

Automatic algorithmic extraction of valuable information from raw data

Page 8: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Knowledge Discovery and Data Mining (KDD)

Semi or fully automated analysis of massive data sets

Contributions are more about general methodologies

Black‐box methods in the hands of end usersUsers need to understand the algorithms for using themWhat attributes to use? What similarity measure? etc.Often trial and error

Page 9: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

The Ability Matrix

adapted from Daniel Keim, Uni. Konstanz

9

Page 10: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Why Graphics?

Figures are richer; provide more information with less clutter and in less space.

Figures provide the 'gestalt‘ effect: they give an overview; make structure more visible.

Figures are more accessible, easier to understand, faster to grasp, more comprehensible, more memorable, more fun, and less formal.

list adapted from: [Stasko et al. 1998]

10

Page 11: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Statistics vs. Visualization:Anscombe’s Quartet

11

Page 12: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Statistics vs. Visualization:Anscombe’s Quartet

Statistics profile is the same for all!

12

Page 13: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Anscombe’s Quartet

Four datasets that have identical simple statistical properties, yet appear very different when graphed.

Wikimedia Commons

13

Page 14: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Visualization Can Be Biased

14

The same data plotted with different scales is perceived dramatically differently.

(a) Equally (uniformly) large scale in both x and y

(b) Large scale in x

(c) Large scale in y (d) Scale determined by range of x‐ and y‐values.

[Ward, Grinstein, Keim 2011]

Page 15: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Diagram vs. Visualization

A diagram represents information.A visualization represents data.

15

Page 16: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Mantras

Guide to visually explore data  ‐ Shneiderman‘s Mantra:Overview first, zoom/filter, details on demand

Describes how data should be presented on screen

For massive datasets it is difficult to create overview without loosing interesting patterns

Extended Mantra for VA:Analyse first, show the important, zoom/filter, analysefurther, details on demand

16

[Shneiderman, 1996]

[Keim, 2006]

Page 17: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Traditional Data Mining vs. Visual Analysis Processes

Page 18: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

KDD Pipeline

18

[Fayyad 1996]

Visualization Pipeline

[dos Santos and Brodlie 2004]

Page 19: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Uncertainty

What is not surrounded by uncertainty cannot be the truth [Richard Feynman]

True genius resides in the capacity for evaluation of uncertain, hazardous, and conflicting information [Winston Churchill]

Doubt is not a pleasant condition, but certainty is absurd [Voltaire]

Eduard Gröller 19

Page 20: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Definition“Degree to which the lack of knowledge about the amount of error is responsible for hesitancy in accepting results and observations with caution”  [Hunter 1993]

Measurement datae.g., DNA microarray expression data

Can be handled in data or view space

Uncertainty

[Holzhüter 2010]

Page 21: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Data Management Challenges

“ Big Data“UncertaintySemantics ManagementData StreamingDistributed and Collaborative VAVA for the Masses

21

Page 22: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

What is “ Big Data“?

Moving target

Fields dealing with this kind of data:

MeteorologyGenomicsConnectomicsComplex physics simulationsBiological and environmental researchBusiness intelligence

22

http://en.wikipedia.org/wiki/Template:Quantities_of_bits

Page 23: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Visual Steering to Support Decision Making in Visdom

Jürgen Waserhttp://www.cg.tuwien.ac.at/research/publications/2011/waser_2011_VSD/

http://www.visdom.at/

Page 24: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Jürgen Waser Visual Steering to Support Decision Making in 24

Flood emergency assistance  

New Orleans 2005: 17th canal levee breach

Image courtesy of USACE, US Army Corps of Engineers

Page 25: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Jürgen Waser Visual Steering to Support Decision Making in 25

Flood emergency assistance  

Testing sandbag configurations in a virtual environment 

Page 26: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Jürgen Waser Visual Steering to Support Decision Making in 26

Solution: World Lines

Page 27: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Jürgen Waser Visual Steering to Support Decision Making in 27

Solution: World Lines

Page 28: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Jürgen Waser Visual Steering to Support Decision Making in 28

Video

Page 29: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Worldlines – Multiple Linked Views

Eduard Gröller 29

Page 30: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Worldlines – Multiple Linked Views

Eduard Gröller 30

Page 31: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

http://www.VRVis.at/

SimVis: Interactive Visual Analysis of Large & Complex

Simulation Data

Dr. Helmut DoleischVRVis Research Center

Page 32: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Helmut Doleischhttp://www.simvis.at/

SimVis: Interactive Visual Analysis of Large & Complex Simulation Data

Motivation large data sets from simulation goal: support exploration and

analysis of results analyze n-dim. data interactively use 3D visualization overview, zoom and filter, detail

on demand (Shneidermans’ information seeking mantra)

challenge: occlusion interactive data handling

Page 33: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Helmut Doleischhttp://www.simvis.at/

SimVis: Interactive Visual Analysis of Large & Complex Simulation Data

Interactive Data Handling sample data set size: 540 million data items currently working to expand to billions

Page 34: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Helmut Doleischhttp://www.simvis.at/

SimVis: Interactive Visual Analysis of Large & Complex Simulation Data

SimVis VRVis´ solution for these challenges Feature-based visualization framework

SimVis key features: Multiple, linked views Interactive feature specification Focus+Context visualization Smooth feature boundaries Explicit feature representation On-the-fly attribute derivation

Page 35: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Helmut Doleischhttp://www.simvis.at/

SimVis: Interactive Visual Analysis of Large & Complex Simulation Data

SimVis: Multiple Views Scatterplots, histogram, 3D(4D) view, etc.

an attribute

cell

coun

t

in 2D,also in 3D

an attribute

another attribute

3D+time+color

+opactiy

Page 36: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Helmut Doleischhttp://www.simvis.at/

36 SimVis: Interactive Visual Analysis of Large & Complex Simulation Data

Move/alter/extend brush interactively

Update linked F+Cviews in real-time

Brushing

–pressure–»

–TK

E–»

–vel.–»

color: temp.

Page 37: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

VAICo: Visual Analysis forImage ComparisonJohanna Schmidt1, M. Eduard Gröller1, 

Stefan Bruckner2

1Vienna University of Technology, Austria2University of Bergen, Norway

Page 38: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

VAICo – Example Video

38

Page 39: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

YMCA - Your Mesh Comparison Application

39[Johanna Schmidt et al. ]

Page 40: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Literature on Visual Analytics

Daniel A. Keim, Jörn Kohlhammer, Geoffrey Ellis and Florian Mansmann: Mastering the Information Age ‐ Solving Problems with Visual Analytics, Eurographics Association, 2010. ISBN: 978‐3905673777. Free download: http://www.vismaster.eu/book/

James J. Thomas and Kristin A. Cook : Illuminating the Path: The Research and Development Agenda for Visual Analytics, National Visualization and Analytics Ctr, 2005.ISBN: 978‐0769523231Free download: http://vis.pnnl.gov/ 40

Page 41: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Literature on Visualization

Heidrun Schumann, Wolfgang Müller:Visualisierung ‐ Grundlagenund allgemeine Methoden, Springer Verlag, 2000. ISBN: 3540649441

Alexandru Telea: Data Visualizaton – Principles and Practice,AK Peters Verlag, 2008.ISBN: 9781568813066

Matthew Ward, George Grinstein, Daniel Keim: Interactive Data Visualization: Foundations, Techniques, and Applications, AK Peters Verlag, 2010.ISBN: 1568814739

41

Page 42: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Literature on Scientific Visualization

Eduard Gröller 42

Page 43: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Literature on Information Visualization

Colin Ware: Information Visualization, Second Edition: Perception for Design, Morgan Kaufmann, 2nd edition,  2004. ISBN: 1558608192

Robert Spence: Information Visualization ‐ Design for Interaction,Pearson Verlag, 2001.ISBN13: 9780132065504

43

Wolfgang Aigner, Silvia Miksch, Heidrun Schumann, Christian Tominski: Visualization of Time‐Oriented Data,Springer Verlag, 2011.ISBN13: 978‐0857290786

Page 44: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Acknowledgements

For material for this lecture unitMarc Streit, Johannes Kepler University Linz

Eduard Gröller, Helwig Hauser 44

Page 45: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Praktika, Bachelorarbeiten, Diplomarbeiten

http://www.cg.tuwien.ac.at/courses/projekte/

Eduard Gröller 45

Page 46: Visual Analytics - Introduction › courses › Visualisierung1 › 2015... · 2015-12-13 · Data cleaning, normalization, grouping, data fusion Automated methods + Scale well ‐Get

Christmas Tree Case Study

Eduard Gröller 46