user-centric visual analytics


Page 1: User-Centric Visual Analytics

VALT Chess VA Intro Apps Wrap-up 1/29

User-Centric Visual Analytics

Remco Chang, Tufts University

Page 2: User-Centric Visual Analytics


Human + Computer

• Human vs. Artificial Intelligence: Garry Kasparov vs. Deep Blue (1997)
– Computer takes a “brute force” approach without analysis
– “As for how many moves ahead a grandmaster sees,” Kasparov concludes: “Just one, the best one”

• Artificial vs. Augmented Intelligence: Hydra vs. Cyborgs (2005)
– Grandmaster + 1 chess program > Hydra (equiv. of Deep Blue)
– Amateur + 3 chess programs > Grandmaster + 1 chess program [1]

1. http://www.collisiondetection.net/mt/archives/2010/02/why_cyborgs_are.php

Page 3: User-Centric Visual Analytics


Visual Analytics = Human + Computer

• Visual analytics is “the science of analytical reasoning facilitated by interactive visual interfaces.” [1]

• By definition, it is a collaboration between human and computer to solve problems.

1. Thomas and Cook, “Illuminating the Path”, 2005.

Page 4: User-Centric Visual Analytics


Example: What Does (Wire) Fraud Look Like?

• Financial institutions like Bank of America have legal responsibilities to report all suspicious wire transaction activities (money laundering, supporting terrorist activities, etc.)

• Data size: approximately 200,000 transactions per day (73 million transactions per year)

• Problems:
– Automated approach can only detect known patterns
– Bad guys are smart: patterns are constantly changing
– Data is messy: lack of international standards resulting in ambiguous data

• Current methods:
– 10 analysts monitoring and analyzing all transactions
– Using SQL queries and spreadsheet-like interfaces
– Limited time scale (2 weeks)

Page 5: User-Centric Visual Analytics


WireVis: Financial Fraud Analysis

• In collaboration with Bank of America
– Develop a visual analytical tool (WireVis)
– Visualizes 7 million transactions over 1 year
– Beta-deployed at WireWatch

• A new class of computer science problem:
– Little or no data to train on
– The data is messy and requires human intelligence

• Design philosophy: “combating human intelligence requires better (augmented) human intelligence”

R. Chang et al., Scalable and interactive visual analysis of financial wire transactions for fraud detection. Information Visualization, 2008.

R. Chang et al., Wirevis: Visualization of categorical, time-varying data from financial transactions. IEEE VAST, 2007.

Page 6: User-Centric Visual Analytics


WireVis: A Visual Analytics Approach

Heatmap View (Accounts to Keywords Relationship)

Strings and Beads (Relationships over Time)

Search by Example (Find Similar Accounts)

Keyword Network (Keyword Relationships)

Page 7: User-Centric Visual Analytics


Applications of Visual Analytics

• Political Simulation
– Agent-based analysis
– With DARPA

• Global Terrorism Database
– With DHS

• Bridge Maintenance
– With US DOT
– Exploring inspection reports

• Biomechanical Motion
– Interactive motion comparison

R. Chang et al., Two Visualization Tools for Analysis of Agent-Based Simulations in Political Science. IEEE CG&A, 2012

Page 8: User-Centric Visual Analytics


Applications of Visual Analytics

[Figure: panels labeled Where, When, Who, What, Original Data, and Evidence Box]

R. Chang et al., Investigative Visual Analysis of Global Terrorism, Journal of Computer Graphics Forum, 2008.


Page 9: User-Centric Visual Analytics


Applications of Visual Analytics

R. Chang et al., An Interactive Visual Analytics System for Bridge Management, Journal of Computer Graphics Forum, 2010. To Appear.


Page 10: User-Centric Visual Analytics


Applications of Visual Analytics

R. Chang et al., Interactive Coordinated Multiple-View Visualization of Biomechanical Motion Data , IEEE Vis (TVCG) 2009.


Page 11: User-Centric Visual Analytics


Interdisciplinary Research

• Applied research with individual domains
– Psychology / Cognitive and Brain Sciences
– Biology and Health Care
– Geospatial Information
– Political Science
– Transportation
– etc.
– Nearly every discipline that requires human judgment and decision-making based on large amounts of data

Page 12: User-Centric Visual Analytics


Research at the VALT

• Visual Analytics problems from a User-Centric perspective:

1. One optimal visualization for every user?

2. Can a user’s reasoning process be recorded and stored?

3. Can a user express their domain knowledge quantitatively?

4. Can analysis between multiple people be aggregated?

Page 13: User-Centric Visual Analytics


1. Analysis of Visualization Designs: Is there an optimal visualization?

Page 14: User-Centric Visual Analytics


What’s the Best Visualization for You?

Jürgensmann and Schulz, “Poster: A Visual Survey of Tree Visualization”. InfoVis, 2010.

Page 15: User-Centric Visual Analytics


Results

• Personality Factor: Locus of Control
– (internal => faster/better with containment)
– (external => faster/better with list)

Page 16: User-Centric Visual Analytics


2. Study of Expert Users’ Interactions: Do Interaction Logs Contain Knowledge?

Page 17: User-Centric Visual Analytics


What is in a User’s Interactions?

• Goal: determine if a user’s reasoning and intent are reflected in a user’s interactions.

[Diagram labels: Analysts; Grad Students (Coders); Logged (semantic) Interactions; Compare! (manually); Strategies, Methods, Findings; Guesses of Analysts’ thinking; WireVis; Interaction-Log Vis]

Page 18: User-Centric Visual Analytics


What’s in a User’s Interactions

• From this experiment, we find that interactions contain at least:
– 60% of the (high level) strategies
– 60% of the (mid level) methods
– 79% of the (low level) findings

R. Chang et al., Recovering Reasoning Process From User Interactions. CG&A, 2009.

R. Chang et al., Evaluating the Relationship Between User Interaction and Financial Visual Analysis. VAST, 2009.

Page 19: User-Centric Visual Analytics


3. Quantifying Domain Knowledge: Can Knowledge be Represented Quantitatively?

Page 20: User-Centric Visual Analytics


Direct Manipulation of Visualization

Linear distance function:

Optimization:

Page 21: User-Centric Visual Analytics


Results

• Tells the domain expert which dimensions of the data they care about, and which dimensions are not useful!

[Chart: final weights of the distance function (y-axis) by dimension number (x-axis); blue: original data dimensions, red: randomly added dimensions]

• Using the “Wine” dataset (13 dimensions, 3 clusters)
– Assume a linear (sum of squares) distance function

• Added 10 extra dimensions, and filled them with random values
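The linear (sum of squares) distance function assumed above can be written along the following lines. This is only a sketch: the symbols (θ_k for the weight of dimension k, d for the number of dimensions, here 13 original plus 10 random) are assumed for illustration and are not taken from the slides.

```latex
D_\theta(x_i, x_j) = \sum_{k=1}^{d} \theta_k \, (x_{ik} - x_{jk})^2, \qquad \theta_k \ge 0
```

Under this form, a final weight near zero means that dimension k has almost no influence on the distance between two data points.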

Page 22: User-Centric Visual Analytics


4. Examining Collaborative Analysis: Can Individual Analysis be Aggregated?

Page 23: User-Centric Visual Analytics


For Example:

• 2 analysts, A and B, each performed an analysis on the same data

A0 A1 A2 A3 A4 A5

B0 B1 B2 B3 B4

Page 24: User-Centric Visual Analytics


For Example:

• If A2 is the same as B1 (in that they represent the same analysis step)…

A0 A1 [A2] A3 A4 A5

B0 [B1] B2 B3 B4

Page 25: User-Centric Visual Analytics


For Example:

• We will merge the two nodes

A0 A1 [A2B1] A3 A4 A5

B0 [A2B1] B2 B3 B4

Page 26: User-Centric Visual Analytics


Example Results:

• This process is repeated for all analysis trails across all analysts, yielding a single temporal graph.
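The merging step can be sketched in a few lines of Python. This is a minimal illustration, not the system's implementation: the function name `merge_trails` is hypothetical, and two steps are treated as "the same analysis step" simply when their labels are equal, which is an assumption (the slides do not say how equality is decided).

```python
from collections import defaultdict


def merge_trails(trails):
    """Merge per-analyst trails into one directed graph.

    trails maps an analyst name to a time-ordered list of state labels.
    Assumption: equal labels denote the same analysis step.
    """
    edges = defaultdict(set)   # state -> states that directly follow it
    visits = defaultdict(set)  # state -> analysts whose trail contains it
    for analyst, trail in trails.items():
        for state in trail:
            visits[state].add(analyst)
            edges[state]  # touch the key so final states also become nodes
        for src, dst in zip(trail, trail[1:]):
            edges[src].add(dst)
    return dict(edges), dict(visits)


# Analysts A and B from the example; A2 and B1 are the same step,
# so both trails use the shared label "A2B1" for it.
trails = {
    "A": ["A0", "A1", "A2B1", "A3", "A4", "A5"],
    "B": ["B0", "A2B1", "B2", "B3", "B4"],
}
graph, visits = merge_trails(trails)
print(sorted(graph["A2B1"]))   # states reachable in one step from the merged node
print(sorted(visits["A2B1"]))  # analysts who passed through it
```

Because the merged node keeps the outgoing edges of both trails, the graph records that the same state led analyst A to A3 and analyst B to B2.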

Page 27: User-Centric Visual Analytics


Summary

Page 28: User-Centric Visual Analytics


Summary

• While Visual Analytics has grown and is slowly finding its identity,

• There are still many open problems that need to be addressed.

• I propose that one largely unexplored research area is understanding and supporting the human user.

Page 29: User-Centric Visual Analytics


Page 30: User-Centric Visual Analytics


Backup Slides…

Page 31: User-Centric Visual Analytics


1. How Personality Influences Compatibility with Visualization Style

Page 32: User-Centric Visual Analytics


What’s the Best Visualization for You?

Jürgensmann and Schulz, “Poster: A Visual Survey of Tree Visualization”. InfoVis, 2010.

Page 33: User-Centric Visual Analytics


What’s the Best Visualization for You?

• Intuitively, not everyone is created equal.
– Our background, experience, and personality should affect how we perceive and understand information.

• So why should our visualizations be the same for all users?

Page 34: User-Centric Visual Analytics


Cognitive Profile

• Objective: to create personalized information visualizations based on individual differences

• Hypothesis: cognitive factors affect a person’s ability (speed and accuracy) in using different visualizations.

Page 35: User-Centric Visual Analytics


Experiment Procedure

• 250 participants using Amazon’s Mechanical Turk

• Questionnaire on “locus of control” (LOC)

• 4 visualizations of hierarchical data
– From list-like view to containment view

Page 36: User-Centric Visual Analytics


Results

• Internal LOC users are significantly faster and more accurate with list view (V1) than containment view (V2) in complex information retrieval (inferential) tasks

Page 37: User-Centric Visual Analytics


Conclusion

• Cognitive factors can affect how a user perceives and understands information from a visualization

• The effect could be significant in terms of both efficiency and accuracy

• Personalized displays should take into account a user’s cognitive profile

R. Chang et al., How Locus of Control Influences Compatibility with Visualization Style , IEEE VAST 2011.

Page 38: User-Centric Visual Analytics


2. Manipulating a User’s Ability

Page 39: User-Centric Visual Analytics


What We Know About LOC and Visualization:

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Page 40: User-Centric Visual Analytics


We Also Know:

• Based on Psychology research, we know that locus of control can be temporarily affected through priming

• For example, to reduce locus of control (to make someone have a more external LOC)

“We know that one of the things that influence how well you can do everyday tasks is the number of obstacles you face on a daily basis. If you are having a particularly bad day today, you may not do as well as you might on a day when everything goes as planned. Variability is a normal part of life and you might think you can’t do much about that aspect. In the space provided below, give 3 examples of times when you have felt out of control and unable to achieve something you set out to do. Each example must be at least 100 words long.”

Page 41: User-Centric Visual Analytics


Research Question

• Known Facts:
1. There is a relationship between LOC and use of visualization
2. LOC can be primed

• Research Question:
– If we can affect the user’s LOC, will that affect their use of visualization?

Page 42: User-Centric Visual Analytics


LOC and Visualization

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Condition 1: Make Internal LOC more like External LOC

Page 43: User-Centric Visual Analytics


LOC and Visualization

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Condition 2: Make External LOC more like Internal LOC

Page 44: User-Centric Visual Analytics


LOC and Visualization

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Condition 3: Make 50% of the Average LOC more like Internal LOC

Condition 4: Make 50% of the Average LOC more like External LOC

Page 45: User-Centric Visual Analytics


Result

• Yes, users’ behaviors can be altered by priming their LOC! However, this is only true for:
– Speed (not accuracy)
– Complex tasks (inferential tasks)

Page 46: User-Centric Visual Analytics


Effects of Priming (Condition 2)

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

External -> Internal

Page 47: User-Centric Visual Analytics


Effects of Priming (Condition 3)

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Average -> External

Page 48: User-Centric Visual Analytics


Effects of Priming (Condition 4)

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Average -> Internal

Page 49: User-Centric Visual Analytics


Effects of Priming (Condition 1)

[Chart: Performance (Poor to Good) against Visual Form, from List-View (V1) to Containment (V4), for Internal LOC, External LOC, and Average LOC]

Internal -> External

Page 50: User-Centric Visual Analytics


Conclusion

• Cognitive factors can affect how a user perceives and understands information from a visualization in efficiency and accuracy.

• This relationship appears to be a direct correlation: by priming a user’s locus of control, we can alter their behavior in a controlled manner.

• Future work: determine if the interaction patterns are different between the groups. We care about interaction patterns because user reasoning can be inferred from them…

R. Chang et al., Locus of Control and Visualization Layout, IEEE TVCG 2012. In submission

Page 51: User-Centric Visual Analytics


3. What’s In a User’s Interactions?

Page 52: User-Centric Visual Analytics


What is in a User’s Interactions?

• Types of Human-Visualization Interactions
– Word editing (input heavy, little output)
– Browsing, watching a movie (output heavy, little input)
– Visual Analysis (closer to 50-50)

• Challenge: Can we capture and extract a user’s reasoning and intent through capturing a user’s interactions?

[Diagram: the Visualization sends Output to the Human as images (monitor); the Human sends Input to the Visualization via keyboard, mouse, etc.]

Page 53: User-Centric Visual Analytics


What is in a User’s Interactions?

• Goal: determine if a user’s reasoning and intent are reflected in a user’s interactions.

[Diagram labels: Analysts; Grad Students (Coders); Logged (semantic) Interactions; Compare! (manually); Strategies, Methods, Findings; Guesses of Analysts’ thinking; WireVis; Interaction-Log Vis]

Page 54: User-Centric Visual Analytics


What’s in a User’s Interactions

• From this experiment, we find that interactions contain at least:
– 60% of the (high level) strategies
– 60% of the (mid level) methods
– 79% of the (low level) findings

R. Chang et al., Recovering Reasoning Process From User Interactions. CG&A, 2009.

R. Chang et al., Evaluating the Relationship Between User Interaction and Financial Visual Analysis. VAST, 2009.

Page 55: User-Centric Visual Analytics


What’s in a User’s Interactions

• Why are these so much lower than others?
– (recovering “methods” at about 15%)

• Only capturing a user’s interaction in this case is insufficient.

Page 56: User-Centric Visual Analytics


Conclusion

• A high percentage of a user’s reasoning and intent are reflected in a user’s interactions.

• Raises lots of questions: (a) what is the upper bound, (b) how to automate the process, (c) how to utilize the captured results, etc.

• This study is not exhaustive. It merely provides a sample point of what is possible.

R. Chang et al., Analytic Provenance Panel at IEEE VisWeek. 2011

Page 57: User-Centric Visual Analytics


4. Is Domain Knowledge Quantifiable?

Page 58: User-Centric Visual Analytics


Find Distance Function, Hide Model Inference

• Problem Statement: Given a high dimensional dataset from a domain expert, how does the domain expert create a good distance function?

• Assumption: The domain expert knows about the data, but cannot express it mathematically

Page 59: User-Centric Visual Analytics


Working with Domain Experts

• Observation: a domain expert doesn’t know how to visualize their own data (what is the appropriate way to visualize it)

• However, when they see a visualization, they can tell what’s WRONG with the data (and why)

• So we start by making a “guess visualization” (that is, we guess a distance function and produce a visualization)

Page 60: User-Centric Visual Analytics


Direct Manipulation of Visualization

• Our approach allows the expert to directly move the elements of the visualization to what they think is “right”.

Page 61: User-Centric Visual Analytics


Direct Manipulation of Visualization

• The process is repeated a few times until the expert is happy (or the visualization cannot be improved further)

• The system outputs a new distance function!

Page 62: User-Centric Visual Analytics


Our Approach

• Given:
1. A weighted distance function (linear, quadratic, etc.)
2. What it means to move a point from one location to another (is it moving closer to a cluster? Or away from some other points?)

• We iteratively solve for the best weights to the distance function

Linear distance function:

Optimization:

Page 63: User-Centric Visual Analytics


System Overview

Page 64: User-Centric Visual Analytics


Results

• Used the “Wine” dataset (13 dimensions, 3 clusters)
– Assume a linear (sum of squares) distance function

• Added 10 extra dimensions, and filled them with random values

• Interactively moved the “bad” points

[Chart: final weights of the distance function (y-axis) by dimension number (x-axis); blue: original data dimensions, red: randomly added dimensions]
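As a rough illustration of this kind of experiment, the sketch below learns per-dimension weights for a weighted sum-of-squares distance. Everything in it is an assumption made for illustration: the data is synthetic rather than the Wine dataset, the group labels stand in for the expert moving points, and `learn_weights` uses a simple multiplicative update, not the optimization used in the actual system.

```python
import math
import random


def weighted_sq_distance(a, b, weights):
    """Weighted sum-of-squares distance between two points."""
    return sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b))


def learn_weights(points, labels, steps=5, rate=0.01):
    """Raise weights of dimensions that separate the groups (hypothetical rule)."""
    dims, n = len(points[0]), len(points)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    same = [p for p in pairs if labels[p[0]] == labels[p[1]]]
    diff = [p for p in pairs if labels[p[0]] != labels[p[1]]]

    def mean_sq_gap(pair_list, k):
        total = sum((points[i][k] - points[j][k]) ** 2 for i, j in pair_list)
        return total / len(pair_list)

    # Gradient of (mean same-group distance - mean cross-group distance)
    # with respect to each weight; it does not depend on the weights.
    grad = [mean_sq_gap(same, k) - mean_sq_gap(diff, k) for k in range(dims)]

    weights = [1.0 / dims] * dims
    for _ in range(steps):
        weights = [w * math.exp(-rate * g) for w, g in zip(weights, grad)]
        total = sum(weights)
        weights = [w / total for w in weights]  # keep weights summing to 1
    return weights


# Synthetic data: 2 informative dimensions followed by 3 random ones.
rng = random.Random(0)
points, labels = [], []
for label, center in enumerate([0.0, 10.0]):
    for _ in range(20):
        informative = [rng.gauss(center, 1.0) for _ in range(2)]
        noise = [rng.uniform(0.0, 1.0) for _ in range(3)]
        points.append(informative + noise)
        labels.append(label)

weights = learn_weights(points, labels)
print([round(w, 3) for w in weights])
print(weighted_sq_distance(points[0], points[-1], weights))
```

In this toy setup the random dimensions end up with much smaller weights than the informative ones, which is the qualitative pattern the blue and red bars describe.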

Page 65: User-Centric Visual Analytics


Conclusion

• With an appropriate projection model, it is possible to quantify a user’s interactions.

• In our system, we let the domain expert interact with a familiar representation of the data (scatter plot), and hide the ugly math (distance function)

• The system “reveals” the domain knowledge of the user.

R. Chang et al., Find Distance Function, Hide Model Inference. IEEE VAST Poster 2011

Page 66: User-Centric Visual Analytics


Human + Computer: Dimension Reduction – Lost in Translation

• Dimension reduction using principal component analysis (PCA)

• Quick Refresher of PCA
– Find most dominant eigenvectors as principal components
– Data points are re-projected into the new coordinate system
• For reducing dimensionality
• For finding clusters

• For many (especially novices), PCA is easy to understand mathematically, but difficult to understand “semantically”.

[Figure: axes labeled age, height, and GPA]

0.5*GPA + 0.2*age + 0.3*height = ?
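To make the "semantic" difficulty concrete, the sketch below computes the first principal component of a tiny made-up table of (GPA, age, height) records. The records and the helper `top_component` are hypothetical, and power iteration is used only to keep the example dependency-free; a real analysis would normally use a linear algebra library.

```python
import math


def top_component(rows, iterations=200):
    """Dominant eigenvector of the covariance matrix, found by power iteration."""
    n, dims = len(rows), len(rows[0])
    means = [sum(r[k] for r in rows) / n for k in range(dims)]
    centered = [[r[k] - means[k] for k in range(dims)] for r in rows]
    cov = [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(dims)]
        for a in range(dims)
    ]
    vec = [1.0] * dims
    for _ in range(iterations):
        vec = [sum(cov[a][b] * vec[b] for b in range(dims)) for a in range(dims)]
        norm = math.sqrt(sum(v * v for v in vec))
        vec = [v / norm for v in vec]
    return means, vec


# Hypothetical records: (GPA, age in years, height in cm)
rows = [(3.1, 19, 170), (3.6, 22, 182), (2.8, 20, 165), (3.9, 24, 190), (3.3, 21, 175)]
means, vec = top_component(rows)

# Re-project each record onto the new axis: one number per record.
scores = [sum(v * (x - m) for v, x, m in zip(vec, r, means)) for r in rows]

print("component weights (GPA, age, height):", [round(v, 3) for v in vec])
print("projected scores:", [round(s, 2) for s in scores])
```

The new axis is a weighted mix of GPA, age, and height, so each projected score is mathematically well defined but has no obvious real-world name, which is the point of the question mark above.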

Page 67: User-Centric Visual Analytics


Human + Computer: Exploring Dimension Reduction: iPCA

R. Chang et al., iPCA: An Interactive System for PCA-based Visual Analytics. Computer Graphics Forum (Eurovis), 2009.

Page 68: User-Centric Visual Analytics


5. How to Aggregate Multiple Analyses to Perform Group Analytics

Page 69: User-Centric Visual Analytics


Scaling Human Computation

• Problem Statement: Computing can be scaled (by adding more CPUs). Visualizations can be scaled (by adding more monitors). Can analysis be scaled by adding more humans?

• Assumption: Conventional wisdom says that humans cannot be scaled because of difficulty in communicating analytical reasoning efficiently.

Page 70: User-Centric Visual Analytics


Temporal Graph

• Research Proposal: We propose a Temporal Graph approach to model analytical trails. In a temporal graph,

– Node = a unique state in the visual analysis trail.

– Edge = a (temporal) transition from one state to another.

Page 71: User-Centric Visual Analytics


With a Temporal Graph…

• We can answer many questions. For example:

– Given a particular outcome (the yellow states), is there a catalyst state from which every subsequent analysis trail starts?
• The answer is yes:
• The red states are “points of no return”
• The green states are the “last decision points”
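One possible reading of these two terms can be sketched as a small graph computation. The definitions below are assumptions, not taken from the slides: a "point of no return" is taken to be a state from which every path ends in the outcome, and a "last decision point" a state that is not yet committed but has at least one committed successor. The function `classify_states` and the example graph are hypothetical, and the graph is assumed to be acyclic.

```python
def classify_states(graph, outcomes):
    """Split states by whether the outcome is unavoidable from them.

    graph maps each state to the list of states that can follow it.
    Assumption: the graph has no cycles.
    """
    committed = {}

    def is_committed(state):
        # A state is committed if it is an outcome, or if it has successors
        # and all of them are committed.
        if state not in committed:
            nxt = graph.get(state, [])
            committed[state] = state in outcomes or (
                bool(nxt) and all(is_committed(s) for s in nxt)
            )
        return committed[state]

    no_return = {s for s in graph if is_committed(s) and s not in outcomes}
    last_decision = {
        s
        for s in graph
        if not is_committed(s) and any(is_committed(n) for n in graph[s])
    }
    return no_return, last_decision


graph = {
    "start": ["a", "b"],
    "a": ["c"],
    "b": ["c", "dead_end"],
    "c": ["outcome"],
    "outcome": [],
    "dead_end": [],
}
no_return, last_decision = classify_states(graph, {"outcome"})
print(sorted(no_return))      # states from which the outcome is unavoidable
print(sorted(last_decision))  # states where a different choice still avoids it
```

This relies only on a standard depth-first traversal with memoization, which is the kind of known graph algorithm the temporal graph formulation makes available.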

Page 72: User-Centric Visual Analytics


Conclusion

• There are many benefits to posing analysis trails as a temporal graph problem.

• Mostly, the benefit comes from our ability to apply known graph algorithms.

• Incidentally, this temporal graph formulation can be applied to visualize and analyze other problems involving large state space.

• Poster to be presented at VAST 2011