provenanceintroapplicationpersonalitydist funcwrap-up 1/36 user-centric visual analytics remco chang...

45
Provenan ce Intro Applicat ion Personal ity Dist Func Wrap-up 36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

Upload: ernest-roberts

Post on 17-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up1/36

User-Centric Visual Analytics

Remco Chang

Tufts UniversityDepartment of Computer Science

Page 2: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up2/36

Human + Computer

• Human vs. Artificial IntelligenceGarry Kasparov vs. Deep Blue (1997)– Computer takes a “brute force” approach

without analysis– “As for how many moves ahead a grandmaster

sees,” Kasparov concludes: “Just one, the best one”

• Artificial vs. Augmented IntelligenceHydra vs. Cyborgs (2005)– Grandmaster + 1 chess program > Hydra

(equiv. of Deep Blue)– Amateur + 3 chess programs > Grandmaster +

1 chess program1

1. http://www.collisiondetection.net/mt/archives/2010/02/why_cyborgs_are.php

Page 3: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up3/36

Visual Analytics = Human + Computer

• Visual analytics is "the science of analytical reasoning facilitated by visual interactive interfaces.“ 1

• By definition, it is a collaboration between human and computer to solve problems.

1. Thomas and Cook, “Illuminating the Path”, 2005.

Page 4: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up4/36

Applications of Visual Analytics

• Wire Fraud Detection– With Bank of America

• Global Terrorism Database– With DHS

• Bridge Maintenance – With US DOT– Exploring inspection

reports

• Biomechanical Motion– Interactive motion

comparisonR. Chang et al., Scalable and interactive visual analysis of financial wire transactions for fraud detection. Information Visualization,2008.

Page 5: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up5/36

Applications of Visual AnalyticsWhere

When

Who

What

Original Data

EvidenceBox

R. Chang et al., Investigative Visual Analysis of Global Terrorism, Journal of Computer Graphics Forum, 2008.

• Wire Fraud Detection– With Bank of America

• Global Terrorism Database– With DHS

• Bridge Maintenance – With US DOT– Exploring inspection

reports

• Biomechanical Motion– Interactive motion

comparison

Page 6: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up6/36

Applications of Visual Analytics

R. Chang et al., An Interactive Visual Analytics System for Bridge Management, Journal of Computer Graphics Forum, 2010. To Appear.

• Wire Fraud Detection– With Bank of America

• Global Terrorism Database– With DHS

• Bridge Maintenance – With US DOT– Exploring inspection

reports

• Biomechanical Motion– Interactive motion

comparison

Page 7: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up7/36

Applications of Visual Analytics

R. Chang et al., Interactive Coordinated Multiple-View Visualization of Biomechanical Motion Data , IEEE Vis (TVCG) 2009.

• Wire Fraud Detection– With Bank of America

• Global Terrorism Database– With DHS

• Bridge Maintenance – With US DOT– Exploring inspection

reports

• Biomechanical Motion– Interactive motion

comparison

Page 8: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up8/36

Human + Computer:Dimension Reduction – Lost in Translation• Dimension reduction using principle component analysis (PCA)

• Quick Refresher of PCA– Find most dominant eigenvectors as principle components– Data points are re-projected into the new coordinate system

• For reducing dimensionality• For finding clusters

• For many (especially novices), PCA is easy to understand mathematically, but difficult to understand “semantically”.

age

heig

ht

GPA0.5*GPA + 0.2*age + 0.3*height = ?

Page 9: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up9/36

Human + Computer:Exploring Dimension Reduction: iPCA

R. Chang et al., iPCA: An Interactive System for PCA-based Visual Analytics. Computer Graphics Forum (Eurovis), 2009.

Page 10: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up10/36

Talk Outline

• Discuss 4 Visual Analytics problems from a User-Centric perspective:

1. What is a “good” visualization?

2. Why is interaction good? What is in a user’s interaction?

3. Can a user express knowledge through interactions?

4. Can we scale human computation with more analysts?

Page 11: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up11/36

1. What is a “good” visualization?

How Personality Influences Compatibility with Visualization Style

Page 12: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up12/36

What’s the Best Visualization for You?

Jürgensmann and Schulz, “Poster: A Visual Survey of Tree Visualization”. InfoVis, 2010.

Page 13: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up13/36

What’s the Best Visualization for You?

• Intuitively, not everyone is created equal.– Our background, experience, and

personality should affect how we perceive and understand information.

• So why should our visualizations be the same for all users?

Page 14: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up14/36

Cognitive Profile

• Objective: to create personalized information visualizations based on individual differences

• Hypothesis: cognitive factors affect a person’s ability (speed and accuracy) in using different visualizations.

Page 15: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up15/36

Experiment Procedure

• 250 participants using Amazon’s Mechanical Turk

• Questionnaire on “locus of control” (LOC)

• 4 visualizations on hierarchical visualization– From list-like view to containment view

Page 16: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up16/36

Results

• Internal LOC users are significantly faster and more accurate with list view than containment view in complex information retrieval tasks

Page 17: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up17/36

Conclusion

• Cognitive factors can affect how a user perceives and understands information from a visualization

• The effect could be significant in terms of both efficiency and accuracy

• Personalized displays should take into account a user’s cognitive profile

• Paper presented at VAST 2011 (honorable best-paper award)

Page 18: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up18/36

2. Why is interaction good?

What’s In a User’s Interactions?

Page 19: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up19/36

Human + Computer• Visualizing data

• Human perceptual system

• Capture a user’s interactions in a visual analytics system

• Translate the interactions into something that would affect the computation in a meaningful way

Computer Process(Translate) Human

• Challenge: • Can we capture and extract a user’s

reasoning and intent through capturing a user’s interactions?

Page 20: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up20/36

What is in a User’s Interactions?

• Goal: determine if a user’s reasoning and intent are reflected in a user’s interactions.

Analysts

GradStudents(Coders)

Logged(semantic) Interactions

Compare!(manually)

StrategiesMethodsFindings

Guesses ofAnalysts’ thinking

WireVis Interaction-Log Vis

Page 21: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up21/36

What’s in a User’s Interactions

• From this experiment, we find that interactions contains at least:– 60% of the (high level) strategies– 60% of the (mid level) methods– 79% of the (low level) findings

R. Chang et al., Recovering Reasoning Process From User Interactions. CG&A, 2009.R. Chang et al., Evaluating the Relationship Between User Interaction and Financial Visual Analysis. VAST, 2009.

Page 22: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up22/36

What’s in a User’s Interactions

• Why are these so much lower than others?– (recovering “methods” at

about 15%)

• Only capturing a user’s interaction in this case is insufficient.

Page 23: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up23/36

Conclusion

• A high percentage of a user’s reasoning and intent are reflected in a user’s interactions.

• Raises lots of question: (a) what is the upper-bound, (b) how to automated the process, (c) how to utilize the captured results, etc.

• This study is not exhaustive. It merely provides a sample point of what is possible.

• CHI Workshop and VisWeek Panel on Analytic Provenance

Page 24: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up24/36

3. Can a User ExpressKnowledge Through Interaction?

Page 25: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up25/36

Find Distance Function, Hide Model Inference

• Problem Statement: Given a high dimensional dataset from a domain expert, how does the domain expert create a good distance function?

• Assumption: The domain expert knows about the data, but cannot express it mathematically

Page 26: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up26/36

In An Ideal World…

• The domain expert “guesses” a distance function, and produces the following scatter plot:

Page 27: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up27/36

In An Ideal World…

• The domain expert than interactively “moves” the “bad” data points towards the right direction:

Page 28: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up28/36

In An Ideal World…

• The process is repeated a few times until the layout looks about right.

• The system outputs a new distance function!

Page 29: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up29/36

As It Turns Out…

• This can be done.

• Need to make a few assumptions:

1. The type of distance function (linear, quadratic, etc.)

2. What it means to move a point from one location to another (is it moving closer to a cluster? Or away from some other points?)

Page 30: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up30/36

System Overview

Page 31: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up31/36

Results• Used the “Wine” dataset (13 dimensions, 3 clusters)

– Assume a linear (sum of squares) distance function

• Added 10 extra dimensions, and filled them with random values

• Interactively moved the “bad” points

Blue: original data dimensionRed: randomly added dimensionsX-axis: dimension numberY-axis: final weights of the distance function

Page 32: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up32/36

Conclusion

• With an appropriate projection model, it is possible to quantify a user’s interactions.

• In our system, we let the domain expert interact with a familiar representation of the data (scatter plot), and hides the ugly math (distance function)

• The system “reveals” the domain knowledge of the user.

• Poster presented at VAST 2011

Page 33: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up33/36

Summary

Page 34: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up34/36

Summary

• While Visual Analytics have grown and is slowly finding its identity,

• There is still many open problems that need to be addressed.

• I propose that one research area that has largely been unexplored is in the understanding and supporting of the human user.

Page 35: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up35/36

Summary

• The Visual Analytics Lab at Tufts (VALT) have been pursuing problems in this area.

• The presented projects are a select subset of the problems that we’ve been working on.

• For other projects, please feel free to talk to us, or see our papers online.

Page 36: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up36/36

Thank you!

Questions?

Page 37: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up37/36

4. How to Aggregate Multiple AnalysisTo Perform Group Analytics

Page 38: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up38/36

Scaling Human Computation

• Problem Statement: Computing can be scaled (by adding more CPUs). Visualizations can be scaled (by adding more monitors). Can analysis be scaled by adding more humans?

• Assumption: Conventional wisdom says that humans cannot be scaled because of difficulty in communicating analytical reasoning efficiently.

Page 39: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up39/36

Temporal Graph

• Research Proposal: We propose a Temporal Graph approach to model analytical trails. In a temporal graph,

– Node = a unique state in the visual analysis trail.

– Edge = a (temporal) transition from one state to another.

Page 40: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up40/36

For Example:

• 2 analysts, A and B, each performed an analysis on the same data

A0 A1 A2 A3 A4 A5

B0 B1 B2 B3 B4

Page 41: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up41/36

For Example:

• If A2 is the same as B1 (in that they represent the same analysis step)…

A0 A1

A2

A3 A4 A5

B0

B1

B2 B3 B4

Page 42: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up42/36

For Example:

• We will merge the two nodes

A0 A1

A2B1

A3 A4 A5

B0 B2 B3 B4

Page 43: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up43/36

For Example

• This process is repeated for all analysis trails across all analysts, and we could get a temporal graph that look like:

Page 44: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up44/36

With a Temporal Graph…

• We can answer many questions. For example:

– Given a particular outcome (a yellow states), is there a state that is the catalyst in which every subsequent analysis trail start from?• the answer is yes:• The red states are “points of

no return”• The green states are the

“last decision points”

Page 45: ProvenanceIntroApplicationPersonalityDist FuncWrap-up 1/36 User-Centric Visual Analytics Remco Chang Tufts University Department of Computer Science

ProvenanceIntro Application Personality Dist Func Wrap-up45/36

Conclusion

• There are many benefits to posing analysis trails as a temporal graph problem.

• Mostly, the benefit comes from our ability to apply known graph algorithms.

• Incidentally, this temporal graph formulation can be applied to visualize and analyze other problems involving large state space.

• Poster presented at VAST 2011