the interplay between mathematics/computation and analytics haesun park division of computational...

9
The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology FODAVA Review Meeting Dec. 3, 2009

Upload: doreen-webster

Post on 20-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

The Interplay Between Mathematics/Computation and Analytics

Haesun ParkDivision of Computational Science and Engineering

Georgia Institute of Technology

FODAVA Review MeetingDec. 3, 2009

Page 2: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

FODAVA and Me • PhD 87 Cornell, numerical comp. (numerical linear algebra),

parallel computing, signal processing, .. • Early 00’s, data analysis, text analysis, bioinformatics –

dimension reduction (LDA, NMF), classification, clustering, ..• 2003-2005 NSF Program Director CCF

– TF/Numeric, Symbolic, Geometric Computing– Graphics and Vis until Larry Rosenblum– MSPA/MCS (Visual Analytics from Larry R.)

• 2005, Georgia Tech, Division of Computational Science and Eng.• 2007, Apr., DIMACS Workshop: Recent Advances in Math. and

Information Sciences for Analysis and Understanding of Massive and Diverse Sources of Data , Wen Masters of ONR, and Fred Roberts of DIMACS,

• 2007 Fall, FODAVA, on campus team already formed based on CSE Seminar series (J. Stasko, …)

Page 3: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Visual Analytics

• Visual Analytics is the Science of Analytical Reasoning facilitated by Interactive Visual Interfaces (Thomas and Cook)

• Visual Analytics combines automated analysis techniques with interactive visualizations for an effective understanding, reasoning and decision making on the basis of very large and complex data sets (Keim et al.)

AnalyticalReasoning

DataRepresentation

and Transformation

ProductionPresentation and

Dissemination

VisualRepresentationand Interaction

I see, therefore, I analyze better.

I see, therefore, I analyze better.

“Solving a problem simply means representing it so that the solution is obvious.” Herbert Simon, 96

Page 4: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Visual Analytics is Truly Interdisciplinary

• Community very broad– Vis, HCI, Database, Cognitive Science,– FODAVA : math, statistics, computational science, data

analysis, …

• Challenges– Communications– Different communities are used to different problem settings

• Opportunities– Maximally utilizing what human and computer can offer– Visual Representation and Interaction

• Writable vis (in contrast to readable vis) makes vis useful (P. Hanrahan)

• is the key facilitator between human and data

Page 5: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Data Representation & Transformation Tasks

•Classification

•Clustering

•Regression

•Dimension reduction

•Density estimation

•Retrieval of similar items

•Automatic summarization

•…

Mat

hem

atic

al ,

Sta

tist

ical

, an

d

Co

mp

uta

tio

nal

Met

ho

ds

Modules in Data and Visual Analytics System

Analytical Reasoning Tasks

•Identify the individual leaking classified information

•Determine if a set of suspicious events are related

•Predict the next stock market crash

•Identify best medical treatment based on genomic, population data

• …

Hu

man

Kn

ow

led

ge

Visual Representation and Interaction

Page 6: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Challenges• Data Representation and Transformation vs. Analytical

Reasoning Tasks• Data representation and transformation concerned with answering

questions for which solution process is rather well defined• Analytical reasoning concerned with determining what questions to ask

(e.g., formulating hypotheses)• Analysts must continually iterate between these tasks• Evaluation ?• Scalability ?

• How does the Interplay come together ?• Visualization is the way for the interplay to occur effectively• Better identification of mapping between existing data representation and

transformation and the steps of analytical reasoning tasks needed• Expansion and refinement of data analytical tasks needed for extended

mapping• Careful design of visual representation and interaction

Page 7: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Interdisciplinary Activities

• Close collaboration of FODAVA teams with people in VA from vis and/or analytical reasoning community critical (ex. J. Stasko,

NVAC (J. Thomas, S. Bohn),

W. Ribarsky,

DHS CoE: David Ebert )

• FODAVA Test bed

Page 8: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology

Where to publish?• IEEE TPAMI - IEEE Trans. on Pattern Analysis and Machine Intelligence• JMLR – Journal of Machine Learning Research• Information Visualization • InfoVis - IEEE Information Visualization Conference• VAST - IEEE Visual Analytics Science and Technology• NIPS - Neural Information Processing Systems• ICML - International Conference on Machine Learning• SIGKDD - ACM Special Interest Group on Knowledge Disc. and Data Mining• SDM - SIAM Conference on Data Mining• ICDM - IEEE International Conference on Data Mining• CVPR - Conference on Computer Vision and Pattern Recognition• WWW - International World Wide Web Conference• WSDM - International Conference on Web Search and Data Mining• AISTATS - International Conference on Artificial Intelligence and Statistics• CompStat - International Conference on Computational Statistics• JSM - (American Statistical Association's) Joint Statistical Meetings

Page 9: The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology