tools for data visualization - national energy …€¢ goal: exploratory • speed/flexibility:...

31
Annette Greiner NERSC Data and Analytics Services Tools for Data Visualization -1- Data Day, August 22, 2016

Upload: doanhuong

Post on 15-Apr-2018

218 views

Category:

Documents


2 download

TRANSCRIPT

Annette Greiner!NERSC Data and Analytics Services

Tools for Data Visualization

-1-

DataDay,August22,2016

Why Visualize?

-2-

Anscombe’s Quartet

X Y

10.0 8.04

8.0 6.95

13.0 7.58

9.0 8.81

11.0 8.33

14.0 9.96

6.0 7.24

4.0 4.26

12.0 10.84

7.0 4.82

5.0 5.68

-3-

X Y

10.0 9.14

8.0 8.14

13.0 8.74

9.0 8.77

11.0 9.26

14.0 8.10

6.0 6.13

4.0 3.10

12.0 9.13

7.0 7.26

5.0 4.74

X Y

10.0 7.46

8.0 6.77

13.0 12.74

9.0 7.11

11.0 7.81

14.0 8.84

6.0 6.08

4.0 5.39

12.0 8.15

7.0 6.42

5.0 5.73

X Y

8.0 6.58

8.0 5.76

8.0 7.71

8.0 8.84

8.0 8.47

8.0 7.04

8.0 5.25

19.0 12.50

8.0 5.56

8.0 7.91

8.0 6.89

F.J.Anscombe,1973

Same:NMeanXMeanYVarianceXVarianceYRegressionCorrelaGon

Anscombe’s Quartet

-4- F.J.Anscombe,1973

Choosing a Tool

-5-

Good News/Bad News

-6-hIp://selecGon.datavisualizaGon.ch

Goal

Isyourgoalexplana9onorexplora9on?

-7-

Explanatory:

Youalreadyknowwhatyouwanttosay.

Exploratory:

Youwanttofindoutwhatthedatameans.

Speed/Flexibility Tradeoff

Doyouhavelotsof9me?Doyouneedcustomiza9on?

-8- Image:HighlightsforChildren

Speed,EaseFlexibility

Dependencies

Whatiscompa9blewithyourworkinghabits?

PythonR

Analy9csappWeb

LibrariesVersions

-9-

Sharing

Doyouneedtosharewithothers?•  Imagefiles(PNG,TIFF,PDF,etc.)•  Interac9vity(click,brush,rollover,zoom,pan,etc.)•  Codesharing(notebookinterface)•  Privacy(authen9ca9on,authoriza9on)

-10-

Specificity

Isthereatoolaimedatyourprobleminpar9cular?

-11-

Graphical Quality

Whattoolwillletyoushowyourdatawithoutdistor9onorchartjunk?

-12-FusionCharts

Licensing/Cost

Whatarethecosts?•  Purchaseprice•  Licenseover9me•  Licenseconstraints(e.g.,aWribu9on,constraintsonderiva9veworks)

-13-

New Tools of Note

-14-

Seaborn

-15-

Seaborn

•  Goal:exploratory•  Speed/Flexibility:Speed•  Dependencies:python2.7or3.3+,numpy,scipy,matplotlib,pandas

•  Sharing:exportimagefiles•  SpecificUses:sta9s9calgraphs•  GraphicalQuality:goodqualitygraphs,abilitytotweakcolors,axes,etc.

•  Costs:opensource

-16-

Bokeh

-17-

Bokeh

•  Goal:exploratoryorexplanatory•  Speed/Flexibility:low-,intermediate-,orhigh-level•  Dependencies:python•  Sharing:Bokehserver,embeddinginwebpagesornotebooks

•  SpecificUses:novelinterac9vevisualiza9onsinthebrowser

•  GraphicalQuality:verygoodquality•  Costs:opensource

-18-

Plotly

-19-

Plotly

-20-

Plotly

•  Goal:exploratory•  Speed/Flexibility:Speed•  Dependencies:libraryforpython,R,Matlab,Excel,orJS,orusewebapp

•  Sharing:Plot.ly,PlotlyServer•  SpecificUses:collabora9veanalysis•  GraphicalQuality:goodquality•  Costs:plotlylibsareopensource,PlotlyServerislicensed

-21-

Shiny

-22-

Shiny

-23-

Shiny

•  Goal:exploratoryorexplanatory•  Speed/Flexibility:Speed•  Dependencies:RorRStudio•  Sharing:shinyapps.io,ShinyServer•  SpecificUses:interac9vewebapplica9ons•  GraphicalQuality:goodquality•  Costs:opensource,freemiummodel

-24-

Questions?

Ihavesomeforyou...

-25-

Questions?

•  Whattoolsdoyouguysuse(onoroffHPC)?

-26-

Questions?

•  Whattoolsdoyouguysuse(onoroffHPC)?•  WhatwouldyouliketoseeavailableatNERSC?

-27-

Questions?

•  Whattoolsdoyouguysuse(onoroffHPC)?•  WhatwouldyouliketoseeavailableatNERSC?•  DoyoucurrentlydodatavisonHPCsystems?

-28-

Questions?

•  Whattoolsdoyouguysuse(onoroffHPC)?•  WhatwouldyouliketoseeavailableatNERSC?•  DoyoucurrentlydodatavisonHPCsystems?•  Ifno,whynot?Arethereroadblockswecanremove?

-29-

Questions?

•  Whattoolsdoyouguysuse(onoroffHPC)?•  WhatwouldyouliketoseeavailableatNERSC?•  DoyoucurrentlydodatavisonHPCsystems?•  Ifno,whynot?Arethereroadblockswecanremove?

•  Arethereotherissueswithcurrentlyavailablesofwarefordatavis?

-30-

National Energy Research Scientific Computing Center

-31-