google nyc-6-3-2011

58
Analyzing Social Media Networks with NodeXL Ben Shneiderman [email protected] @benbendc Founding Director (1983-2000), Human-Computer Interaction Lab Professor, Department of Computer Science Member, Institute for Advanced Computer Studies College Park, MD 20742

Upload: university-of-maryland

Post on 19-Jan-2015

357 views

Category:

Technology


0 download

DESCRIPTION

Talk in Google- NYC, 6/3/2011 covering Information Visualization Analyzing Social Media Networks with NodeXL

TRANSCRIPT

Page 1: Google nyc-6-3-2011

Analyzing Social Media Networkswith NodeXL

Ben Shneiderman [email protected] @benbendc

Founding Director (1983-2000), Human-Computer Interaction LabProfessor, Department of Computer Science

Member, Institute for Advanced Computer Studies

University of MarylandCollege Park, MD 20742

Page 2: Google nyc-6-3-2011

Interdisciplinary research community - Computer Science & Info Studies - Psych, Socio, Poli Sci & MITH (www.cs.umd.edu/hcil)

Page 3: Google nyc-6-3-2011

Design Issues

• Input devices & strategies• Keyboards, pointing devices, voice

• Direct manipulation

• Menus, forms, commands

• Output devices & formats• Screens, windows, color, sound

• Text, tables, graphics

• Instructions, messages, help

• Collaboration & Social Media

• Help, tutorials, training

• Search www.awl.com/DTUI

Fifth Edition: 2010

• Visualization

Page 4: Google nyc-6-3-2011

Information Visualization

• Visual bandwidth is enormous• Human perceptual skills are remarkable

• Trend, cluster, gap, outlier...

• Color, size, shape, proximity...

• Three challenges• Meaningful visual displays of massive data

• Interaction: widgets & window coordination

• Process models for discovery:

Page 5: Google nyc-6-3-2011

Leader in Info Visualization

Page 6: Google nyc-6-3-2011

Business takes action

• General Dynamics buys MayaViz

• Agilent buys GeneSpring

• Google buys Gapminder

• Oracle buys Hyperion

• Microsoft buys Proclarity

• InfoBuilders buys Advizor Solutions

• SAP buys (Business Objects buys Xcelsius & Inxight & Crystal Reports )

• IBM buys (Cognos buys Celequest) & ILOG

• TIBCO buys Spotfire

Page 7: Google nyc-6-3-2011

Spotfire: Retinol’s role in embryos & vision

Page 8: Google nyc-6-3-2011

http://registration.spotfire.com/eval/default_edu.asp

Page 9: Google nyc-6-3-2011

10M - 100M pixels

Large displays for single or multiple users

Page 10: Google nyc-6-3-2011

100M-pixels & more

Page 11: Google nyc-6-3-2011

1M-pixels & less

Small mobile devices

Page 12: Google nyc-6-3-2011

Information Visualization: Mantra

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

• Overview, zoom & filter, details-on-demand

Page 13: Google nyc-6-3-2011

Information Visualization: Data Types

• 1-D Linear Document Lens, SeeSoft, Info Mural

• 2-D Map GIS, ArcView, PageMaker, Medical imagery

• 3-D World CAD, Medical, Molecules, Architecture

• Multi-Var Spotfire, Tableau, GGobi, TableLens, ParCoords,

• Temporal LifeLines, TimeSearcher, Palantir, DataMontage

• Tree Cone/Cam/Hyperbolic, SpaceTree, Treemap

• Network Pajek, JUNG, UCINet, SocialAction, NodeXL

I

nfoV

iz

S

ciV

iz .

infosthetics.com flowingdata.com infovis.org eagereyes.org www.infovis.net/index.php?lang=2

Page 14: Google nyc-6-3-2011

Anscombe’s Quartet

1 2 3 4

x y x y x y x y

10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58

8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76

13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71

9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84

11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47

14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04

6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25

4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50

12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56

7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91

5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89

Page 15: Google nyc-6-3-2011

Anscombe’s Quartet

1 2 3 4

x y x y x y x y

10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58

8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76

13.0 7.58 13.0 8.74 13.0 12.74 8.0 7.71

9.0 8.81 9.0 8.77 9.0 7.11 8.0 8.84

11.0 8.33 11.0 9.26 11.0 7.81 8.0 8.47

14.0 9.96 14.0 8.10 14.0 8.84 8.0 7.04

6.0 7.24 6.0 6.13 6.0 6.08 8.0 5.25

4.0 4.26 4.0 3.10 4.0 5.39 19.0 12.50

12.0 10.84 12.0 9.13 12.0 8.15 8.0 5.56

7.0 4.82 7.0 7.26 7.0 6.42 8.0 7.91

5.0 5.68 5.0 4.74 5.0 5.73 8.0 6.89

Property Value

Mean of x  9.0

Variance of x 11.0

Mean of y  7.5

Variance of y  4.12

Correlation 0.816

Linear regression y = 3 + 0.5x

Page 16: Google nyc-6-3-2011

Anscombe’s Quartet

Page 17: Google nyc-6-3-2011

Temporal Data: TimeSearcher 1.3

• Time series• Stocks

• Weather

• Genes

• User-specified patterns

• Rapid search

Page 18: Google nyc-6-3-2011

Temporal Data: TimeSearcher 2.0

• Long Time series (>10,000 time points)

• Multiple variables

• Controlled precision in match (Linear, offset, noise, amplitude)

Page 19: Google nyc-6-3-2011

LifeLines: Patient Histories

www.cs.umd.edu/hcil/lifelines

Page 20: Google nyc-6-3-2011

LifeLines2: Contrast+Creatine

Page 21: Google nyc-6-3-2011

LifeLines2: Align-Rank-Filter & Summarize

Page 22: Google nyc-6-3-2011

Treemap: Gene Ontology

www.cs.umd.edu/hcil/treemap/

+ Space filling

+ Space limited

+ Color coding

+ Size coding - Requires learning

(Shneiderman, ACM Trans. on Graphics, 1992 & 2003)

Page 23: Google nyc-6-3-2011

www.smartmoney.com/marketmap

Treemap: Smartmoney MarketMap

Page 24: Google nyc-6-3-2011

Market falls steeply Feb 27, 2007, with one exception

Page 25: Google nyc-6-3-2011

Market mixed, February 8, 2008 Energy & Technology up, Financial & Health Care down

Page 26: Google nyc-6-3-2011

Market rises, September 1, 2010, Gold contrarians

Page 27: Google nyc-6-3-2011

Market rises, March 21, 2011, Sprint declines

Page 28: Google nyc-6-3-2011

newsmap.jp

Treemap: Newsmap (Marcos Weskamp)

Page 29: Google nyc-6-3-2011

www.hivegroup.com

Treemap: Supply Chain

Page 30: Google nyc-6-3-2011

www.spotfire.com

Treemap: Spotfire Bond Portfolio Analysis

Page 31: Google nyc-6-3-2011

Treemap: NY Times – Car&Truck Sales

www.cs.umd.edu/hcil/treemap/

Page 32: Google nyc-6-3-2011

Treemap (Voronoi): NY Times - Inflation

www.nytimes.com/interactive/2008/05/03/business/20080403_SPENDING_GRAPHIC.html

Page 33: Google nyc-6-3-2011
Page 34: Google nyc-6-3-2011

State-of-the-art network visualization

Page 35: Google nyc-6-3-2011

Discovery Process: Systematic Yet Flexible

Preparation• Own the problem & define the schedule• Data cleaning & conditioning• Handle missing & uncertain data• Extract subsets & link to related information

Page 36: Google nyc-6-3-2011

SocialAction

• Integrates statistics & visualization

• 4 case studies, 4-8 weeks (journalist, bibliometrician, terrorist analyst, organizational analyst)

• Identified desired features, gave strong positive feedback about benefits of integration

Perer & Shneiderman, CHI2008, IEEE CG&A 2009www.cs.umd.edu/hcil/socialaction

Page 37: Google nyc-6-3-2011

Footprints of Human Activity

• Footprints in sand as Caesarea

Page 38: Google nyc-6-3-2011

NodeXL: Network Overview for Discovery & Exploration in Excel

www.codeplex.com/nodexl

Page 39: Google nyc-6-3-2011

NodeXL: Network Overview for Discovery & Exploration in Excel

www.codeplex.com/nodexl

Page 40: Google nyc-6-3-2011

NodeXL: Import Dialogs

www.codeplex.com/nodexl

Page 41: Google nyc-6-3-2011

Tweets at #WIN09 Conference: 2 groups

Page 42: Google nyc-6-3-2011

WWW2010 Twitter Community

Page 43: Google nyc-6-3-2011

WWW2011 Twitter Community: Grouped

Page 44: Google nyc-6-3-2011

Tweets for #HCI: 2 groups

Page 45: Google nyc-6-3-2011

CHI2010 Twitter Community

www.codeplex.com/nodexl/

Page 46: Google nyc-6-3-2011

Oil Spill Twitter Community

www.codeplex.com/nodexl/

Page 47: Google nyc-6-3-2011

‘GOP’ tweets, clustered (red-Republicans)

Page 48: Google nyc-6-3-2011

Flickr clusters for “mouse”

Computer Mickey

Animal

Page 49: Google nyc-6-3-2011

Flickr networks

Page 50: Google nyc-6-3-2011

Co-author network for HCIL tech reports

Vertices sized by number of papers. Edges sized number of co-authored reports. Colored by clustering.

Page 51: Google nyc-6-3-2011

Co-author network for HCIL tech reports

Vertices sized by number of papers, edges sized number of co-authored reports

Colored by date of first paper.

Includes only those with at least 5 co-authored papers.

Page 52: Google nyc-6-3-2011

Nation of Neighbors Discussion Groups

Page 53: Google nyc-6-3-2011

Analyzing Social Media Networks with NodeXL

I. Getting Started with Analyzing Social Media Networks 1. Introduction to Social Media and Social Networks 2. Social media: New Technologies of Collaboration 3. Social Network Analysis

II. NodeXL Tutorial: Learning by Doing 4. Layout, Visual Design & Labeling 5. Calculating & Visualizing Network Metrics  6. Preparing Data & Filtering 7. Clustering &Grouping

III Social Media Network Analysis Case Studies 8. Email 9. Threaded Networks 10. Twitter 11. Facebook   12. WWW 13. Flickr 14. YouTube  15. Wiki Networks 

www.elsevier.com/wps/find/bookdescription.cws_home/723354/description

Page 54: Google nyc-6-3-2011

Social Media Research Foundation

Researchers who want to - create open tools - generate & host open data - support open scholarship

Map, measure & understand social media  

Support tool projects to collection, analyze & visualize social media data.  

smrfoundation.org

Page 55: Google nyc-6-3-2011

UN Millennium Development Goals

• Eradicate extreme poverty and hunger• Achieve universal primary education• Promote gender equality and empower women• Reduce child mortality• Improve maternal health• Combat HIV/AIDS, malaria and other diseases• Ensure environmental sustainability• Develop a global partnership for development

To be achieved by 2015

Page 56: Google nyc-6-3-2011

Just happened: 28th Annual SymposiumMay 25-26, 2011

Next Event: Summer Social WebshopAugust 23-26, 2011

(Sponsored by NSF & Google)www.cs.umd.edu/hcil/webshop2011

Page 57: Google nyc-6-3-2011

For More Information

• Visit the HCIL website for 400 papers & info on videos www.cs.umd.edu/hcil

• Conferences & resources: www.infovis.org

• See Chapter 14 on Info Visualization Shneiderman, B. and Plaisant, C., Designing the User Interface: Strategies for Effective Human-Computer Interaction: Fifth Edition (March 2009) www.awl.com/DTUI

• Edited Collections: Card, S., Mackinlay, J., and Shneiderman, B. (1999) Readings in Information Visualization: Using Vision to Think Bederson, B. and Shneiderman, B. (2003) The Craft of Information Visualization: Readings and Reflections

Page 58: Google nyc-6-3-2011

For More Information

• Treemaps• HiveGroup: www.hivegroup.com • Smartmoney: www.smartmoney.com/marketmap • HCIL Treemap 4.0: www.cs.umd.edu/hcil/treemap

• Spotfire: www.spotfire.com • TimeSearcher: www.cs.umd.edu/hcil/timesearcher • NodeXL: nodexl.codeplex.com• Hierarchical Clustering Explorer:

www.cs.umd.edu/hcil/hce

• LifeLines2: www.cs.umd.edu/hcil/lifelines2 • Similan: www.cs.umd.edu/hcil/similan