school of data - mapping opencorporates networks using openrefine and gephi

32
Mapping Corporate Networks - Intro

Upload: tony-hirst

Post on 27-Jan-2015

113 views

Category:

Business


8 download

DESCRIPTION

 

TRANSCRIPT

Page 1: School Of Data - mapping opencorporates networks using openrefine and Gephi

Mapping CorporateNetworks - Intro

Page 2: School Of Data - mapping opencorporates networks using openrefine and Gephi

A two-part recipe for downloading company ownership data from

OpenCorporates using OpenRefine, and then

visualising it with Gephi

Page 3: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 4: School Of Data - mapping opencorporates networks using openrefine and Gephi

http://opencorporates.com/companies/gb/04366849/network.json?depth=2

Page 5: School Of Data - mapping opencorporates networks using openrefine and Gephi

How to grab the data using

OpenRefineVisit openrefine.org to download the application

Page 6: School Of Data - mapping opencorporates networks using openrefine and Gephi

Add/network.json?depth=2to the end of the web address

Where’s the data?

Page 7: School Of Data - mapping opencorporates networks using openrefine and Gephi

URL of the form:

http://opencorporates.com/companies/JURISDICTION/COMPANY_ID/network.json?depth=2

Page 8: School Of Data - mapping opencorporates networks using openrefine and Gephi

What data block makes a row?

Page 9: School Of Data - mapping opencorporates networks using openrefine and Gephi

Toggle selection and preview

Create project

Page 10: School Of Data - mapping opencorporates networks using openrefine and Gephi

Nicely tabulated data

Page 11: School Of Data - mapping opencorporates networks using openrefine and Gephi

What Gephi Expects…

Page 12: School Of Data - mapping opencorporates networks using openrefine and Gephi

Child Parent

Page 13: School Of Data - mapping opencorporates networks using openrefine and Gephi

What Gephi Expects…

Parent -> SourceChild -> Target

(You may find the network analyses work betterif you use the parent as the Target and thechild as the Source…)

Page 14: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 15: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 16: School Of Data - mapping opencorporates networks using openrefine and Gephi

How to visualise the data using

GephiVisit gephi.org to download the application

Page 17: School Of Data - mapping opencorporates networks using openrefine and Gephi

Getting Started with Gephi

Page 18: School Of Data - mapping opencorporates networks using openrefine and Gephi

Import as Edges table

Page 19: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 20: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 21: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 22: School Of Data - mapping opencorporates networks using openrefine and Gephi

View

Layout

Colour/Size

Stats/Filters

Label tools

Page 23: School Of Data - mapping opencorporates networks using openrefine and Gephi

“Spacing”

Page 24: School Of Data - mapping opencorporates networks using openrefine and Gephi

Turn labels on Label size

Label displayselector

Page 25: School Of Data - mapping opencorporates networks using openrefine and Gephi

Degree 2In-degree 2Out-degree 0

A matter of degree…

Degree 3In-degree 0Out-degree 3

Degree 3In-degree 1Out-degree 2

Page 26: School Of Data - mapping opencorporates networks using openrefine and Gephi

Size by degree…Calculate in-degreeand out-degree

Set node size

The color wheel/palette isused to colour the nodes.

Page 27: School Of Data - mapping opencorporates networks using openrefine and Gephi
Page 28: School Of Data - mapping opencorporates networks using openrefine and Gephi

Label Sizing

Page 29: School Of Data - mapping opencorporates networks using openrefine and Gephi

Tweaking the layout

“Expand” the layout(stretch it in twodimensions)

“Adjust” the labelsso that they don’toverlap - maychange relativeposition of nodes

Page 30: School Of Data - mapping opencorporates networks using openrefine and Gephi

Network Stats

HITS – Authority and Hub values:authoritative nodes are pointed to,hub nodes point to others

Measure the ‘influence’of a node in the network

Note: some of these stats are more meaningful if

we set the parent company as the Target and the child

company as the Source in the original data…

Page 31: School Of Data - mapping opencorporates networks using openrefine and Gephi

Use the tools in concert…Colour based on Authority (HITS statistic)

Label adjust tweaks the layoutso we can read the labels

Fine tune label sizingusing text-size slider

Page 32: School Of Data - mapping opencorporates networks using openrefine and Gephi

SchoolOfData.org