tim budden: "unlocking insights from social data"

31
Unlocking insights from social data Tim Budden VP Data Science at DataSift

Upload: digital-henley

Post on 16-Jan-2017

80 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: Tim Budden: "Unlocking Insights from Social Data"

Unlocking insightsfrom social data

Tim BuddenVP Data Science at DataSift

Page 2: Tim Budden: "Unlocking Insights from Social Data"

Drew Conway’s Data Science Venn Diagram

2

Page 3: Tim Budden: "Unlocking Insights from Social Data"

Expanding data universe

Agenda

1

2

3

4

Evolution of social data

Social data analyticsPrivacy by design

5 Examples

Page 4: Tim Budden: "Unlocking Insights from Social Data"

Expanding digital universe1

Page 5: Tim Budden: "Unlocking Insights from Social Data"

5

Expanding digital universe

Expanding human data

universe

Page 6: Tim Budden: "Unlocking Insights from Social Data"

Evolution of social data2

Page 7: Tim Budden: "Unlocking Insights from Social Data"

The evolution of social data

From public to non-public spaces:

Public Walled 1 to 1 Image-based

Page 8: Tim Budden: "Unlocking Insights from Social Data"

Public

Where brands and consumers most commonly engage directly. This is where customer support and brand perception can be addressed directly by a brand.

Page 9: Tim Budden: "Unlocking Insights from Social Data"

Walled garden

Users engage each other in a non-public but large network. This is where users are more candid about their aspirations and attitudes toward brands.

Page 10: Tim Budden: "Unlocking Insights from Social Data"

1 to 1

Users engage each other directly on a one-to-one or small group basis. Thus far this space has been considered largely off limits to brands.

Page 11: Tim Budden: "Unlocking Insights from Social Data"

Image-based

Public spaces where people showcase their best visual content.

Page 12: Tim Budden: "Unlocking Insights from Social Data"

12

Page 13: Tim Budden: "Unlocking Insights from Social Data"

Social data analytics3

Page 14: Tim Budden: "Unlocking Insights from Social Data"

14

Business applications of social media

Page 15: Tim Budden: "Unlocking Insights from Social Data"

15

Volume and velocity

Natural Language

Privacy

2.1B People Globally on Social Networks

Challenges to extracting insights from data

Unlocking Insights from 2.1B People on Social Networks

Page 16: Tim Budden: "Unlocking Insights from Social Data"

Example analytics project: Run on the banks?

16

Bank of England experimented with trying to predict a bank run in the days preceding the Scottish independence referendumObserved spike on 15 September of tweets mentioning “RBS” and “run”

Scottishindependence

referendum

Page 17: Tim Budden: "Unlocking Insights from Social Data"

17

Run on the banks?“Great run there! Arm tackles don’t bring down good RBs”

Page 18: Tim Budden: "Unlocking Insights from Social Data"

Ambiguity in natural language

18

Page 19: Tim Budden: "Unlocking Insights from Social Data"

Synonymity in natural language

19

Page 20: Tim Budden: "Unlocking Insights from Social Data"

word2vec

20

king - man=

queen - woman

Berlin - Germany=

Paris - France

https://spacy.io/demos/sense2vec?NFL

Page 21: Tim Budden: "Unlocking Insights from Social Data"

Privacy by design4

Page 22: Tim Budden: "Unlocking Insights from Social Data"

How can information useful to business be extracted from non-public spaces, while wholeheartedly

respecting people’s privacy?

Page 23: Tim Budden: "Unlocking Insights from Social Data"

Think in terms of audiences and demographics not individuals

23

Djokovic

Federer

female male

Come on Djokovic! Come on

Roger!

Go for it Novak!

Great shot Federer!

Henman Hill at Wimbledon

Page 24: Tim Budden: "Unlocking Insights from Social Data"

Think in terms of topics and attitudes not verbatim

Sumptuous interior!

Beautiful lines!

Lots of storage

Page 25: Tim Budden: "Unlocking Insights from Social Data"

PYLON: Anonymised and Aggregated insights

25

Text available to algorithmsbut not output

Aggregated results

Audience sizes are quantised:minimum bucket size and intervals

Anonymised: allPersonallyIdentifiableInformation(PII) is dropped

API

DS

Page 26: Tim Budden: "Unlocking Insights from Social Data"

CONTENTGender: MaleAge Range: 35-44Region: California, USA

CONTENTNegativeNeutralPositive

DEMOGRAPHICS

SENTIMENT

Automatic classification of related topics

e.g. Star Wars VII (Film)

TOPIC ANALYSIS

CONTENT

LINKSAnalyze

URLs shared across Facebook

Engagement and Demographics around Likes, Comments and Shares

ENGAGEMENT

Can’t wait to take the kids to watch Star Wars VII

CONTENT

Privacy-safe aggregate analysis of

text

TEXT ANALYSIS

Topic Data is Multi-Dimensional. Build Insights into Content, Engagement, Audiences

Page 27: Tim Budden: "Unlocking Insights from Social Data"

Examples5

Page 28: Tim Budden: "Unlocking Insights from Social Data"

Analysing and visualising automotive

28

websequencediagrams.com

Page 29: Tim Budden: "Unlocking Insights from Social Data"

Writing the script with Facebook topic data

29

Page 30: Tim Budden: "Unlocking Insights from Social Data"

30

Volume and velocity

Natural Language

Privacy

2.1B People Globally on Social Networks

Challenges to extracting insights from data

Unlocking Insights from 2.1B People on Social Networks

Page 31: Tim Budden: "Unlocking Insights from Social Data"

THANK YOU