Collecting and Coding Twitter Data in DiscoverText

Jill Hopke, @jillhopke. Digital Humanities Research Network, UW-Madison, September 23, 2014. Collecting and Coding Twitter Data in DiscoverText.

Upload: jill-hopke

Post on 02-Dec-2014


DESCRIPTION

These are the slides to a workshop I presented on September 23, 2014 to the University of Wisconsin-Madison Digital Humanities Research Network (http://dhresearchnetwork.wordpress.com/). The workshop covered an overview of my research using DiscoverText, steps to collect data in the cloud-based big data analytics software DiscoverText (https://discovertext.com/), and coding data, as well as limitations, challenges and other resources for social media data collection and analysis.

TRANSCRIPT

Page 1: Collecting and Coding Twitter Data in DiscoverText

Jill Hopke @jillhopke

Digital Humanities Research Network

UW-Madison, September 23, 2014

Collecting and Coding Twitter Data in DiscoverText

Page 2: Collecting and Coding Twitter Data in DiscoverText

Workshop Overview

My Research on Global Frackdown

Steps to collect Twitter data in DiscoverText

Coding data in DiscoverText

Limitations and Challenges

Other Tools/Resources

Page 3: Collecting and Coding Twitter Data in DiscoverText

Theory-Driven Research is Key!

Collective Action

Page 4: Collecting and Coding Twitter Data in DiscoverText

The Changing Nature of Activism

Collective Action

Connective Action

Page 5: Collecting and Coding Twitter Data in DiscoverText

The Changing Nature of Activism

Collective Action

Connective Action

Collective Action Frames

Personal Action Frames

Page 6: Collecting and Coding Twitter Data in DiscoverText

Transnational Anti-Fracking Activism

Distribution of Global Frackdown 2013 Events

Source: Global Frackdown. (n.d.). Events.

Page 7: Collecting and Coding Twitter Data in DiscoverText

Research Questions

RQ1: What Twitter strategies do Global Frackdown activists use to mobilize for the October 19, 2013 day of action?

RQ2: How do Global Frackdown tweeters frame protest against hydraulic fracturing?

Page 8: Collecting and Coding Twitter Data in DiscoverText

Project Data

• Dataset of 9,449 tweets for the hashtag #globalfrackdown.

• Data collected from October 13 to October 27, 2013 using DiscoverText.

• Textual analysis of English (n=7,678) and Spanish (n=1,314) tweets.

• Unit of analysis is the individual tweet.

• Also conducted in-depth interviews with transnational activists.

Page 9: Collecting and Coding Twitter Data in DiscoverText

Tweeting the #GlobalFrackdown

Page 10: Collecting and Coding Twitter Data in DiscoverText

Tweet Frequency (October 13-27, 2013)

[Figure: chart of daily tweet counts (y-axis 0 to 6,000) for each day from 10/13/13 to 10/27/13, with series for English, Spanish, and Total.]

Page 11: Collecting and Coding Twitter Data in DiscoverText

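Tweet Language

[Figure: chart of tweet share by language. Values in the order shown: 79%, 14%, 3%, 2%, 1%, 1%, 0%. Legend in the order shown: English, Spanish, French, Catalan, Basque, German, Other.]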

Page 12: Collecting and Coding Twitter Data in DiscoverText

Proportion of Tweets by Device Source

[Figure: chart comparing English and Spanish tweets by source (Mobile, Desktop, Application) on a 0 to 100 scale.]

Page 13: Collecting and Coding Twitter Data in DiscoverText

Proportion of Tweets with Photos

[Figure: two charts with the legend Photo / No Photo. Spanish: 9% and 91%. English: 21% and 79%.]

Page 14: Collecting and Coding Twitter Data in DiscoverText

Tweet Content Type

[Figure: chart of % of tweets (0 to 60) by content type, with series for English and Spanish.]

Page 15: Collecting and Coding Twitter Data in DiscoverText

Most Frequently Used Hashtags (Excluding #GlobalFrackdown)

English                 Spanish
Fracking                Fracking
Elsipogtog              FrackingNo
Banfracking             StopFracking
IdleNoMore              FrackingEZ
PowerShift              BanFracking
ElsipogtogSolidarity    19O
BanFrackingNow          SiSePuede19O
Mikmaqblockade          Castelló
Cdnpoli                 GlobalFrackdo
NYC                     Chervon
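A ranking like the one on this slide can be reproduced from tweet text with a few lines of Python. The following is a minimal sketch and not the method used for the slides: the function name `top_hashtags`, the regular expression, and the sample tweets are all assumptions made up for illustration, and the real dataset would first need to be exported from DiscoverText.

```python
import re
from collections import Counter

# Assumption: a hashtag is "#" followed by letters, digits, or underscores.
HASHTAG_RE = re.compile(r"#(\w+)")

def top_hashtags(tweets, exclude=("globalfrackdown",), n=10):
    """Return the n most common hashtags, case-insensitive, skipping excluded tags."""
    excluded = {tag.lower() for tag in exclude}
    counts = Counter()
    for text in tweets:
        for tag in HASHTAG_RE.findall(text):
            tag = tag.lower()
            if tag not in excluded:
                counts[tag] += 1
    return counts.most_common(n)

# Made-up example tweets, for illustration only (not from the study's dataset).
sample_tweets = [
    "Join us Saturday! #GlobalFrackdown #BanFracking",
    "No al fracking #GlobalFrackdown #FrackingNo #19O",
    "Solidarity today #globalfrackdown #banfracking",
]

print(top_hashtags(sample_tweets))
```

Lower-casing the tags merges variants such as #BanFracking and #Banfracking. Whether to merge them is a coding decision that depends on the research question.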

Page 16: Collecting and Coding Twitter Data in DiscoverText

Collecting Data in DiscoverText

Page 17: Collecting and Coding Twitter Data in DiscoverText

DiscoverText  Dashboard  –  Login  and  Try  Out!  

Page 18: Collecting and Coding Twitter Data in DiscoverText

Start  a  New  Project  

Page 19: Collecting and Coding Twitter Data in DiscoverText

Name  Your  Project  

Page 20: Collecting and Coding Twitter Data in DiscoverText

Importing Data

Page 21: Collecting and Coding Twitter Data in DiscoverText

Twitter Data Types (API, GNIP, Historical)

Page 22: Collecting and Coding Twitter Data in DiscoverText

Name  Your  Data  Feed  (Archive)  

Page 23: Collecting and Coding Twitter Data in DiscoverText

Enter  Your  Search  Term  

Page 24: Collecting and Coding Twitter Data in DiscoverText

Schedule Feed Data Collection

Page 25: Collecting and Coding Twitter Data in DiscoverText

Archive  Management  

Page 26: Collecting and Coding Twitter Data in DiscoverText

List  of  Tweets  

Page 27: Collecting and Coding Twitter Data in DiscoverText

Viewing  Tweets  

Page 28: Collecting and Coding Twitter Data in DiscoverText

Meta  Data  

Page 29: Collecting and Coding Twitter Data in DiscoverText

Keeping  Track  of  Feed  Schedules  

Page 30: Collecting and Coding Twitter Data in DiscoverText

Coding  in  DiscoverText  

Page 31: Collecting and Coding Twitter Data in DiscoverText

(Part of) What I Did (Theory-Driven)

• First round: coded for language.

• Second round: read a subsection of the data and developed a set of “working themes.”

• Code for themes. Memo/annotate interesting examples.

• Refine codebook (themes) and continue coding.

• Intercoder reliability (you might want to do this; it depends on your methodological approach).

• I also used the machine-learning functions for a separate chapter to “classify” data for valence and certainty.
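For the intercoder reliability step, one common statistic is Cohen's kappa, which corrects the raw agreement between two coders for the agreement expected by chance. The slides do not say which statistic was used, so the sketch below is only an illustration under that assumption. The function name `cohens_kappa` and the theme labels are hypothetical.

```python
from collections import Counter

def cohens_kappa(coder_a, coder_b):
    """Cohen's kappa for two coders who labeled the same items in the same order."""
    if len(coder_a) != len(coder_b) or not coder_a:
        raise ValueError("Both coders must label the same non-empty set of items.")
    n = len(coder_a)
    # Observed agreement: share of items given the same code by both coders.
    observed = sum(a == b for a, b in zip(coder_a, coder_b)) / n
    # Chance agreement: product of each coder's marginal proportions, summed over codes.
    freq_a = Counter(coder_a)
    freq_b = Counter(coder_b)
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # Both coders used one identical code for every item.
    return (observed - expected) / (1 - expected)

# Hypothetical theme codes for six tweets, for illustration only.
coder_a = ["call", "call", "info", "info", "call", "info"]
coder_b = ["call", "info", "info", "info", "call", "info"]

kappa = cohens_kappa(coder_a, coder_b)
print(round(kappa, 3))
```

Here the coders agree on five of six tweets (0.833), chance agreement is 0.5, and kappa is about 0.667. With more than two coders or missing codes, a different statistic such as Krippendorff's alpha may be a better fit.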

Page 32: Collecting and Coding Twitter Data in DiscoverText

Coding  Tweets    

DiscoverText  Coding  Example    

Page 33: Collecting and Coding Twitter Data in DiscoverText

Limitations and Challenges

Page 34: Collecting and Coding Twitter Data in DiscoverText

Limitations and Challenges

PRO: Doesn’t require programming knowledge. User-friendly interface. Powerful tool.

CON: Software’s advanced machine-learning functions are expensive! DiscoverText is one of the “affordable” platforms. Also, human subjects research/IRB considerations.

= Need for collaborations and grant funding.

Page 35: Collecting and Coding Twitter Data in DiscoverText

Other  Tools/Resources  

Page 36: Collecting and Coding Twitter Data in DiscoverText

Other Tools and Resources

• “Social Media Data Collection Tools” (see here): Running list of tools curated by Deen Freelon, Ph.D., [email protected], http://dfreelon.org, @dfreelon.

• Digital Methods Initiative at University of Amsterdam (see here).

• Digital Methods (2013) by Richard Rogers (see here).

• Join Association of Internet Researchers AIR-L mailing list (see here)!

 

Page 37: Collecting and Coding Twitter Data in DiscoverText

Questions?

Thank you!

Jill Hopke

[email protected]  @jillhopke  

jillhopke.com