Tracking Social Media Participation: New Approaches to Studying User-Generated Content

Dr Axel Bruns
Associate Professor

ARC Centre of Excellence for Creative Industries and Innovation
Queensland University of Technology

[email protected] – http://snurb.info/ – @snurb_dot_info

Post on 20-Aug-2015



Researching Social Media

• Social Media:

Websites which build on Web 2.0 technologies to provide space for in-depth social interaction, community formation, and the tackling of collaborative projects.

Axel Bruns and Mark Bahnisch. "Social Drivers behind Growing Consumer Participation in User-Led Content Generation: Volume 1 - State of the Art." Sydney: Smart Services CRC, 2009.

Researching Social Media

• Various existing research approaches:

– Qualitative:

• Processes and practices How? What?

• Content generated by users What?

• Sites and organisational structures How? In what context?

– Quantitative:

• User surveys (demographics, practices, motivations) Who? Why?

• Content coding (usually small-scale) What?

– Mostly small-scale – limited applicability?

Known (Un)knowns

• What we know:

– Behaviour of small social media communities

– Practices of lead users

– Structural frameworks for selected sites / site genres

– Broad demographics of social media users

• Some things we want to know:

– How does all of this work at scale?

– What about ‘average’ users?

– How do communities overlap / interact?

– Can we track developments over time?

Mining and Mapping

• New research materials:

– Massive amounts of data and metadata generated by social media

– Mostly freely available online (Web / RSS / API access)

– Clear, standardised formats

• New research tools:

– Network crawlers

– Website scrapers

– Network analysers / visualisers

– Large-scale text analysers

Network Crawling and Analysis

• E.g. IssueCrawler:

Text Scraping and Analysis

• E.g. Leximancer:

(Kelly & Etling, 2009)

• What timeframe?

• Crawler approach: anything posted in the last 20 years

• Resulting in one static map – but what’s happening now?

• What map?

• Other ways to categorise these sites?

• Differences in activity, consistency

• Known unknowns – dynamics in the Iranian blogosphere:

• Sites appearing / disappearing?

• Increased / decreased activity?

• New linkage patterns:

• Stronger / weaker clustering?

• Move from one cluster to another?

• Change in topics, shift in emphasis, spread of information?
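One way to approach the questions above about sites appearing or disappearing and about new linkage patterns is to compare link snapshots taken at different times rather than relying on a single static map. The following is a minimal sketch only: the site names, the two snapshots, and the `compare_snapshots` helper are hypothetical illustrations, not data or tools from the study cited above.

```python
# Hypothetical link snapshots: each pair is (linking_site, linked_site).
SNAPSHOT_JAN = {("blog-a", "blog-b"), ("blog-b", "blog-c"), ("blog-c", "blog-a")}
SNAPSHOT_JUN = {("blog-a", "blog-b"), ("blog-b", "blog-d"), ("blog-d", "blog-a")}


def sites_in(snapshot):
    """Return every site that appears as source or target of a link."""
    return {site for edge in snapshot for site in edge}


def compare_snapshots(earlier, later):
    """Describe what changed between two snapshots of the same network."""
    return {
        "sites_appeared": sites_in(later) - sites_in(earlier),
        "sites_disappeared": sites_in(earlier) - sites_in(later),
        "links_added": later - earlier,
        "links_dropped": earlier - later,
    }


changes = compare_snapshots(SNAPSHOT_JAN, SNAPSHOT_JUN)
for name, items in changes.items():
    print(name, sorted(items))
```

Repeating the comparison over a series of snapshots would give a rough timeline of change; questions about clustering or topic shifts would need further analysis on top of this.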

Asking Sophisticated Questions

• Problems with current research approaches:

– Crawlers don’t distinguish site genres or link types

– Scrapers gather all text (including headers, footers, comments, …)

– Very few attempts to trace the dynamics of participation

– Many different ways to visualise these data

– Assumptions often built into the software, and difficult to change

• Alternative approaches:

– Gather large population of RSS feeds (and keep growing it)

– Track for new posts, and scrape posts only (retain timestamp)

– Extract links and keywords for further analysis

– Develop ways of identifying and visualising change over time

• Needs to be appropriate to research questions
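The alternative approach listed above (track feeds for new posts, keep the timestamp, extract links and keywords) could be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual tooling: the sample feed, the stopword list, and the `PostParser` and `parse_feed` names are hypothetical, and a real study would fetch many feeds on a schedule instead of parsing an inline string.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter
from email.utils import parsedate_to_datetime
from html.parser import HTMLParser

# Hypothetical sample feed (RSS 2.0) standing in for a fetched feed.
SAMPLE_RSS = """<rss version="2.0"><channel><title>Example Blog</title>
<item>
  <title>Budget reaction</title>
  <link>http://blog.example.org/budget</link>
  <pubDate>Tue, 12 May 2009 21:30:00 +1000</pubDate>
  <description>&lt;p&gt;The budget speech is covered &lt;a href="http://news.example.com/budget"&gt;here&lt;/a&gt;. The budget matters.&lt;/p&gt;</description>
</item>
</channel></rss>"""

# Tiny illustrative stopword list; a real one would be much longer.
STOPWORDS = {"the", "is", "a", "and", "of", "to", "in", "here"}


class PostParser(HTMLParser):
    """Collects outgoing links and visible text from one post body."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.text.append(data)


def parse_feed(xml_text, seen):
    """Return posts not seen before, with timestamp, links and keywords."""
    posts = []
    for item in ET.fromstring(xml_text).iter("item"):
        url = item.findtext("link")
        if url in seen:
            continue  # only new posts are of interest
        seen.add(url)
        body = PostParser()
        body.feed(item.findtext("description") or "")
        words = re.findall(r"[a-z']+", " ".join(body.text).lower())
        pub = item.findtext("pubDate")
        posts.append({
            "url": url,
            "published": parsedate_to_datetime(pub) if pub else None,
            "links": body.links,
            "keywords": Counter(w for w in words if w not in STOPWORDS),
        })
    return posts


seen_urls = set()
new_posts = parse_feed(SAMPLE_RSS, seen_urls)
```

Because only the post body from the feed item is parsed, site headers, footers and comment threads never enter the text, which addresses the scraper problem noted above. Calling `parse_feed` again with the same `seen_urls` returns nothing until the feed carries a new item.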

Applications: Blogosphere

• Questions:

– (How) does the ‘A-List’ change over time?

– (How) does political alignment change over time?

– How strong is cross-connection across clusters?

– What topics are discussed (e.g. compared with MSM)?

– What happens when power changes hands: is blogging an oppositional practice?

– Beyond left and right (beyond politics!): identification of blog genres based on textual / linkage patterns (qualitative follow-up necessary)

(Adamic & Glance, 2005)

MSM Patterns of Activity (Jan.-Aug. 2009)

[Chart: Australian News series, 12 Jan. to 10 Aug. 2009; vertical axis 0 to 600. Annotations: Bushfires, Budget, Artefact, Qld Election, Utegate Pt. 2?]

Blog Patterns of Activity (Jan.-Aug. 2009)

[Chart: Blog series, 12 Jan. to 10 Aug. 2009; vertical axis 0 to 80. Annotations: Bushfires, Budget, Artefact, Qld Election, Obama, Utegate Pt. 1, Utegate Pt. 2]

Opinion Patterns of Activity (Jan.-Aug. 2009)

[Chart: Opinion series, 12 Jan. to 11 Aug. 2009; vertical axis 0 to 90. Annotations: Australia Day, Budget, Qld Election, Obama, Utegate Pt. 2, Artefact]
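Activity curves of this kind can be built from retained post timestamps by counting posts per day and then flagging days that stand out. The slides do not say how the charts were produced, so the following is only a sketch: the sample dates are invented, and the z-score rule with a threshold of 2.0 is an assumed, deliberately simple peak test.

```python
from collections import Counter
from datetime import date, timedelta
from statistics import mean, pstdev

# Invented sample: one post per day from 15 to 25 June 2009,
# plus five extra posts on 19 June.
post_dates = [date(2009, 6, 15) + timedelta(days=d) for d in range(11)]
post_dates += [date(2009, 6, 19)] * 5


def daily_counts(dates, start, end):
    """Return (day, count) pairs for every day in the range, zeros included."""
    counts = Counter(dates)
    span = (end - start).days + 1
    days = (start + timedelta(days=d) for d in range(span))
    return [(day, counts.get(day, 0)) for day in days]


def flag_peaks(series, threshold=2.0):
    """Return days whose count lies more than `threshold` std devs above the mean."""
    values = [n for _, n in series]
    mu, sigma = mean(values), pstdev(values)
    if sigma == 0:
        return []  # perfectly flat series: nothing stands out
    return [day for day, n in series if (n - mu) / sigma > threshold]


series = daily_counts(post_dates, date(2009, 6, 15), date(2009, 6, 25))
peaks = flag_peaks(series)
```

A flagged day still needs interpretation. As the "Artefact" labels on the charts suggest, some spikes reflect data-gathering problems instead of genuine events.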

Utegate in the Australian Blogosphere

19-24 June 2009

19 June 2009: Opposition Senator Abetz reads from alleged email from PM advisor to Grech during Senate enquiry

19 June 2009: Turnbull accuses Rudd of corruption and lying to parliament

22 June 2009: Federal Police raid Grech’s house and find email

22 June 2009: Email found to be fake, created by Grech

Utegate in the Australian Blogosphere

4-5 August 2009

4 Aug. 2009: Grech admits forging email

4 Aug. 2009: Auditor-General’s report finds no wrongdoing by PM or Treasurer

Acknowledgements:

Data gathering and processing by Lars Kirchhoff and Thomas Nicolai (Sociomantic Labs, Berlin)

Concept maps by Tim Highfield (QUT)

(Preliminary stage for ARC Discovery project, 2010-12)

Applications: last.fm vs. Billboard

• Tracking listening patterns:

– Billboard = sales charts

– last.fm = listening activity

– Comparing sales and use of new releases

– Identifying brief flashes and slow burners

– Distinguishing casual listeners and committed fan groups

– Providing market information to the music industry

(Adjei & Holland-Cunz, 2008)

Application: Wikipedia Content Dynamics

• Tracking editing patterns:

– Identifying stable/unstable content in Wikipedia

– Highlighting controversy, vandalism, sneaky edits

– Tracking consensus development

– Tracking responses to developing stories

– Establishing trustworthiness based on extent of peer review

– Highlighting most hotly debated (edited) sections of text

(http://www.research.ibm.com/visual/projects/history_flow/capitalism1.htm)

(http://trust.cse.ucsc.edu/)

For More Ideas: VisualComplexity.com

_______ Science Emerges

• Web Science Research Initiative (Tim Berners-Lee et al.)

– Science, technology, computer engineering, …

– Limited inclusion of media, cultural, and communication studies

– Strong focus on Semantic Web, artificial ontologies

• Cultural Science + Cultural Science Journal (John Hartley et al.)

– Media & cultural studies, evolutionary economics, anthropology, …

– Limited inclusion of computer sciences, technology

– Strong focus on culture, innovation, evolutionary dynamics

• Data mining and visualisation

– Substantial commercial work on data mining

– Visualisation experiments in communication design and visual arts

Looking Ahead

• Critical, interdisciplinary approaches

– Need to better connect cultural studies, computer science, research technology developments

– Need to interrogate in-built assumptions of existing technologies

– Need to explore and investigate visualisation and analysis methods

– Need to develop cross-platform approaches and connect with more conventional research

• Open questions

– Ethics of working with technically public, but notionally private data

– Potential (ab)use of data mining techniques and/or research results by corporate and government interests

– What new knowledge can such research contribute?

Where do you want to go from here?