cscw ’19, austin, tx, usa computational media lab @anu

18
Estimating attention flow in online video networks Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie Computational Media Lab @ANU: http://cm.cecs.anu.edu.au CSCW ’19, Austin, TX, USA

Upload: others

Post on 27-Nov-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

Estimating attention flow in online video networks

Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie

Computational Media Lab @ANU: http://cm.cecs.anu.edu.auCSCW ’19, Austin, TX, USA

Page 2: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

2

Recommender systems are ubiquitous in online platforms

Page 3: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

Effects of recommender systems:what does the network look like? how does it affect video popularity?

The evolution of YouTube recommender systems

Method Papers

Collaborative Filtering [Davidson et al. RecSys ’10][Bendersky et al. KDD ’14]

Deep Learning [Covington et al. RecSys ’16][Beutel et al. WSDM ’18]

Reinforcement Learning [Chen et al. WSDM ’19][Ie et al. IJCAI ’19]

Unbiased recommendation [Zhao et al. RecSys ’19][Yi et al. RecSys ’19]

3

Adele - Hello

Page 4: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

4

The release of “Hello” excited other videos from Adele.

The “Hello” effect

The release of “Hello”

less views more views

Page 5: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

5

1. How to build the network of videos from recommender systems?

3. How to model video popularity under recommender systems?

Talk outline

2. Characteristics of the recommendation network

Page 6: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

6

VEVO music graph dataset

● 60,740 music videos from 4,435 VEVO artists who are active in major English-speaking countries.

● 337K~394K directed links in 63 daily snapshots.

● Links consist of non-personalized feed from YouTube API.

Adele - Hello

Katy Perry - Roar

Katy Perry - Last Friday Night

Ellie Goulding - Love Me Like You Do

OneRepublic - Counting Stars

Jennifer Lopez - On The Floor

Mark Ronson - Uptown Funk

Justin Bieber - Sorry

Justin Bieber - Beauty And A Beat

Jessie J - Bang Bang

Rihanna - Diamonds

P!nk - Just Give Me A Reason

Adele - Rolling in the Deep

smaller- larger indegree

less- more views

Page 7: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

7

Rank higher in API → more likely to display on video webpage, with higher rank.

Relations for recommendations on YouTube webpage and API

p=0.84

p<0.1

Page 8: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

8

Videos disproportionately point to more popular videos

Page 9: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

● LSCC: largest strongly connected component.

● IN: nodes can reach LSCC, but not reachable from the nodes in LSCC.

● OUT: nodes that can be reached by LSCC but not pointing back to LSCC.

11

The bow-tie structure

Graph structure in the Web. Broder et al. WWW ’00

Web graph 1997 VEVO network

Page 10: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

10

The attention bow-tie of Vevo network

structure bow-tieattention bow-tie

● Attention flow in one direction: IN → LSCC → OUT.

● LSCC (23.1% of the videos) occupies most of the attention (82.6% of the views).

● IN component shrinks (68% → 12%).

Page 11: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

11

1. How to build the network of videos from recommender systems?Building a non-personalized video network from YouTube API

3. How to model video popularity under recommender systems?

Talk outline

2. Characteristics of the recommendation network(a) Macroscopic profilings

(b) Microscopic profilings

(c) Temporal patterns

Page 12: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

12

Temporal evolution of Vevo network

434K (25%) links only appear once, 54K (3.1%) links appear in every snapshot.

Page 13: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

13

Building a persistent network

● 2 popularity filters: (a) avg. daily views >= 100; (b) >= 1% compared to target videos.

● A link is maintained/added if it appears in a majority (>=4) of surrounding 7 days windows.

● Persistent network: 52,758 directed links; 28,657 source videos → 13,710 target videos.

Observed links

Smoothed linkst t+1 t+2 t+4t+3 t+5 t+7t+6 t+8

Maintain

Add back

Remove

Page 14: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

14

Baseline methods

● Seasonality -> Seasonal Naive model (SN)

● Autocorrelation -> AutoRegressive model (AR)

● RNN with LSTM units

Page 15: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

15

ARNet model and results

Baselines:● Seasonality -> Seasonal Naive model (SN)

● Autocorrelation -> AutoRegressive model (AR)

● RNN with LSTM units

Proposed model:AutoRegressive + Network (ARNet)

Page 16: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

Estimated network contribution ratio:

16

Which artists benefit the most from the recommendation network?

Mark Ronson

Pharrell Williams

4 Non Blondes

Hoobastank

J-Kwon (American rapper)

Page 17: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

17

1. How to build the network of videos from recommender systems?Building a non-personalized video network from YouTube API

2. Characteristics of the recommendation network(a) Macroscopic profilings

(b) Microscopic profilings

(c) Temporal patterns

3. How to model video popularity under recommender systems?(a) A model taking account of network information

(b) Estimating link strength for each recommendation link

Summary

Page 18: CSCW ’19, Austin, TX, USA Computational Media Lab @ANU

18

1. How to build the network of videos from recommender systems?Building a non-personalized video network from YouTube API

2. Characteristics of the recommendation network(a) Macroscopic profilings

(b) Microscopic profilings

(c) Temporal patterns

3. How to model video popularity under recommender systems?(a) A model taking account of network information

(b) Estimating link strength for each recommendation link

Estimating Attention Flow in Online Video NetworksCode and datasets: https://github.com/avalanchesiqi/networked-popularity

Future work● Measuring link properties, e.g., diversity/novelty between video pairs● Training a shared RNN model on videos with similar dynamics