walter willinger at&t research labs reza rejaie, mojtaba torkjazi, masoud valafar university of...

33
Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi , Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle WA

Upload: erick-bond

Post on 17-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Walter WillingerAT&T Research Labs

Reza Rejaie, Mojtaba Torkjazi, Masoud ValafarUniversity of Oregon

Mauro MaggioniDuke University

HotMetrics’09, Seattle WA

Page 2: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

MotivationOnline Social Networks (OSNs) are becoming increasingly

popular over the InternetThis growing popularity has motivated researchers to

characterize user connectivity and user interaction in OSNsExample: Facebook

Launched in 2004 and opened up to the general public in 2006 More than 200M users as of Early 2009 and 300K new users per

day By late 2008, 300K images per second and 10 billion photos in

total Characterizing OSNs is critical for

Developing measurement and performance modeling/analysis tools

Improving OSN network architecture and system design Understanding privacy and user behavior

Much of the existing OSN research studies seems to have lost sight of this unique opportunity for characterizing OSNs

6/19/2009 2HotMetrics 2009 - Seattle WA

Page 3: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

State-of-the-art in OSN CharacterizationMain focus has been on

Simple connectivity structures (e.g. friendship graphs or interaction graphs)

Graph metrics such as node degree distribution, clustering coefficient, density, diameter

Little is known about the actual structure and dynamics User arrival/departure to/from the system User interactionsGrowth rate of the system

6/19/2009 3HotMetrics 2009 - Seattle WA

Page 4: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

This PaperWe argue that OSN research has to change

courseAbandon the traditional treatment of OSNs as

static networks and become serious about dealing with dynamic nature of real OSNs

Come up with new techniques/tools for collecting and analyzing relevant data

6/19/2009 4HotMetrics 2009 - Seattle WA

Page 5: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Static Friendship Graph Caveat emptor: our toy examples are for illustration purposes only,

they are not meant to describe real-world OSNs Toy example of static friendship graph (TOYFB)

Hierarchical Scale-Free networks [Barabasi2002] HSF(n, m), n: size of the cell, m: number of levels

HSF graphs show power law node degree distribution, rich local clustering properties, and well-defined cluster-within-cluster structure

6/19/2009

Step 1: HSF(5,0)

Step 2: HSF(5,1)

Step 3: HSF(5,2)

5HotMetrics 2009 - Seattle WA

Page 6: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Dynamic Friendship Graph (I)

Show a very elementary and highly stylized evolutionary process125 users join our toy OSN over timeBecome friends with other users over timeBecome inactive after a while

Consider time interval [0, 1], where graph structure changes at time points 1/16, 2/16, ..., and 16/16At any of these discrete points, some friendship

relations become inactive, and some new friendships are being established

6/19/2009 6HotMetrics 2009 - Seattle WA

Page 7: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 1/16

time7HotMetrics 2009 - Seattle WA

Page 8: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 2/16

time8HotMetrics 2009 - Seattle WA

Page 9: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 3/16

time9HotMetrics 2009 - Seattle WA

Page 10: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 4/16

time10HotMetrics 2009 - Seattle WA

Page 11: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 5/16

time11HotMetrics 2009 - Seattle WA

Page 12: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t =6/16

time12HotMetrics 2009 - Seattle WA

Page 13: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 7/16

time13HotMetrics 2009 - Seattle WA

Page 14: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 8/16

time14HotMetrics 2009 - Seattle WA

Page 15: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 9/16

time15HotMetrics 2009 - Seattle WA

Page 16: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 10/16

time16HotMetrics 2009 - Seattle WA

Page 17: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 11/16

time17HotMetrics 2009 - Seattle WA

Page 18: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 12/16

time18HotMetrics 2009 - Seattle WA

Page 19: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 13/16

time19HotMetrics 2009 - Seattle WA

Page 20: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 14/16

time20HotMetrics 2009 - Seattle WA

Page 21: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 15/16

time21HotMetrics 2009 - Seattle WA

Page 22: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 16/16

time22HotMetrics 2009 - Seattle WA

Page 23: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Dynamic Friendship Graph (III) A full crawl of TOYFB is identical to HSF(5,2), since friend lists

maintained by users do not reflect any de-activation of friendship relations Properties of the temporal snapshots , TOYFB(t), are radically different

from those of the static counterpart TOYFB What are the effective and efficient methods for accurately capturing and

systematically characterizing the dynamic nature of large-scale real-world OSNs?

6/19/2009 23HotMetrics 2009 - Seattle WA

Page 24: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Measurement - StaticCrawling complete snapshots does not scale

Limited rate of crawling (e.g. 10 query/sec in Flickr)

Large population of OSNs (millions of users)Partial snapshot is likely to be distorted and

biased towards high degree nodes Graph sampling is a promising approach for

characterizing node properties [Stutzbach2006, Rasti2009]

6/19/2009 24HotMetrics 2009 - Seattle WA

Page 25: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Measurement - DynamicGoal of sampling: collect a “representative” set

of usersIn an unknown graph changing underneath any

measurement tool (e.g. crawler), what does “representative” mean?

Even with a solid definition for “representative” snapshot, how to develop appropriate sampling techniques to deal with dynamic nature of OSNs?

Knowing all the challenges for measuring OSNs, is there any innovative approach for future analysis?

6/19/2009 25HotMetrics 2009 - Seattle WA

Page 26: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Analysis (I)A new approach based on the following two observations in

OSNsClustering at different spatial scales Faster and more noisy temporal dynamics at finer levels of

resolutions of the graphMulti-Resolution Analysis (MRA) for graphs

Start from a coarse-scale representation that is typically small in size and has a slow dynamics

Use the insight gained at this scale to study the graph at the next finer levels of resolutions

Diffusion Wavelets (DW) as a principled approach for graph MRADW provides the necessary mathematical framework for

performing the above graph coarsening intuition [Maggioni2004]

6/19/2009 26HotMetrics 2009 - Seattle WA

Page 27: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

Analysis (II)TOYFB at scale 2 is smaller in size than TOYFB at scale 1TOYFB at scale 2 has slower dynamics than TOYFB at scale 1

6/19/2009

Scale 1

Scale 2

Scale 3

27HotMetrics 2009 - Seattle WA

Page 28: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 1/16

time28HotMetrics 2009 - Seattle WA

Page 29: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 5/16

time29HotMetrics 2009 - Seattle WA

Page 30: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 9/16

time30HotMetrics 2009 - Seattle WA

Page 31: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009

t = 13/16

time31HotMetrics 2009 - Seattle WA

Page 32: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

ConclusionStop ignoring the dynamic nature of OSNsFor a better understanding of OSNs

Abandon current traditional measurement, modeling, analysis, and validation approaches

Replace them by new techniques that can account for the most of the dynamic features of real-world OSNs

New methodologies are required for advancingOSN measurement: extend and develop current

graph sampling techniques to consider churn in OSNs

OSN analysis: apply Multi-Resolution Analysis (MRA) methodology for large-scale dynamic graph structures

6/19/2009 32HotMetrics 2009 - Seattle WA

Page 33: Walter Willinger AT&T Research Labs Reza Rejaie, Mojtaba Torkjazi, Masoud Valafar University of Oregon Mauro Maggioni Duke University HotMetrics’09, Seattle

6/19/2009 33HotMetrics 2009 - Seattle WA