social and technological network data analytics lecture 2 ... · social and technological network...

36
Social and Technological Network Data Analytics Lecture 2: Small World and Weak Ties Prof. Cecilia Mascolo

Upload: vuongkhuong

Post on 15-Feb-2019

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

SocialandTechnologicalNetworkDataAnalytics

Lecture2:SmallWorldandWeakTies

Prof.CeciliaMascolo

Page 2: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

InThisLecture

• Wewillcomparerandomnetworkswithrealnetworks

• Wewillintroducetheconceptofsmallworldnetworks

• Wewillintroducetheconceptofweaktiesandillustratetheirimportance

Page 3: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

ClusteringCoefficientofRealNetworks

• From[WattsandStrogatz,1998]• Characteristicpathlengthandclusteringcoefficientforsomerealnetworksandforrandomnetworkswithsamenumberofnodesandaveragenumberofedgespernode.

• Aimistocheckifrandomgraphscanmodelrealnetworks.

Page 4: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

RealNetworksvs RandomNetworks

• FilmActors:actorsinmoviestogether• Powergrid:thenetworkoftheelectricitygenerators

• C.elegans:networkofneuronsofaworm• LiscomparablewhileCisverydifferent

RandomNet:samesizeandaveragedegree

Page 5: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

SmallWorldModel

• Watts&Strogatz builtamodelwhichwasabletocapturethesecharacteristics.

• Startwithregularlattice– Increaseaprobabilitypof“rewiring”anodetoanothernode.

–Whenpveryhighthelatticewouldbecomearandomgraph.

Page 6: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

SmallWorldModel(2)

Page 7: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

HowareLandCinthismodel?• ThereisazonewhereCishighandLislow• Thesearesmallworldnetworks

RandomNets

Lattice

Page 8: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

OtherRealNetworksExamples

Page 9: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

AnalysisofMessengerNetwork

• [Leskovec andHorvitz2008]analyzedalargedatasetoftheMicrosoftMessenger.

• CommunicationNetworkcontained180millionusersand1.3billionconversationsin1month.

• BuddyNetworkcontained240millionusers.• 99.9%usersbelongedtoaconnectedcomponent.

Page 10: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

AnalysisofaMessengerNetwork• Averageshortestpathis6.6(confirmingMilgram’s study).

• Althoughsomelongerpathsupto29.• Averageclusteringcoefficientisquitehigh:0.137.

Page 11: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

AgainonClusteringCoefficient• Wehaveintroducedtheclusteringcoefficient.Thisindicates:– ThenumberoftrianglesincludingnodeA.– HowconnectedthefriendsofAare.

• Triadicclosure:ifCandBareconnectedtoAthereisanincreasedlikelihoodthattheywillbeconnectedamongthemselvesinfuture.

A B

C DE F

Page 12: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

[Granovetter’74]

• Granovetter interviewedpeopleabouthowtheydiscoveredtheirjobs– Mostpeopledidsothroughpersonalcontacts– Oftenthepersonalcontactsdescribedasacquaintancesandnotclosefriends

• Basicintuitiononthisis:closefriendsarepartoftriadclosuresandwouldknowwhatyouknowandwouldknowotherswhowouldknowwhatyouknow

• Wewillexplainthismoreformally…

Page 13: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Bridges

• EdgebetweenAandBisabridge if,whendeleted,itwouldmakeAandBliein2differentcomponents

A

B

Page 14: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

LocalBridges

• Anedgeisalocalbridgeifitsendpointshavenofriendsincommon– Ifdeletingtheedgewouldincreasethedistanceoftheendpointstoavaluemorethan2.

A

B

Page 15: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

StrongTriadicClosureProperty(STPC)

• Linksbetweennodeshavedifferent“value”:strongandweakties– E.g:Friendshipvs acquaintances

• StrongTriadicClosureProperty(Granovetter):IfanodeAhastwostronglinks(toBandC)thenalink(strongorweak)mustexistbetweenBandC.

Page 16: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

LocalBridgesandWeakTies• IfnodeAsatisfiestheSTCPandisinvolvedinatleasttwostrongtiesthenanylocalbridgeitisinvolvedinmustbeaweaktie.(Proofbycontradiction)

(assumingSTCP)Ifthereareenoughstrongtiesinthenetworkthenlocalbridgesmustbeweakties

B

CA

S

S

ForACandABtobeastronglinkSCTPsaysBCmustexistbutlocalbridgedefinitionsaysitmustnot

Page 17: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

RealDataValidation

• Granovetter’s theoryabouttheimportanceofweaktiesremainednotvalidatedforyearsforlargesocialnetworksduetothelackofdata.

• [Onnela etal’07]testeditoveralargecell-phonenetwork(4millionsusers):– Edgebetweentwousersiftheycalledeachotherwithinthe18monthsperiod.

– Dataexhibitsagiantcomponent(84%).– Edgeweight:timespentinconversation.

Page 18: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Onnela etal.2007

• Extendingthedefinitionoflocalbridge• Given:• Neighbourhood overlap:

Numberofnodeswhoareneighbours ofbothA&BNumberofnodeswhoareneighbours ofatleastAorB

• Whenthenumeratoris0thequantityis0.– Numeratoris0whenABisalocalbridge

• Thedefinitionfinds“almostlocalbridges”(~0)

A B

Page 19: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Neighbourhoodoverlap

Page 20: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

RelationshipofOverlapwithTieStrength

• Red:randomshuffledweightsoverlinks.

• Blue:realones.Correlationwithtiestrength.

Anomaly

Tiestrength:cumulativetiestrengthsmallerthanw

Overlap

Page 21: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Realtieweightsinaportionofthegraph(aroundarandomnode)

A=RealB=Randomlyshuffled

Page 22: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Effectofedgeremoval

Page 23: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Overlapbasedlinkremoval

Page 24: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Weaktiesmatter!

• Wehavejustseenthatweaktiesmatterandiftheyareremoved,theyleadtoabreakdowninthenetwork.

• Ifstrongtiesareremovedtheyleadtoasmoothdegradingofthenetwork

Page 25: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Differenceofimportanceofweaktiesinsocialandothernetworks

• Theimportanceofweaktiesisspecifictosocialnetworks

• Inbiologicalandspatialnetworks:– Deletinganimportantroad[strongtie]damagesthenetworkmore

– Acentralveininaleafismoreimportantthansmallerveins

Page 26: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Tiestrengthmatters:FacebookExample

• Facebook dataanalysisofonemonthofdata• Fournetworks:– Declaredfriendship– Reciprocalcommunication(messages)– Onewaycommunication–Maintainedrelationship:clickingoncontentonnewsfeedfromotherfriendorvisitingprofilemorethanonce.

Page 27: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Whatdoesitlooklike?(onerandomuser)

Page 28: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

ActiveNetworkSize:numberoflinks

Declaredfriends

Newsfeedeffect

Page 29: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

AnotherstudyonFBshowstheimpactoftiesoverinformationdissemination• 3monthsofFBdata• 253millionusers(profileandlocation)

• Measuringeffectoftiestrengthonsharing

Page 30: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Howdidtheymeasuretiestrength?

• Privateinteractions• Publicinteractions(comments)

• Coappearance inpictures

• Involvementinthesamepostwithcomments

Page 31: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Strongtiesaremoreinfluential

Page 32: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

However…

Strongtiesaremoreinfluential buttheireffectisnotlargeenoughtocompensatetheabundanceofweakties…

Page 33: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

TwitterAnalysis

• Huberman atal.haveanalyzedstrongandweaktiesinTwitter.

• The“followers”graphinTwitterisdirected– Someonecanfollowsomeoneelsewhodoesnotfollowhim

• Messagesof140charscanbeposted• Messagescanbeaddressedtospecificusers(althoughtheystayreadabletoall)

• Weakties:usersfollowed• Strongties:userstowhomtheusersentatleast2messagesintheobservationperiod

Page 34: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Twitter

Followees

Numbero

fuser’sstrongties

Numberofstrongtiesstaysbelow~50

Page 35: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

Summary

• Smallworldnetworkmodelsareabletocaptureagoodquantityofrealnetworks– Theyhavecharacteristicpathlengthcomparabletorandomnetworks.

– Butmuchhigherclusteringcoefficient.• Wehaveintroducedweakandstrongtiesandshownexampleofapplicationonrealnetworks

Page 36: Social and Technological Network Data Analytics Lecture 2 ... · Social and Technological Network Data Analytics Lecture 2: Small World ... Lento , and Itamar Rosenn ... Bernardo

References• Kleinberg’sbook:Chapter3and20.

• Collectivedynamicsof'small-world' networks. Watts,D.J.;Strogatz,S.H.(1998).Nature 393(6684):409–10.

• Structureandtiestrengthsinmobilecommunicationnetworks.J.P.Onnela,J.Saramaki,J.Hyvonen,G.Szabo,D.Lazer,K.Kaski,J.Kertesz,A.L.Barabasi.ProceedingsoftheNationalAcademyofSciences,Vol.104,No.18.(13Oct2006),pp.7332-7336.

• Maintainedrelationships onfacebook.CameronMarlow,LeeByron,TomLento,andItamar Rosenn.2009.On-lineathttp://overstated.net/2009/03/09/maintained- relationships-on-facebook.

• Theroleofsocialnetworksininformationdiffusion.Eytan Bakshy,ItamarRosenn,CameronMarlow,andLada Adamic.2012.InProceedingsofthe21stinternationalconferenceonWorldWideWeb (WWW'12).ACM,NewYork,NY,USA,519-528.

• Socialnetworksthatmatter:Twitterunderthemicroscope. BernardoA.Huberman,DanielM.Romero,andFangWu.FirstMonday,14(1),January2009.