importance

45
Importance of individual events in temporal networks Taro Takaguchi 1 , Nobuo Sato 2 , Kazuo Yano 2 , and Naoki Masuda 1 1 Department of Mathematical Informatics, The University of Tokyo 2 Central Research Laboratory, Hitachi, Ltd.

Upload: naoki-masuda

Post on 25-Jun-2015

4.634 views

Category:

Technology


0 download

DESCRIPTION

Presentation slides for:T. Takaguchi, N. Sato, K. Yano, N. Masuda.Importance of individual events in temporal networks.Preprint: arXiv:1205.4808

TRANSCRIPT

Page 1: Importance

Importance of individual events in temporal networks

Taro Takaguchi1, Nobuo Sato2, Kazuo Yano2, and Naoki Masuda1

1 Department of Mathematical Informatics, The University of Tokyo2 Central Research Laboratory, Hitachi, Ltd.

Page 2: Importance

2

Interests: patterns in human communication behavior

photos from flickr

By garryknight

By opacity

By infomatiquetwitter.com/#!/duncanjwatts

Page 3: Importance

3

More extensive data, more detailed analysis

Cell-phone calling network(Onnela et al., NJP 2007)

• Huge populations (~millions)• High temporal resolution (~minute)• Additional information (e.g., locations, history of purchases)

Business Microscope system(Hitachi, Ltd., Japan)

Name tagwith an infrared module

http://www.hitachi-hitec.com/jyouhou/business-microscope/

Page 4: Importance

4

Temporal networks

Reviewed by Holme and Saramäki, Phys. Rep. 2012

Represented by sequences of events with time stamps

static (aggregated) network

time

2

1

3

4

21

3 4

✓ node 1 → node 4 (temporal path)- node 4 → node 1

Page 5: Importance

5

Impact of interevent intervals

Different temporal paths from node 2 to node 3may have different impacts onepidemics, information propagation, etc.

time

2

1

3

1 1

2

1

3

1

2 3

Page 6: Importance

6

Evaluate the importance of each event

Question: which events are important?

time

2 2 21 1 1

3 34 4 3 4

• time-dependent centrality of links

Page 7: Importance

7

Importance of events

Defined by the amount of new information about others

time

Note:“information” ≠ contents of conversation

1

2

3

4

Page 8: Importance

8

Importance of events

Defined by the amount of new information about others

time

Note:“information” ≠ contents of conversation

latest information

Before the event:

1

2

3

4

Page 9: Importance

9

Importance of events

Defined by the amount of new information about others

Note:“information” ≠ contents of conversation

Before the event:

timelatest information

1

2

3

4

Page 10: Importance

10

Importance of events

Defined by the amount of new information about others

Note:“information” ≠ contents of conversation

After the event:

timelatest information

1

2

3

4

Page 11: Importance

11

Importance of events

Defined by the amount of new information about others

Note:“information” ≠ contents of conversation

After the event:

timelatest information

1

2

3

4

Page 12: Importance

12

Concept (1): vector clock and latency

Lamport, Commun. ACM 1978; Mattern, 1988

Vector clock of node

At time , has the latest information about at time

Example:

time

Page 13: Importance

13

Concept (2): advance of event

Kossinets et al., Proc. 14th ACM SIGKDD 2008

Advance for owing to an event between and

time

Page 14: Importance

14

Calculation of importance

Read the given event sequence in the chronological order.1. Update every ‘s information about .

Assumption:• Individuals can be involved in multiple events in a single snapshot.• Information can spread up to hops within a snapshot. (called “horizon” in Tang et al., Proc. 2nd ACM SIGCOMM WOSN 2009)

2. Calculate and for all the events at .

3. Importance = symmetrized advance

Page 15: Importance

15

Calculation of importance

Read the given event sequence in the chronological order.1. Update every ‘s information about .

Assumption:• Individuals can be involved in multiple events in a single snapshot.• Information can spread up to hops within a snapshot. (called “horizon” in Tang et al., Proc. 2nd ACM SIGCOMM WOSN 2009)

2. Calculate and for all the events at .

3. Importance = symmetrized advance

Page 16: Importance

16

Calculation of advance (1)

Source nodeh-neighbors having the latest information about

& being at the shortest distance from

(defined for each )

: source node

Snapshot at

Page 17: Importance

17

Calculation of advance (2)

Contributing neighbors ‘s neighbors that are on a shortest path     from a nearest source node (about ) to

and contribute .

Snapshot at

: source node

: contributing neighbor

Page 18: Importance

18

Case 1: multiple source nodes with different distances

Assumption:Only the closest ones convey the information.

is not a contributing neighbor.

: source node

: contributing neighbor

Snapshot at

Page 19: Importance

19

Case 2: multiple source nodes with the same distance

Assumption:Contributing neighbors equally contributeregardless of the number of shortest paths they bridge.

and contribute .

: source node

: contributing neighbor

Snapshot at

Page 20: Importance

20

Application to real data

Page 21: Importance

21

Research questions

1. How is the importance distributed? Broadly?2. Is the advance asymmetric? (i → j versus j → i)3. Is the importance “valid”?

Data setSituation Company office in Japan

Participants 163

Period / resolution 73 days / 1 min

Total events 118,546

Data was collected by World Signal Center, Hitachi, Ltd.

Page 22: Importance

22

Parameter

We set .

Information can spread to all nodes in the connected componentwithin a snapshot.

Page 23: Importance

23

1,2. Importance is broadly distributed & asymmetric

max = min on the diagonal

frequencyof events

Page 24: Importance

24

3. Is the importance of event “valid”?

time

Event removal testHypothesis: Removal of events with large importance values

1. makes “temporal distance” longer.2. makes node pairs disconnected.

Page 25: Importance

25

Two measures to characterize the connectivity

Reachability ratio (Holme, PRE 2005)

Network efficiency (Tang et al., Proc. 2nd ACM SIGCOMM WOSN 2009)

with at least one temporal path from to

: time average of latency

fully connecteddisconnected

fully connectedwith small latency

disconnectedor large latency

Page 26: Importance

26

Time average of latency

Pan & Saramäki, PRE 2011

is not defined forProblem:

Solution: a periodic boundary condition

Time average sum of

Page 27: Importance

27

Five schemes of event removal

• ascending/descending orders of the importance

• ascending/descending orders of the link weight

• random order

fraction of removed events

?Fraction of connected pairs

Shortness of temporal paths

# events on the link

Page 28: Importance

28

Ascending/descending orders of the link weight

1. Choose a link with the smallest/largest weight.# events on the link

2. Remove an event on the link at random. Decrease the weight of the link by one.

Static (aggregated) network

Page 29: Importance

29

Event removal tests based on the importance

1. Removal of 80% unimportant events influences little (Robustness).2. Removal of 20% important events considerably decreases connectivity.

Page 30: Importance

30

Comparison with the results based on the link weight

Event removals based on temporal/static information are similar but different.

Page 31: Importance

31

Removal of weak links fragments static network

“Strength of weak ties” property(Granovetter, AJS 1973; Onnela et al., PNAS 2007)

Weak links connect different communities mainly composed of strong links.

Takaguchi et al., PRX 2011

Page 32: Importance

32

Do we need to consider the importance?

YES, we do need consider the importance, because:

1. Events on weak links are necessary but NOT sufficient for connecting efficient temporal paths.

2. Events with large importance are necessary and sufficient for connecting efficient temporal paths.

Ascending-link-weight removal efficiently cuts off temporal paths.Information about the importance is not necessary.

A criticism

Page 33: Importance

33

Correlates of the importance value

Length of the IEI

# total events # total events involving i or j

# partners of i or j

0.819 0.701 0.701 0.630

Spearman’s rank correlation coefficient between the importance value and

time

IEI: interevent interval

Page 34: Importance

34

Latest IEI approximates the importance

Page 35: Importance

35

time

of a typical individual

Bursty activity patterns (Barabási, Nature 2005)

(Takaguchi et al., PRX 2011)

Origin of the robustness

Page 36: Importance

36

Exploration of the effect of burstiness

Carry out the event removal tests for the temporal networks generated by

(i) Shuffled IEIs (interevent intervals)

(ii) Poissonized IEIs

For each pair,time

Reassign random time to each event.Events follow Poisson process.

Page 37: Importance

37

Characteristics conserved / lost by the randomizations

Original Shuffled IEIs Poissonized IEIs

Weighted network structure ✓ ✓ ✓

Burstiness ✓ ✓ -Temporal correlations, etc. ✓ - -

Page 38: Importance

38

1. Temporal correlation is not necessary

Results for Shuffled IEIs Results for the original data

Page 39: Importance

39

2. Burstiness (long-tailed IEIs) is essential

Results for Poissonized IEIs ≠ Results for the original data & Shuffled IEIs

Removal of unimportant events rapidly spoils network efficiency.

Page 40: Importance

40

Effect of the weighted network structure

(iii) Rewiring

1. Make an Erdös-Rényi random graphwith the same number of nodes and links as the original data.

2. Put the event sequences on the original linksonto links in the random graph.

time

rewired networkoriginal network

time

Page 41: Importance

41

Characteristics conserved / lost by the randomizations

Original Shuffled IEIs Poissonized IEIs Rewiring

Weighted network structure ✓ ✓ ✓ -

Burstiness✓ ✓ - ✓

Temporal correlation, etc. ✓ - - △

link weight distribution ✓ ✓ ✓ ✓

Page 42: Importance

42

3. Heterogeneity in link weights is sufficient

Skewed degree dist., community, structure-weight corr., etc. are irrelevant.

Results for Rewiring Results for the original data

Page 43: Importance

43

Effect of network structure

timei.i.d.

Regular random graph

Can bustiness explain the heterogeneity in the importanceeven without the heterogeneity in the link weight?

60 events on each link

IEI distributionspower-law + cutoff

exponential (Poisson process)

Page 44: Importance

44

Burstiness is a main cause of the robustness

Power-law IEIs on the RRG Exponential IEIs on the RRG

Page 45: Importance

45

Summary

• Importance of events in temporal networks- Based on advance of vector clocks in an event

• Heterogeneity in the importance- Long-tailed distribution and strong asymmetry

• Robustness of empirical temporal networks- Connectivity conserved after removing 80% unimportant events

• Origin of the robustness - Bursty activity patterns (i.e., long-tailed IEIs) - Heterogeneity in the link weight

Reference

Taro Takaguchi, Nobuo Sato, Kazuo Yano, and Naoki Masuda,“Importance of individual events in temporal networks”,New Journal of Physics 14, 093003 (2012). [Open Access]