growing scale-free networksby a mediation-driven ... · md. kamrul hassan and liana islam...

4
arXiv:1411.3444v2 [physics.soc-ph] 1 Dec 2014 Growing Scale-free Networks by a Mediation-Driven Attachment Rule Md. Kamrul Hassan and Liana Islam University of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh. We propose a model that generates a class of networks exhibiting power-law degree distribution with a spectrum of exponents depending on the number of links (m) with which incoming nodes join the existing network. Here, each new node first picks an existing node at random, and connects not with this but with m of its neighbors also picked at random. Counter-intuitively enough, such a mediation-driven attachment rule gives rise to superhubs for small m due to the winners take all effect, and hubs for large m due to the winners take some effect. To solve it analytically, we use mean-field approximation where we find that the inverse harmonic mean of degrees of the neighborhood of each existing node and their mean play a crucial role in determining the quality of the degree distribution. PACS numbers: 61.43.Hv, 64.60.Ht, 68.03.Fg, 82.70.Dd In the recent past, we have amassed a bewildering amount of information on our universe, and yet we are far from developing a holistic idea. This is because most of the natural and man-made real-world systems are in- tricately wired and seemingly complex. Much of these complex systems can be mapped as networks consisting of nodes connected by links, e.g., author collaboration and movie actor networks, the Internet, World Wide Web, neural and protein networks etc. [1–6]. While working on some of these real-world systems Barab´ asi and Albert (BA) in late 90s found that the tail of the degree distribu- tion P (k), the probability that a randomly chosen node is connected to k other nodes, always follows a power- law. In pursuit of theoretical explanation they realized that real networks are not static, rather they grow. They also realized that a new node does not connect with an existing one randomly rather preferentially with respect to their degrees - which is now known as the preferential attachment (PA) rule [7]. Incorporating both the ingre- dients BA proposed a model and showed that it exhibits power-law degree distribution P (k) k -γ where γ = 3. Despite the profound success of the BA model we are compelled to note a couple of drawbacks. First and fore- most, the preferential attachment (PA) rule is too direct. Second, the BA model can only account for γ = 3 while in real networks the exponent γ assumes a spectrum of values between 2 3. To overcome these drawbacks there exist a few variants of the BA model through the in- clusion of rewiring, aging, ranking etc. [8–11], and a few alternatives to the direct PA mechanism like redirection, vertex copying, and duplication mechanisms [12–15]. In this article, we propose a model in which a walker is parachuted to an arbitrary node of the existing network at random and then takes a random step to one of its neighbors. The new node is attached to the node where the walker stops. The model has been inspired by the growth of the weighted planar stochastic lattice (WPSL) [16, 17]. Seeing its dual emerges as a scale-free network, we became curious as to what happens when a graph is grown following a similar rule. That is, an existing node connects with the incoming node only if one of its neigh- bors is picked at random. We call it the mediation-driven (a) (b) FIG. 1: Snapshots of networks grown using (a) the PA rule of the BA model and (b) the MDA rule. The size of each node is drawn according to its degree showing that the hubs of networks grown using the MDA rule are richer than those of the BA networks. attachment (MDA) rule since the node where the walker first landed acts as a mediator for connection between its neighbor and the new node. We use mean-field ap- proximation (MFA) to solve the model analytically, and use numerical simulation to verify the solution. We show that for small m, the MDA rule is super-preferential due to winners take all (WTA) effect that gives rise to a few superhubs. The snapshots of the networks grown accord- ing to BA and MDA rules shown in FIG. 1 for m =2 clearly reveal the presence of hubs and superhubs respec- tively. However, for large m the WTA effect is replaced by winners take some (WTS) effect that gives rise to sim- ple hubs only. The essential idea of the MDA rule can be seen in the formation of trade links among businessmen and in the growth of the WWW. In business, a newcomer wishing to establish trade links with other businessmen can never assess the whole network to see who has how many links in order to decide who will be his partner, ow- ing to the large size of the trade world. Instead, he uses a mediator to find a suitable partner. We note that there exist a couple of works similar to our model albeit the ensuing analysis and the results are completely different from ours [18–20]. To illustrate the MDA rule we consider a miniature

Upload: others

Post on 25-Sep-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Growing Scale-free Networksby a Mediation-Driven ... · Md. Kamrul Hassan and Liana Islam University of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh

arX

iv:1

411.

3444

v2 [

phys

ics.

soc-

ph]

1 D

ec 2

014

Growing Scale-free Networks by a Mediation-Driven Attachment Rule

Md. Kamrul Hassan and Liana IslamUniversity of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh.

We propose a model that generates a class of networks exhibiting power-law degree distributionwith a spectrum of exponents depending on the number of links (m) with which incoming nodesjoin the existing network. Here, each new node first picks an existing node at random, and connectsnot with this but with m of its neighbors also picked at random. Counter-intuitively enough,such a mediation-driven attachment rule gives rise to superhubs for small m due to the winners

take all effect, and hubs for large m due to the winners take some effect. To solve it analytically,we use mean-field approximation where we find that the inverse harmonic mean of degrees of theneighborhood of each existing node and their mean play a crucial role in determining the quality ofthe degree distribution.

PACS numbers: 61.43.Hv, 64.60.Ht, 68.03.Fg, 82.70.Dd

In the recent past, we have amassed a bewilderingamount of information on our universe, and yet we arefar from developing a holistic idea. This is because mostof the natural and man-made real-world systems are in-tricately wired and seemingly complex. Much of thesecomplex systems can be mapped as networks consisting ofnodes connected by links, e.g., author collaboration andmovie actor networks, the Internet, World Wide Web,neural and protein networks etc. [1–6]. While workingon some of these real-world systems Barabasi and Albert(BA) in late 90s found that the tail of the degree distribu-tion P (k), the probability that a randomly chosen nodeis connected to k other nodes, always follows a power-law. In pursuit of theoretical explanation they realizedthat real networks are not static, rather they grow. Theyalso realized that a new node does not connect with anexisting one randomly rather preferentially with respectto their degrees - which is now known as the preferentialattachment (PA) rule [7]. Incorporating both the ingre-dients BA proposed a model and showed that it exhibitspower-law degree distribution P (k) ∼ k−γ where γ = 3.Despite the profound success of the BA model we arecompelled to note a couple of drawbacks. First and fore-most, the preferential attachment (PA) rule is too direct.Second, the BA model can only account for γ = 3 whilein real networks the exponent γ assumes a spectrum ofvalues between 2 < γ ≤ 3. To overcome these drawbacksthere exist a few variants of the BA model through the in-clusion of rewiring, aging, ranking etc. [8–11], and a fewalternatives to the direct PA mechanism like redirection,vertex copying, and duplication mechanisms [12–15].

In this article, we propose a model in which a walker isparachuted to an arbitrary node of the existing networkat random and then takes a random step to one of itsneighbors. The new node is attached to the node wherethe walker stops. The model has been inspired by thegrowth of the weighted planar stochastic lattice (WPSL)[16, 17]. Seeing its dual emerges as a scale-free network,we became curious as to what happens when a graph isgrown following a similar rule. That is, an existing nodeconnects with the incoming node only if one of its neigh-bors is picked at random. We call it the mediation-driven

(a) (b)

FIG. 1: Snapshots of networks grown using (a) the PA ruleof the BA model and (b) the MDA rule. The size of eachnode is drawn according to its degree showing that the hubsof networks grown using the MDA rule are richer than thoseof the BA networks.

attachment (MDA) rule since the node where the walkerfirst landed acts as a mediator for connection betweenits neighbor and the new node. We use mean-field ap-proximation (MFA) to solve the model analytically, anduse numerical simulation to verify the solution. We showthat for small m, the MDA rule is super-preferential dueto winners take all (WTA) effect that gives rise to a fewsuperhubs. The snapshots of the networks grown accord-ing to BA and MDA rules shown in FIG. 1 for m = 2clearly reveal the presence of hubs and superhubs respec-tively. However, for large m the WTA effect is replacedby winners take some (WTS) effect that gives rise to sim-ple hubs only. The essential idea of the MDA rule can beseen in the formation of trade links among businessmenand in the growth of the WWW. In business, a newcomerwishing to establish trade links with other businessmencan never assess the whole network to see who has howmany links in order to decide who will be his partner, ow-ing to the large size of the trade world. Instead, he uses amediator to find a suitable partner. We note that thereexist a couple of works similar to our model albeit theensuing analysis and the results are completely differentfrom ours [18–20].To illustrate the MDA rule we consider a miniature

Page 2: Growing Scale-free Networksby a Mediation-Driven ... · Md. Kamrul Hassan and Liana Islam University of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh

2

FIG. 2: A schematic description of the MDA rule.

network in FIG. 2 consisting of m0 = 6 nodes labeledi = 1, 2, ..., 6. This can be considered as the seed. Nowto connect a new node to it, we first pick a node at ran-dom from the 6 existing nodes, say node 2. Second, wepick one of its four neighbors, also with equal a priori

probability, say it is node 6, and connect it with the newnode which we label 7. In this particular example weassume that the new nodes are born with one link oredge, i.e. m = 1. In the case of new nodes arriving withmore than one edge we first pick one node randomly fromthe entire network followed by picking m of its distinctneighbors and connect them with m new edges. Now thequestion is, according to our model, what is the proba-bility Π(i) that an existing node i is finally picked andthe new node gets connected with it? Say, the node ihas degree ki and its ki neighbors, labeled 1, 2, . . . , ki,have degrees k1, k2, ..., kki

respectively. We can reach thenode i from each of these ki nodes with probabilities in-verse of their respective degrees, and each of the ki nodescan be picked at random with probability 1/N . We cantherefore write

Π(i) =1

N

[ 1

k1+

1

k2+ . . .+

1

kki

]

=

∑ki

j=11kj

N. (1)

The probabilities Π(i) are normalized, which implies that∑N

i=1

∑ki

j=11kj

= N . We have verified it numerically and

found that it is independent of m and N . The rate atwhich an arbitrary node i gains links is therefore givenby the following rate equation.

∂ki∂t

= m Π(i) (2)

The factor m takes care of the fact that any of the mlinks of the newcomer may connect with the node i.Solving Eq.(2) for Π(i) given by Eq.(1) seems quite a

formidable task unless we can simplify it in some way.A few steps of hand calculation on a network of smallsize reveals that the probability Π(i) of picking node iis higher for nodes with higher degree than those withlower degree, encouraging us to re-write Eq.(1) as

Π(i) =kiN

∑ki

j=11kj

ki. (3)

0

5000

10000

15000

20000

0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1

frequ

ency

<1/ki>

m=10m=14m=20m=50

m=100

0

0.2

0.4

0.6

0.8

1

0 50000 100000

<1/k

i>

node index, i

m = 1

0

0.002

0.004

0.006

0.008

0.01

0 50000 100000

<1/k

i>

node index, i

m = 100

FIG. 3: The distribution of IHM for different m values. Thefluctuations of IHM of individual nodes is shown in the inset(top left for m = 1 and bottom right for m = 100).

The factor

∑kij=1

1

kj

kiis the inverse of the harmonic mean

(IHM) of degrees of the ki neighbors of a node i which

we denote as⟨

1ki

for convenience. We now attempt to

replace the IHM value of each node by an effective “meanvalue”

⟨ 1

ki

⟨ 1

ki

, (4)

where the overline indicates the ensemble average of themean IHM of each realization. In this way, all the in-formation on correlations in the fluctuations is lost andhence we call it mean-field approximation (MFA). Oneimmediate consequence is that like the BA model, ourMDA rule too is preferential in character. This maysound a bit too drastic at this stage, but we shall jus-tify it anyway. It is needless to mention that this ap-proximation will work only if the fluctuations of IHM ofindividual nodes from their mean is not wild.

0.4906

0.4908

0.491

0.4912

0.4914

0.4916

0.4918

0 200000 400000 600000 800000 1e+06 1.2e+06

β(m

,N)

N

-9

-8.5

-8

-7.5

-7

-6.5

11 11.5 12 12.5 13 13.5 14

ln(β

(m,N

) - β

(m,∞

))

ln(N)

FIG. 4: Plot of m⟨

1

ki

≡ β(m,N) vs. N for m = 70. Inset

shows that β(m,N) saturates to β(m,∞) = 0.490513 alge-

braically as N−3/5.

We perform extensive numerical simulation to checkhow robust the approximation is. For that we calculate

Page 3: Growing Scale-free Networksby a Mediation-Driven ... · Md. Kamrul Hassan and Liana Islam University of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh

3

the IHM for each node of the network grown with fixedm,and plot them as a function of node label i = 1, 2, ..., N(as in the insets of FIG. 3). We find that for small valuesof m, the data points fluctuate so wildly from node to

node that the mean of⟨

1ki

over the entire size of the

network bears no meaning, which can also be seen fromthe frequency distributions of IHM shown in FIG. 3. Thefrequency distributions become symmetric as we increasem approximately beyond 14, and the fluctuations occurat increasingly lesser extents, and hence the meaning ofthe mean seems to be more meaningful. We also find thatthe mean of the IHM depends on m and is independentof network size N in the limit N → ∞ (as demonstratedin FIG. 4). So, we can write for m > 14,

Π(i) ≈kiN

⟨ 1

ki

=kiN

β(m)

m, (5)

where the factor m in the denominator is introduced forfuture convenience. Thus the value of β(m) for each fixedm is also independent of N as N → ∞. Eq.(5) confirmsthat the attachment probability Π(i) ∝ ki, and hence likethe BA model our model too is preferential in character,but not directly.

2

4

6

8

10

12

14

5 6 7 8 9 10 11 12 13 14 15

ln(k

i)

ln(t)

i = 1i = 4

i = 10i = 11i = 13

0.5

0.6

0.7

0.8

0.9

1

0 100 200 300 400 500

β

m

FIG. 5: Plots of ln(ki) versus ln(t) for 5 nodes added at fivedifferent times for m = 2. In the inset, we show variation ofthe slope β with m.

It is noteworthy that the size N of the network is anindicative of time t since we assume that only one nodejoins the network at each time step. Thus, for N >> m0

we can write N ∼ t. We can now solve the rate equation.Using Eq.(5) and N ∼ t in Eq.(2) we find that the rateequation we ought to solve is

∂ki∂t

= kiβ(m)

t. (6)

Solving it subject to the initial condition that the ithnode is born at time t = ti with ki(ti) = m we get,

ki(t) = m( t

ti

)β(m)

. (7)

The form of the solution is exactly the same as that ofthe BA model [7] except for the β value. To test Eq.(7)

we plot ln(ki) versus ln(t) in FIG. 5 for a few randomlyselected nodes and find a set of parallel straight lines.It implies that all nodes grow linearly as predicted byEq.(7). However, in contrast to the BA model, the slope(β) of the straight lines depends on m, asymptoticallyapproaching 0.5 for large m (see inset of FIG. 5). Also,there is a significant difference between β obtained using

our approximation β(m) = m⟨

1ki

, with that obtained

from the ln(ki) vs. ln(t) plots when m ≤ 14. The dif-ference gets negligible as m is increased and this too iscorrelated to the increase in the symmetry and decreasein the width of the IHM distributions.

-16

-14

-12

-10

-8

-6

-4

-2

0

0 2 4 6 8 10 12

ln(P

(k))

ln(k)

m = 1m = 15

m = 100

1.6

1.8

2

2.2

2.4

2.6

2.8

3

0 20 40 60 80 100

γ

m

FIG. 6: Plots of ln(P (k)) vs ln(k) for m = 1, m = 15 andm = 100 revealing that the MDA rule gives rise to power-lawdegree distribution. In the inset we show the variation in theexponent γ as a function of m.

We can now find the degree distribution P (k) by appre-ciating the fact that it is related to P (ti), the probabilitywith which node i is added to the system at time ti, via

P (k) dk = −P (ti) dti. (8)

The minus sign is introduced here to take into accountthat the smaller the value of ti the larger the degree ki.Substituting P (ti) =

1N

∼ 1t(since new nodes are added

to the system at equal interval of time) and the derivativeof ti with respect to ki from Eq.(7) in Eq.(8) gives

P (k) ∼ k−γ(m), where γ(m) =1

β(m)+ 1. (9)

The most immediate difference of this result from thatof the BA model is that the exponent γ depends on m.To verify it we plot in FIG. 6 ln(P (k)) vs. ln(k) us-ing data extracted from numerical simulation and findstraight lines with characteristic fat-tail. This confirmsthat our analytical solution is in agreement with the nu-merical solution. It is important to note that for smallm, especially for m = 1 there is a special point in theplot of the degree distribution that is way far above theother points. This special point corresponds to 99.5%of the all the nodes which have degree 1, and which areheld by a few hubs. This is reminiscent of the WTA

Page 4: Growing Scale-free Networksby a Mediation-Driven ... · Md. Kamrul Hassan and Liana Islam University of Dhaka, Department of Physics, Theoretical Physics Group, Dhaka-1000, Bangladesh

4

phenomenon. However, as m increases, we find that theWTA phenomenon is replaced by the WTS phenomenonand at the same time, all the data points follow the sametrend. This happens when the IHM has less noise andthe mean of IHM has meaning. Therefore, we argue thatthe IHM can be regarded as a measure of how curved orstraight the degree distribution is.The next thing we do is to give a generalized analytical

expression for γ. We fit the data and find that γ saturatesto 3 as γ = 3(1 − 0.43531e−0.11m). The value of m forwhich γ has increased to within a factor of 1 − e−1(≈63%) of its maximum value 3, is mc = 1

0.11 ≈ 9. Wecan relate this to the frequency distribution of the IHMin a way that below m = 9 the distribution is highlyasymmetric. From m = 8 to m = 14, the distributionsgradually metamorphose to assume a Gaussian shape.Also, our numerical simulations show that the whole γspectrum form ≥ 3 is between 2 and 3. Our result standsin sharp contrast with that of Yang et al. who found theexponent γ to vary in the range 1.90 to 2.61 when m wasvaried in the range 1 to 200 [18]. Besides, our expressionfor the probability Π and method of solution are verydifferent from theirs.To summarize, we have proposed a new attachment

rule, namely the MDA rule, for growing networks thatexhibit power-law degree distribution. At a glance, itmay seem MDA defies the intuitive idea of the PA rule,however, a closer look reveals otherwise. Indeed, we showexplicitly that the MDA rule is in fact not only prefer-ential but superpreferential for m < 14 and for alrge mit embodies the PA rule but in disguise. We obtainedan exact expression for Π(i) that describes the proba-

bility with which an existing node i is finally picked tobe connected with a new node. Solving the model ana-lytically for the exact expression of Π(i) appeared to bea formidable task. However, the good news is that wecould still find a way to use MFA. Later it turns out thatnot being able to solve analytically for the exact expres-sion for Π(i) was highly rewarding as it helped us gain adeeper insight into the problem. While working with theexpression for Π(i), we realized that the IHM value ofthe degrees of the neighborhood of node i plays a crucialrole in the model. We find that the fluctuations in IHMvalues of existing nodes are so wild for small m that themean, in this case, bears no meaning and hence MFA isnot valid in this case. We observed that for small m suchas m = 1 or 2, the MDA rule becomes super-preferentialin the sense that more than 97% of the nodes are ex-tremely poor in degree as they are linked to one or twoother nodes which are super-hubs. This is reminiscent ofthe WTA phenomenon. However, for large m, the fluc-tuations get weaker and their distribution starts to peakaround the mean revealing that the mean has a mean-ing. This is the regime where MFA works well and weverified it numerically. Here we found that the WTAphenomenon is replaced by the WTS phenomenon. Wehope to extend our work to study dynamic scaling anduniversality classes for the MDA model for different mand check how it differs from the BA model [21].

We acknowledge Syed Arefinul Haque for his technicalsupport.

[1] M. E. J. Newman, PNAS 101 5200 (2004).[2] M. E. J. Newman, S. H. Strogatz, and D. J. Watts,

Phys.Rev. E 64 026118 (2001).[3] M. Faloutsos, P. Faloutsos and C. Faloutsos, Comp.

Comm. Rev. 29 251 (1999).[4] R. Albert, H. Jeong and A. -L. Barabasi, Nature 401 130

(1999).[5] V. M. Eguiluz, D. R. Chialvo, G. A. Cecchi, M. Baliki

and A. V. Apkarian, Phys. Rev. Lett. 94 018102 (2005).[6] H. Jeong, S. P. Mason, A. -L. Barabasi and Z. N. Oltvai,

Nature 411 41 (2001).[7] A. -L. Barabasi and R. Albert, Science 286 509 (1999).[8] R. Albert and A.-L. Barabasi, Phys. Rev. Lett. 85 5234

(2000).[9] S. S. Manna, A. Kabakcioglu, J. Phys. A: Math. Gen. 36

L279 (2003).[10] S.N. Dorogovtsev and J.F.F. Mendes, Phys. Rev. E 62

1842 (2000).[11] S. Fortunato, A. Flammini, and F. Menczer, Phys. Rev.

Lett. 96 218701 (2006).[12] A. Gabel and S. Redner, J. Stat. Mech. 2013 P02043

(2013).[13] A. Gabel, P. L. Krapivsky, and S. Redner, Phys. Rev. E

88 050802(R) (2013).[14] J. Kim, P. L. Krapivsky, B. Kahng, and S. Redner, Phys.

Rev. E 66 055101 (2002).[15] F. Chung, L. Lu, T. G. Dewey and , D. J. Galas, J.

Computat. Biol. 10 677 (2003).[16] M. K. Hassan, M. Z. Hassan and N. I. Pavel, New J.

Phys. 12 093045 (2010).[17] M. K. Hassan, M. Z. Hassan and N. I. Pavel, Journal of

Physics: Conference Series 297 012010 (2011).[18] X. -H. Yang, S. -L. Lou, G. Chen, S. -Y. Chen, W. Huang,

Physica A 392 3531 (2013).[19] S. Boccaletti, D.-U. Hwang and V. Latora, I. J. Bifurca-

tion and Chaos 17 2447 (2007).[20] P. L. Krapivsky and S. Redner, Phys. Rev. E 63 066123

(2001).[21] M. K. Hassan, M. Z. Hassan and N. I. Pavel, J. Phys A:

Math. Theor. 44 175101 (2011).