[ieee 2010 international conference on advances in social networks analysis and mining (asonam 2010)...

2
Community Comparison in Communication Networks Belkacem Serrour and Hamamache Kheddouci Universit´ e de Lyon, Laboratoire LIESP, Bˆ at. Nautibus, 43, Bd. du 11 novembre 1918, 69622 Villeurbanne Cedex, France {belkacem.serrour,hamamache.kheddouci}@univ-lyon1.fr Abstract—If we draw the virtual topology representing the communication in networks, we observe that the structure is similar to those of the social networks. Social networks are these networks with the characteristic relating densely some entities than others. These dense zones are called communities. Generally, the members of a same community share the same interest. In this work, we look for which virtual topology (called communication graph) emerged in communication network gives communities closer to the real one. Three different communication graphs are generated and compared with the graph representing the real communities, the reference graph. Microscopic and macroscopic comparisons are done. Keywords-Community detection, community comparison, communication networks. I. I NTRODUCTION Many of the communication network protocols make appeared a social network behavior [1][2]. If we draw the virtual topology representing the communication in network, we obtain a graph structure similar to the social networks structure. Social networks are these networks characterized by some denser zones called communities. Let CG =(V,E) be a communication graph (links between communicating peers), where V represents the peers of the network and each edge of E between two peers represents the communication between them. In this work, we look for which kind of communication graph (so, which network) is closer to the real one (reference graph). The reference graph is the social network extracted from the network. It represents the real communities of interest issued in a given network. The aim is to find which communication graph is close to that reference one. We model the networks by graphs, we extract from them the reference graphs, we generate from the graphs several communication graphs based on the communication protocol taken, we use one community detection algorithm to detect communities in each graph and finally, we compare the communities of each communication graph with the communities of the reference graph. II. COMMUNICATION GRAPHS AND COMMUNITY COMPARISON Our study starts with a network. We generate the networks by varying some parameters. The taken parameters are: n network size (number of peers in the network), e network density (number of links between nodes), s number of servers in the network, r number of resources on servers and i is the interest of node (number of resources desired by a peer). We extract from the networks some graphs. The first graph is the profile graph (called also reference graph). The nodes of the reference graph are all nodes of the network and an edge is created between two nodes if the similarity be- tween the interest of these two nodes is larger than a defined threshold. The similarity between two nodes is calculated using the Jaccard’s index. The reference graph represents the real connections between similar nodes in the network. So, the extracted communities in this graph are the real existing communities of the network. The aim is to look for which communication graph giving communities closer to the communities given by the reference graph. In this purpose, we extract three different communication graphs from the network: simple, weighted and completed communication graph. For each graph (reference graph and communication graphs), we apply a community detection algorithm to detect communities and we extract some community properties. Using these properties, we do microscopic and macroscopic comparison between communities of the reference graph and communities of the communication graphs. A. Communication graphs The communication graph is the one representing the relationship between nodes communicating more. The edges of this graph have as signification, at least, one communi- cation between two nodes. We mean by communication an exchange of resources (peer v 1 ask v 2 for a resource and v 2 replays to v 1 by sending the asked resource). 1) Simple communication graph: Let SCG =(V s ,E s ) be an unweighted undirected graph representing the simple communication graph. V s represents the node set of the network. E s is the edge set of the simple graph. If two nodes in the network communicate, we make an edge between these two nodes. 2) Weighted communication graph: Let WCG = (V w ,E w ) be a weighted undirected graph representing the weighted communication graph. The weighted graph is similar to the simple graph with the difference that we count the number of communications done between nodes. 3) Completed communication graph: Let CCG = (V c ,E c ) be an unweighted undirected graph representing the completed communication graph. The completed graph is the simple graph for which we add some hidden links. 2010 International Conference on Advances in Social Networks Analysis and Mining 978-0-7695-4138-9/10 $26.00 © 2010 IEEE DOI 10.1109/ASONAM.2010.37 393 2010 International Conference on Advances in Social Networks Analysis and Mining 978-0-7695-4138-9/10 $26.00 © 2010 IEEE DOI 10.1109/ASONAM.2010.37 393 2010 International Conference on Advances in Social Networks Analysis and Mining 978-0-7695-4138-9/10 $26.00 © 2010 IEEE DOI 10.1109/ASONAM.2010.37 393

Upload: hamamache

Post on 07-Apr-2017

214 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: [IEEE 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2010) - Odense, Denmark (2010.08.9-2010.08.11)] 2010 International Conference on Advances

Community Comparison in Communication Networks

Belkacem Serrour and Hamamache KheddouciUniversite de Lyon, Laboratoire LIESP, Bat. Nautibus,

43, Bd. du 11 novembre 1918, 69622 Villeurbanne Cedex, France{belkacem.serrour,hamamache.kheddouci}@univ-lyon1.fr

Abstract—If we draw the virtual topology representing thecommunication in networks, we observe that the structure issimilar to those of the social networks. Social networks arethese networks with the characteristic relating densely someentities than others. These dense zones are called communities.Generally, the members of a same community share the sameinterest. In this work, we look for which virtual topology (calledcommunication graph) emerged in communication networkgives communities closer to the real one. Three differentcommunication graphs are generated and compared with thegraph representing the real communities, the reference graph.Microscopic and macroscopic comparisons are done.

Keywords-Community detection, community comparison,communication networks.

I. INTRODUCTION

Many of the communication network protocols makeappeared a social network behavior [1][2]. If we draw thevirtual topology representing the communication in network,we obtain a graph structure similar to the social networksstructure. Social networks are these networks characterizedby some denser zones called communities. Let CG = (V,E)be a communication graph (links between communicatingpeers), where V represents the peers of the network and eachedge of E between two peers represents the communicationbetween them. In this work, we look for which kind ofcommunication graph (so, which network) is closer to thereal one (reference graph). The reference graph is the socialnetwork extracted from the network. It represents the realcommunities of interest issued in a given network. The aim isto find which communication graph is close to that referenceone. We model the networks by graphs, we extract fromthem the reference graphs, we generate from the graphsseveral communication graphs based on the communicationprotocol taken, we use one community detection algorithmto detect communities in each graph and finally, we comparethe communities of each communication graph with thecommunities of the reference graph.

II. COMMUNICATION GRAPHS AND COMMUNITYCOMPARISON

Our study starts with a network. We generate the networksby varying some parameters. The taken parameters are: nnetwork size (number of peers in the network), e networkdensity (number of links between nodes), s number ofservers in the network, r number of resources on servers and

i is the interest of node (number of resources desired by apeer). We extract from the networks some graphs. The firstgraph is the profile graph (called also reference graph). Thenodes of the reference graph are all nodes of the network andan edge is created between two nodes if the similarity be-tween the interest of these two nodes is larger than a definedthreshold. The similarity between two nodes is calculatedusing the Jaccard’s index. The reference graph represents thereal connections between similar nodes in the network. So,the extracted communities in this graph are the real existingcommunities of the network. The aim is to look for whichcommunication graph giving communities closer to thecommunities given by the reference graph. In this purpose,we extract three different communication graphs from thenetwork: simple, weighted and completed communicationgraph. For each graph (reference graph and communicationgraphs), we apply a community detection algorithm to detectcommunities and we extract some community properties.Using these properties, we do microscopic and macroscopiccomparison between communities of the reference graph andcommunities of the communication graphs.

A. Communication graphs

The communication graph is the one representing therelationship between nodes communicating more. The edgesof this graph have as signification, at least, one communi-cation between two nodes. We mean by communication anexchange of resources (peer v1 ask v2 for a resource and v2

replays to v1 by sending the asked resource).1) Simple communication graph: Let SCG = (Vs, Es)

be an unweighted undirected graph representing the simplecommunication graph. Vs represents the node set of thenetwork. Es is the edge set of the simple graph. If two nodesin the network communicate, we make an edge betweenthese two nodes.

2) Weighted communication graph: Let WCG =(Vw, Ew) be a weighted undirected graph representing theweighted communication graph. The weighted graph issimilar to the simple graph with the difference that we countthe number of communications done between nodes.

3) Completed communication graph: Let CCG =(Vc, Ec) be an unweighted undirected graph representingthe completed communication graph. The completed graphis the simple graph for which we add some hidden links.

2010 International Conference on Advances in Social Networks Analysis and Mining

978-0-7695-4138-9/10 $26.00 © 2010 IEEE

DOI 10.1109/ASONAM.2010.37

393

2010 International Conference on Advances in Social Networks Analysis and Mining

978-0-7695-4138-9/10 $26.00 © 2010 IEEE

DOI 10.1109/ASONAM.2010.37

393

2010 International Conference on Advances in Social Networks Analysis and Mining

978-0-7695-4138-9/10 $26.00 © 2010 IEEE

DOI 10.1109/ASONAM.2010.37

393

Page 2: [IEEE 2010 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2010) - Odense, Denmark (2010.08.9-2010.08.11)] 2010 International Conference on Advances

(a) (b)

(c) (d)

Figure 1. Community structure comparison. (a) the clustering coefficient ofthe reference graph and the three communication graphs (simple, weightedand completed). (b) community comparison using the normalized mutualinformation (NMI). (c) density of the formed communities. (d) quality ofthe communities, measured by the modularity.

The hidden links are added between nodes having a closeinterest.

III. SIMULATIONS AND COMPARISONS

We use NS2 (Network Simulation 2) to generate allgraphs and to detect communities. We use the fast algorithmof Newman [4] to detect communities. A measure of thepartition called the modularity is done. The modularityis a measure that qualifies the partitions. Once with thecommunities, we extract some properties to analyze them.The extracted community properties are: number of resultingcommunities, size of the communities (number of nodes ineach community), modularity of the partition, density of thecommunities (proportion of edges in each community) andthe clustering coefficient of the emerged communities. Theseproperties can be ranked in two classes: microscopic andmacroscopic properties.

A. Macroscopic comparison

The macroscopic analysis is done by looking first atthe clustering coefficient. Figure 1(a) shows the cluster-ing coefficient of the four graphs. The largest coefficientclustering is the reference graph one, meaning that socialbehavior exists and appears in networks. Figure 1(a) showsalso that the closest graph to the referential is the com-pleted communication graph. The hidden edges added tothe completed graph make it closer to the reference one. Wealso remark that the weighted graph is the farthest to thereferential. In the weighted graph, edges with low weight areremoved, thus affecting the number of triangles and so theclustering coefficient will be lower. The second macroscopiccomparison is done by using a measure to compare twopartitions. Authors in [3] define a measure to compare twopartitions. The defined measure is the normalized mutualinformation (nmi). We use the nmi measure to compare the

communities of the reference graph with the communitiesof the three communication graphs. Figure 1(b) shows thatboth, simple and completed graphs, are the closest to thereferential.

B. Microscopic comparison

We take the two properties, community size and thedensity, to do this comparison. Figure 1(c) shows the averagedensity of the graphs. The density of the reference graph isthe largest one (see Figure 1(c), red bloc). The referencegraph is constructed by looking the interest of nodes. So,the formed community of this graph will have the maximumnumber of possible edges relating the nodes in communities(all real connections between nodes having closer interest).We remark that the completed communication graph isthe one that have the density closer than the density ofthe reference graph. Completed communication graph isconstructed by adding some hidden edges. It’s why it theclosest one. Figure 1(c) shows also that no communicationgraph has the same density like the reference graph. Thatmeans, the three communication graphs don’t discover allthe real interest relations between nodes.

IV. CONCLUSION

In this paper we have showed the emergence of commu-nities of interest in communication networks. Three differ-ent communication graphs are studied and compared withthe referential (the graph representing the real connectionsof interest in the network). The results showed that thecommunication graphs have the community structures. Thatmeans, groups of peers in networks share the same interestand, usually, communicate together. The results showedalso that both the simple and the completed graphs arecloser to the real one, owing to the fact that the com-munities detected in these graphs are very closer to thereals. However, the microscopic comparison showed thatno communication graph discover all the real connectionsexisting in the network. As perspective, we intend to applyour study to an existing protocol network and analyze itsemerged community structures.

REFERENCES

[1] F. Bloch and B. Dutta. Communication networks with endoge-nous link strength. Games & Eco. Behav., 66(1):39–56, 2009.

[2] V. Carchiolo, M. Malgeri, G. Mangioni, and V. Nicosia.Emerging structures of p2p networks induced by social rela-tionships. Comput. Commun., 31(3):620–628, 2008.

[3] L. I. Kuncheva and S. T. Hadjitodorov. Using diversity incluster ensembles. In IEEE inter. conf. on systems, man &cybernetics (SMC), pages 1214–1219, 2004.

[4] M. E. J. Newman. Fast algorithm for detecting communitystructure in networks. Physical Review E, 69:066133, 2004.

394394394