research-article

Permanence and Community Structure in Complex Networks

Authors:
Tanmoy Chakraborty

Indian Institute of Technology, Kharagpur, India

Indian Institute of Technology, Kharagpur, India
View Profile

,
Sriram Srinivasan

University of Nebraska at Omaha, Omaha, NE

University of Nebraska at Omaha, Omaha, NE
View Profile

,
Niloy Ganguly

Indian Institute of Technology, Kharagpur, India

Indian Institute of Technology, Kharagpur, India
View Profile

,
Animesh Mukherjee

Indian Institute of Technology, Kharagpur, India

Indian Institute of Technology, Kharagpur, India
View Profile

,
Sanjukta Bhowmick

University of Nebraska at Omaha, Omaha, NE

University of Nebraska at Omaha, Omaha, NE
View Profile

ACM Transactions on Knowledge Discovery from Data Volume 11 Issue 2Article No.: 14pp 1–34https://doi.org/10.1145/2953883

Published:15 November 2016Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

The goal of community detection algorithms is to identify densely connected units within large networks. An implicit assumption is that all the constituent nodes belong equally to their associated community. However, some nodes are more important in the community than others. To date, efforts have been primarily made to identify communities as a whole, rather than understanding to what extent an individual node belongs to its community. Therefore, most metrics for evaluating communities, for example modularity, are global. These metrics produce a score for each community, not for each individual node. In this article, we argue that the belongingness of nodes in a community is not uniform. We quantify the degree of belongingness of a vertex within a community by a new vertex-based metric called permanence.

The central idea of permanence is based on the observation that the strength of membership of a vertex to a community depends upon two factors (i) the extent of connections of the vertex within its community versus outside its community, and (ii) how tightly the vertex is connected internally. We present the formulation of permanence based on these two quantities. We demonstrate that compared to other existing metrics (such as modularity, conductance, and cut-ratio), the change in permanence is more commensurate to the level of perturbation in ground-truth communities. We discuss how permanence can help us understand and utilize the structure and evolution of communities by demonstrating that it can be used to -- (i) measure the persistence of a vertex in a community, (ii) design strategies to strengthen the community structure, (iii) explore the core-periphery structure within a community, and (iv) select suitable initiators for message spreading.

We further show that permanence is an excellent metric for identifying communities. We demonstrate that the process of maximizing permanence (abbreviated as MaxPerm) produces meaningful communities that concur with the ground-truth community structure of the networks more accurately than eight other popular community detection algorithms. Finally, we provide mathematical proofs to demonstrate the correctness of finding communities by maximizing permanence. In particular, we show that the communities obtained by this method are (i) less affected by the changes in vertex ordering, and (ii) more resilient to resolution limit, degeneracy of solutions, and asymptotic growth of values.

References

Yong-Yeol Ahn, James P. Bagrow, and Sune Lehmann. 2010. Link communities reveal multiscale complexity in networks. Nature 466, (August 2010), 761--764.Google Scholar
A. Arenas, A. Fernández, and S. Gómez. 2008. Analysis of the structure of complex networks at different resolution levels. New Journal of Physics 10, 5 (2008), 053039.Google ScholarCross Ref
David A. Bader, Henning Meyerhenke, Peter Sanders, and Dorothea Wagner (Eds.). 2013. Graph partitioning and graph clustering. In Proceedings of the10th DIMACS Implementation Challenge Workshop. Contemporary Mathematics, vol. 588. American Mathematical Society.Google ScholarCross Ref
Jeffrey Baumes, Mark Goldberg, and Malik Magdon-Ismail. 2005. Efficient identification of overlapping communities. In Proceedings of the 2005 IEEE International Conference on Intelligence and Security Informatics (ISI’05). Springer-Verlag, Berlin, 27--36. Google ScholarDigital Library
Jonathan W. Berry, Bruce Hendrickson, Randall A. LaViolette, and Cynthia A. Phillips. 2011. Tolerating the community detection resolution limit with edge weighting. Physical Review E 83, 5 (May 2011), 056119.Google ScholarCross Ref
Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre. 2008. Fast unfolding of communities in large networks. Journal of Statistical Mechanics 2008 (2008), P10008.Google ScholarCross Ref
Tanmoy Chakrabort, Sandipan Sikdar, Vihar Tammana, Niloy Ganguly, and Animesh Mukherjee. 2013. Computer science fields as ground-truth communities: Their impact, rise and fall. In Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’13). ACM, New York, NY, 426--433. Google ScholarDigital Library
Tanmoy Chakraborty. 2015. Leveraging disjoint communities for detecting overlapping community structure. Journal of Statistical Mechanics: Theory and Experiment 2015, 5 (2015), P05017.Google ScholarCross Ref
Tanmoy Chakraborty, Ayushi Dalmia, Animesh Mukherjee, and Niloy Ganguly. 2016a. Metrics for community analysis: A survey. CoRR abs/1604.03512 (2016).Google Scholar
Tanmoy Chakraborty, Suhansanu Kumar, Niloy Ganguly, Animesh Mukherjee, and Sanjukta Bhowmick. 2016b. GenPerm: A unified method for detecting non-overlapping and overlapping communities. CoRR abs/1604.03454 (2016).Google Scholar
Tanmoy Chakraborty, Sriram Srinivasan, Niloy Ganguly, Sanjukta Bhowmick, and Animesh Mukherjee. 2013. Constant communities in complex networks. Scientific Reports 3, (May 2013). DOI:10.1038/srep01825Google Scholar
Tanmoy Chakraborty, Sriram Srinivasan, Niloy Ganguly, Animesh Mukherjee, and Sanjukta Bhowmick. 2014. On the permanence of vertices in network communities. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’14). ACM, New York, NY, 1396--1405. DOI:http://dx.doi.org/10.1145/2623330.2623707 Google ScholarDigital Library
Mingming Chen, Tommy Nguyen, and Boleslaw Szymanski. 2013. A new metric for quality of network community structure. ASE Human Journal 1, 4 (2013), 226--240.Google Scholar
Flavio Chierichetti, Silvio Lattanzi, and Alessandro Panconesi. 2010. Rumour spreading and graph conductance. In Proceedings of the 21st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA). SIAM, 1657--1663. Google ScholarDigital Library
Aaron Clauset, M. E. J. Newman, and Cristopher Moore. 2004. Finding community structure in very large networks. Physical Review E 70, 6 (2004), 066111.Google ScholarCross Ref
L. Danon, A. Diaz-Guilera, J. Duch, and A. Arenas. 2005. Comparing community structure identification. Journal of Statistical Mechanics: Theory and Experiment 9, (2005), P09008.Google Scholar
Pasquale De Meo, Emilio Ferrara, Giacomo Fiumara, and Alessandro Provetti. 2013. Enhancing community detection using a network weighting strategy. Journal of Information Science 222, (Feb. 2013), 648--668. Google ScholarDigital Library
J.-C. Delvenne, S. N. Yaliraki, and M. Barahona. 2010. Stability of graph communities across time scales. Proceedings of the National Academy of Sciences 107, 29 (2010), 12755--12760.Google ScholarCross Ref
Alan Demers, Dan Greene, Carl Hauser, Wes Irish, John Larson, Scott Shenker, Howard Sturgis, Dan Swinehart, and Doug Terry. 1987. Epidemic algorithms for replicated database maintenance. In Proceedings of the 6th Annual ACM Symposium on Principles of Distributed Computing (PODC). New York, 1--12. Google ScholarDigital Library
T. S. Evans and R. Lambiotte. 2009. Line graphs, link partitions, and overlapping communities. Physical Review E 80, 1 (July 2009), 016105.Google ScholarCross Ref
Illés Farkas, Dániel Ábel, Gergely Palla, and Tamás Vicsek. 2007. Weighted network modules. New Journal of Physics 9, 6 (2007), 180.Google ScholarCross Ref
Santo Fortunato. 2010. Community detection in graphs. Physics Reports 486, 3--5 (2010), 75--174.Google ScholarCross Ref
Santo Fortunato and M. Barthelemy. 2007. Resolution limit in community detection. Proceedings of the National Acadamy of Sciences of the United States of America (PNAS) 104, 1 (2007), 36--41.Google ScholarCross Ref
Saptarshi Ghosh, Avishek Banerjee, Naveen Sharma, Sanket Agarwal, and Niloy Ganguly. 2011. Statistical analysis of the indian railway network: A complex network approach. Acta Physica Polonica B Proceedings Supplement 4, (March 2011), 123--137.Google ScholarCross Ref
M. Girvan and M. E. Newman. 2002. Community structure in social and biological networks. Proceedings of the National Academy of Sciences 99, 12 (June 2002), 7821--7826.Google ScholarCross Ref
B. H. Good, Y. A. De Montjoye, and A. Clauset. 2010. Performance of modularity maximization in practical contexts. Physical Review E 81, 4 (2010), 046106.Google ScholarCross Ref
Roger Guimera and Luis A. Nunes Amaral. 2005. Functional cartography of complex metabolic networks. Nature 433, 7028 (Feb. 2005), 895--900.Google ScholarCross Ref
Dongxiao He, Dayou Liu, Weixiong Zhang, Di Jin, and Bo Yang. 2013. Discovering link communities in complex networks by exploiting link dynamics. CoRR abs/1303.4699 (2013).Google Scholar
Paul W. Holland and Samuel Leinhardt. 1971. Transitivity in structural models of small groups. Small Group Research 2, 2 (1971), 107--124.Google Scholar
L. Hubert and P. Arabie. 1985. Comparing partitions. Journal of Classification 2, 1 (1985), 193--218.Google ScholarCross Ref
R. Kannan, S. Vempala, and A. Veta. 2000. On clusterings-good, bad and spectral. In Proceedings of the 41st Annual Symposium on Foundations of Computer Science (FOCS’00). IEEE Computer Society, Washington, DC, 367. Google ScholarDigital Library
David Kempe, Jon Kleinberg, and Éva Tardos. 2003. Maximizing the spread of influence through a social network. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’03). ACM, New York, NY, 137--146. DOI:http://dx.doi.org/10.1145/956750.956769 Google ScholarDigital Library
Renaud Lambiotte. 2010. Multi-scale modularity in complex networks. In Proceedings of the 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt’10). IEEE, 546--553.Google Scholar
Andrea Lancichinetti and Santo Fortunato. 2009. Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Physical Review E 80, 1 (July 2009), 016118.Google ScholarCross Ref
Andrea Lancichinetti and Santo Fortunato. 2011. Limits of modularity maximization in community detection. Physical Review E 84, (2011), 066122.Google ScholarCross Ref
Andrea Lancichinetti and Santo Fortunato. 2012. Consensus clustering in complex networks. Scientific Reports 2 (2012). DOI:10.1038/srep00336Google Scholar
Andrea Lancichinetti, Santo Fortunato, and János Kertész. 2009. Detecting the overlapping and hierarchical community structure in complex networks. New Journal of Physics 11, 3 (2009), 033015.Google ScholarCross Ref
Andrea Lancichinetti, Filippo Radicchi, Jose J. Ramasco, and Santo Fortunato. 2010. Finding statistically significant communities in networks. CoRR abs/1012.2363 (2010).Google Scholar
E. A. Leicht and M. E. J. Newman. 2008. Community structure in directed networks. Physical Review Letters 100, 11 (March 2008), 118703.Google ScholarCross Ref
Jure Leskovec, Kevin J. Lang, Anirban Dasgupta, and Michael W. Mahoney. 2009. Community structure in large networks: Natural cluster sizes and the absence of large well-defined clusters. Internet Mathematics 6, 1 (2009), 29--123.Google ScholarCross Ref
Jure Leskovec, Kevin J. Lang, and Michael Mahoney. 2010. Empirical comparison of algorithms for network community detection. In Proceedings of the 19th International Conference on World Wide Web (WWW’10). ACM, New York, NY, 631--640. Google ScholarDigital Library
Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press, New York, NY. Google Scholar
M. E. Newman. 2006. Modularity and community structure in networks. Proceedings of the National Academy of Sciences 103, 23 (June 2006), 8577--8582.Google ScholarCross Ref
M. E. J. Newman. 2003. Mixing patterns in networks. Physical Review E 67, 2 (Feb. 2003), 026126. DOI:http://dx.doi.org/10.1103/PhysRevE.67.026126Google ScholarCross Ref
M. E. J. Newman. 2004a. Analysis of weighted networks. Physical Review E 70, 5 (Nov. 2004), 056131.Google ScholarCross Ref
M. E. J. Newman. 2004b. Fast algorithm for detecting community structure in networks. Physical Review E 69, 6 (June 2004), 066133.Google Scholar
M. E. J. Newman. 2013. Community detection and graph partitioning. CoRR abs/1305.4974 (2013).Google Scholar
M. E. J. Newman and M. Girvan. 2004. Finding and evaluating community structure in networks. Physical Review E 69, (2004) 026113.Google Scholar
Günce Keziban Orman, Vincent Labatut, and Hocine Cherifi. 2012. Comparative evaluation of community detection algorithms: A topological approach. CoRR abs/1206.4987 (2012).Google Scholar
Gergely Palla, Imre Dernyi, Ills Farkas, and Tams Vicsek. 2005. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 7043 (June 2005), 814--818.Google ScholarCross Ref
Pascal Pons and Matthieu Latapy. 2006. Computing communities in large networks using random walks. Journal of Graph Algortihms and Applications 10, 2 (2006), 191--218.Google ScholarCross Ref
Ioannis Psorakis, Stephen Roberts, Mark Ebden, and Ben Sheldon. 2011. Overlapping community detection using Bayesian non-negative matrix factorization. Physical Review E 83, 6 (June 2011), 066114.Google ScholarCross Ref
Usha N. Raghavan, Réka Albert, and Soundar Kumara. 2007. Near linear time algorithm to detect community structures in large-scale networks. Physical Review E 76, 3 (Sep. 2007), 036106.Google ScholarCross Ref
J. Reichardt and S. Bornholdt. 2006. Statistical mechanics of community detection. Phys. Rev. E 74, 1 (2006), 016110.Google ScholarCross Ref
Thomas Richardson, Peter J. Mucha, and Mason A. Porter. 2009. Spectral tripartitioning of networks. Physical Review E 40, (2009), 027104.Google Scholar
Jason Riedy, David A. Bader, Karl Jiang, Pushkar Pande, and Richa Sharma. 2011. Detecting Communities from Given Seeds in Social Networks. Technical Report GT-CSE-11-01. Georgia Institute of Technology.Google Scholar
M. Rosvall and C. T. Bergstrom. 2007. An information-theoretic framework for resolving community structure in complex networks. Proceedings of the National Academy of Sciences 104, 18 (2007), 7327.Google ScholarCross Ref
Martin Rosvall and Carl T. Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences 105, 4 (2008), 1118--1123.Google ScholarCross Ref
Massoud Seifi, Ivan Junier, Jean-Baptiste Rouquier, Svilen Iskrov, and Jean-Loup Guillaume. 2013. Stable community cores in complex networks. In Complex Networks, Ronaldo Menezes, Alexandre Evsukoff, and Marta C. Gonzlez (Eds.), Studies in Computational Intelligence, vol. 424. Springer, Berlin, 87--98. DOI:http://dx.doi.org/10.1007/978-3-642-30287-9_10Google Scholar
Jianbo Shi and Jitendra Malik. 2000. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 8 (Aug. 2000), 888--905. DOI:http://dx.doi.org/10.1109/34.868688 Google ScholarDigital Library
Peng-Gang Sun, Lin Gao, and Shan Shan Han. 2011. Identification of overlapping and non-overlapping community structure by fuzzy clustering in complex networks. Information Sciences 181, 6 (2011), 1060--1071. Google ScholarDigital Library
Jierui Xie and Boleslaw K. Szymanski. 2011. Community detection using a neighborhood strength driven label propagation algorithm. CoRR abs/1105.3264 (2011). Google ScholarDigital Library
Jierui Xie and Boleslaw K. Szymanski. 2012. Towards linear time overlapping community detection in social networks. CoRR abs/1202.2465 (2012). Google ScholarDigital Library
Jaewon Yang and Jure Leskovec. 2012. Defining and evaluating network communities based on ground-truth. In Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics (MDS’12). ACM, New York, NY, 3:1--3:8. Google ScholarDigital Library
Jaewon Yang and Jure Leskovec. 2014. Overlapping communities explain core-periphery organization of networks. Proceedings of IEEE 102, (2014), 1892--1902.Google ScholarCross Ref

Index Terms

Permanence and Community Structure in Complex Networks
1. Information systems
  1. Information systems applications
    1. Data mining
      1. Clustering
2. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory
      1. Graph algorithms

Recommendations

On the permanence of vertices in network communities
KDD '14: Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining

Despite the prevalence of community detection algorithms, relatively less work has been done on understanding whether a network is indeed modular and how resilient the community structure is under perturbations. To address this issue, we propose a new ...
Read More
On Diameter Based Community Structure Identification in Networks
ICDCN '17: Proceedings of the 18th International Conference on Distributed Computing and Networking

Several community detection algorithms for large scale networks have been reported in the literature so far. But to the best of our knowledge none of these algorithms consider the diameter of the community as a key parameter. In this paper, we propose a ...
Read More
Discovering community structure in Complex Network through Community Detection Approach
IMCOM '18: Proceedings of the 12th International Conference on Ubiquitous Information Management and Communication

Complex network analysis which can be represented as graph has gained much interest from researchers recently. Analysis derived from complex network leading to a discovery of important group or community lies within the network. It imposes a significant ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Knowledge Discovery from Data Volume 11, Issue 2
May 2017
419 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3017677
Editor:
Philip S. Yu
University of Illinois at Chicago, USA
Issue’s Table of Contents
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 15 November 2016
- Accepted: 1 June 2016
- Revised: 1 April 2016
- Received: 1 July 2015
Published in tkdd Volume 11, Issue 2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Permanence
community discovery
community evaluation metric
modularity
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 9
  Total Citations
  View Citations
- 514
  Total Downloads
- Downloads (Last 12 months)36
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Permanence and Community Structure in Complex Networks

ACM Transactions on Knowledge Discovery from Data

Abstract

References

Cited By

Index Terms

Recommendations

On the permanence of vertices in network communities

On Diameter Based Community Structure Identification in Networks

Discovering community structure in Complex Network through Community Detection Approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Permanence and Community Structure in Complex Networks

ACM Transactions on Knowledge Discovery from Data

Abstract

References

Cited By

Index Terms

Recommendations

On the permanence of vertices in network communities

On Diameter Based Community Structure Identification in Networks

Discovering community structure in Complex Network through Community Detection Approach

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media