DEMON - Overlapping Community Discovery.
Community discovery in complex networks is an interesting problem with a number of applications, especially in the knowledge extraction task in social and information networks. However, many large networks often lack a particular community organization at a global level. In these cases, traditional graph partitioning algorithms fail to let the latent knowledge embedded in modular structure emerge, because they impose a top-down global view of a network. We propose here a simple local-first approach to community discovery, able to unveil the modular organization of real complex networks. This is achieved by democratically letting each node vote for the communities it sees surrounding it in its limited view of the global system, i.e. its ego neighborhood, using a label propagation algorithm; finally, the local communities are merged into a global collection.
Note: Demon has been integrated within CDlib a python package dedicated to community detection algorithms, check it out!
If you use our algorithm please cite the following works:
Coscia, Michele; Rossetti, Giulio; Giannotti, Fosca; Pedreschi, Dino "Uncovering Hierarchical and Overlapping Communities with a Local-First Approach" ACM Transactions on Knowledge Discovery from Data (TKDD), 9 (1), 2014.
Coscia, Michele; Rossetti, Giulio; Giannotti, Fosca; Pedreschi, Dino "DEMON: a Local-First Discovery Method for Overlapping Communities" SIGKDD international conference on knowledge discovery and data mining, pp. 615-623, IEEE ACM, 2012, ISBN: 978-1-4503-1462-6.
In order to install the package just download (or clone) the current project and copy the demon folder in the root of your application.
Alternatively use pip:
pip install demon
conda install -c giuliorossetti demon
Demon is written in python and requires the following package to run:
The algorithm can be used as standalone program as well as integrated in python scripts.
python demon filename epsilon -c min_com_size
- filename: edgelist filename
- epsilon: merging threshold in [0,1]
- min_community_size: minimum size for communities (default 3 - optional)
Demon results will be saved on a text file.
Input file specs
Edgelist format: tab separated edgelist (nodes represented with integer ids).
As python library
Demon can be executed specifying as input:
- an edgelist file
import demon as d dm = d.Demon(network_filename="filename.tsc", epsilon=0.25, min_community_size=3, file_output="communities.txt") dm.execute()
- a networkx Graph object
import networkx as nx import demon as d g = nx.karate_club_graph() dm = d.Demon(graph=g, epsilon=0.25, min_community_size=3) coms = dm.execute()
The parameter file_output, if specified, allows to write on file the algorithm results. Conversely, the communities will be returned to the main program as a list of node ids tuple, e.g.,