Content deleted Content added
GlennLawyer (talk | contribs) m corrected definition of betweenness centrality |
m Moving Category:Algorithms in graph theory to Category:Graph algorithms per Wikipedia:Categories for discussion/Log/2024 October 4 |
||
(13 intermediate revisions by 13 users not shown) | |||
Line 1:
{{short description|Community detection algorithm}}
The '''Girvan–Newman algorithm''' (named after [[Michelle Girvan]] and [[Mark Newman]]) is a hierarchical method used to detect [[Community structure|communities]] in [[complex system]]s.<ref name=newman>Girvan M. and Newman M. E. J., [
== Edge betweenness and community structure ==
The Girvan–Newman algorithm detects communities by progressively removing edges from the original network. The connected components of the remaining network are the communities. Instead of trying to construct a measure that tells us which edges are the most central to communities, the Girvan–Newman algorithm focuses on edges that are most likely "between" communities.
[[Betweenness centrality|Vertex betweenness]] is an indicator of highly [[centrality|central]] nodes in networks. For any node <math>i</math>, vertex betweenness is defined as the
The Girvan–Newman algorithm extends this definition to the case of edges, defining the "edge betweenness" of an edge as the number of shortest paths between pairs of nodes that run along it. If there is more than one shortest path between a pair of nodes, each path is assigned equal weight such that the total weight of all of the paths is equal to unity. If a network contains communities or groups that are only loosely connected by a few inter-group edges, then all shortest paths between different communities must go along one of these few edges. Thus, the edges connecting communities will have high edge betweenness (at least '''one''' of them). By removing these edges, the groups are separated from one another and so the underlying community structure of the network is revealed.
Line 11 ⟶ 12:
# The betweenness of all existing edges in the network is calculated first.
# The edge(s) with the highest betweenness
# The betweenness of all edges affected by the removal is recalculated.
# Steps 2 and 3 are repeated until no edges remain.
Line 20 ⟶ 21:
== See also ==
* [[Closeness (mathematics)|Closeness]]
* [[Hierarchical clustering]]
|