Professional Documents
Culture Documents
2 Grouping
2 Grouping
Node grouping
Grau de Ciència de Dades | Escola Tècnica Superior d’Informàtica | Universitat Politècnica de València
Sources
●
Albert László Barabási: Network Science. Cambridge
University Press, 2016
– Follows almost section-by-section chapter 02
2/43
Contents
1. Connectedness
2. Clustering coefficient
3. Central sets
4. Bipartite graphs
5. Assortativity
6. Other structures and measures
7. A case study
B
B
A
A Largest Component:
Giant Component
C
D E C
D E
F
G
F The rest: Isolates
G
D E C
D C G
F
G
●
Clustering coefficient Ci is a property of a node i
●
Let Li represent the number of links among neighbors
of node i
●
Many natural processes of link formation encourage the
closing of “V”s into triangles
●
Example 1: you’re more likely to meet new friends
through common friends
●
Example 2: you’re more likely to follow an account u
if you see content posted by u and re-posted by an
account v that you already follow
●
This fact will promote the existence of other measures
related to clusters
Central sets
Words of caution
VL VR
Examples:
Y.-Y. Ahn, S. E. Ahnert, J. P. Bagrow, A.-L. Barabási Flavor network and the principles
of food pairing , Scientific Reports 196, (2011).
TRIPARTITE NETWORK
29/43
Assortativity
Newman, Mark E. J.; Networks: an introduction; Oxford University Press (2010)
δij=1 if i=j
δij=0 otherwise
where
Basic statistics for a number of networks
•
Nodes tend to group in small groups of low-degree or
high-degree
•
Multiedge featured
• Tecnological, information and biological networks tend to a
negative r
•
The number of edges that fall between high-degree nodes
is small
•
Single-edge featured
A case study:
A. Degree distribution: pk
B. Average path length: <d>
C. Clustering coefficient:
GENOME
protein-gene
interactions
PROTEOME
protein-protein
interactions
METABOLISM
Bio-chemical
reactions
Citrate Cycle
A CASE STUDY: PROTEIN-PROTEIN INTERACTION NETWORK
●
Undirected network
●
N=2,018 proteins as nodes
L=2,930 binding interactions
●
Average degree <k>=2.90
●
Not connected:
185 components
●
the largest (giant component)
1,647 nodes
A CASE STUDY: PROTEIN-PROTEIN INTERACTION NETWORK
pk = N k / N
A CASE STUDY: PROTEIN-PROTEIN INTERACTION NETWORK
dmax=14
<d>=5.61
A CASE STUDY: PROTEIN-PROTEIN INTERACTION NETWORK
<C>=0.12