Professional Documents
Culture Documents
Mitx: Statistics, Computation & Applications: Criminal Networks Module Lecture 1: Introduction To Networks
Mitx: Statistics, Computation & Applications: Criminal Networks Module Lecture 1: Introduction To Networks
Mitx: Statistics, Computation & Applications: Criminal Networks Module Lecture 1: Introduction To Networks
Network research:
Grew out of graph theory
e.g. Euler’s celebrated 1735 solution of the Königsberg bridge problem
How does the adjacency matrix of a simple graph look like? How to
suggest new friends in a social network? And what about an acyclic graph?
Network research:
Grew out of graph theory
e.g. Euler’s celebrated 1735 solution of the Königsberg bridge problem
How does the adjacency matrix of a simple graph look like? How to
suggest new friends in a social network? And what about an acyclic graph?
connected components
edge density
degree distribution
clustering
1e+06
Number of components
1e+02 1e+04
1e+00
Figure 3. Component size distribution. The fraction of components with a given component size
Component size distribution in the 2011 Facebook network on a log-log
on a log-log scale. Most vertices (99.91%) are in the largest component.
scale. Most vertices (99.91%) are in the largest component.
Caroline[48].
algorithm UhlerWe
(MIT)
find that theMITx: Statistics,
average distanceComputation
between&pairs
Applications Lecture
of users was 4.7 for Facebook 4 and
users 8 / 19
Degree distribution
Degree of node i: ki
1 P P
i,j Aij 2m
Average degree: n i ki = n = n , where |V | = n, |E | = m
Degree of node i: ki
1 P P
i,j Aij 2m
Average degree: n i ki = n = n , where |V | = n, |E | = m
tail of distribution is fat, i.e., there are many nodes with high degrees
appears linear on a log-log plot
appear in wide variety of settings including WWW, Internet
1e−01
Fraction
1e−05
Global Glo
1e−07
U.S. U.S
1 5 50 500 5000 1 5
Degree
(a)
From “The Anatomy of the Facebook Social Graph” by Ugander et al. (2011)
Figure
Caroline Uhler (MIT) 1.MITx:
Degree
Statistics,distribution p . (a) The fractionLecture
Computation & Applications of users
4 with
11 / 19de
Edge density
The average path length is the average distance between any two
nodes in the network:
1 X
average path length = n dij
2 i≤j
The average path length is the average distance between any two
nodes in the network:
1 X
average path length = n dij
2 i≤j
If network is not connected, one often computes the diameter and the
average path length in the largest component.
The average path length is the average distance between any two
nodes in the network:
1 X
average path length = n dij
2 i≤j
If network is not connected, one often computes the diameter and the
average path length in the largest component.
0 2 4 6 8 10
hop distance
From “The Anatomy of the Facebook Social Graph” by Ugander et al. (2011)
e 2. Diameter. The neighborhood function N (h) showing the percentage of user pairs tha
h hops of each other. The average distance between users on Facebook in May 2011 was 4
Caroline Uhler (MIT) MITx: Statistics, Computation & Applications Lecture 4 15 / 19
Clustering
In social networks: It is often the case that two nodes who share a
common friend are also friends
number of triangles
Triangle density:
(n3)
From “The Anatomy of the Facebook Social Graph” by Ugander et al. (2011)
Caroline Uhler (MIT) MITx: Statistics, Computation & Applications Lecture 4 17 / 19
Homophily
1 P ki kj
Modularity: 2m i,j Aij − 2m δ(ti , tj ) ∈ [−1, 1]