CS109/Stat121/AC209/E-109 Data Science: Network Models
CS109/Stat121/AC209/E-109 Data Science: Network Models
Data Science
Network Models
Hanspeter Pfister & Joe Blitzstein
[email protected] / [email protected]
5 2
4 3
This Week
HW4 due tonight at 11:59 pm
FIG. 2 Three examples of the kinds of networks that are the topic of this review. (a) A food web of predator-prey interactions
between species in a freshwater lake [272]. Picture courtesy of Neo Martinez and Richard Williams. (b) The network of
collaborations between scientists at a private research institution [171]. (c) A network of sexual contacts between individuals
in the study by Potterat et al. [342].
Graphs
A graph G=(V,E) consists of a vertex set V and an
edge set E containing unordered pairs {i,j} of vertices.
10
9
16 1
13 3
4
5
1
2 3
15 8 14
7
12 4
2 11
6
graph multigraph
The degree of vertex v is the number of
edges attached to it.
A Plea for Clarity: What is a Network?
graph vs. multigraph (are loops, multiple edges ok?
What is a simple graph?)
directed vs. undirected
weighted vs. unweighted
dynamics of vs. dynamics on
labeled vs. unlabeled
network as quantity of interest vs. quantities of
interest on networks
Why model networks?
3 8
4 2 9 5 7
1 6
n = 9, d = (3, 4, 3, 3, 4, 3, 3, 3, 2)
1 2 1 2
Switchings Chain
3 4 3 4
Power Laws
Power-law (a.k.a. scale-free) networks: the number of
vertices of degree k is proportional to k-
Let P (0, 0) be the probability for the absence of an edge between i and
probability of i linking to j (1 indicates the outgoing node of the edg
probability of i linking to j and j linking to i. The p1 model posits the follow
(see [149]):
Erdos
Dyad
Indep.
ERGM
Fixed
degree
Geom
Latent Space Models
4
2
5
3
6
5
2
6 6
7
4
3 4
2 2
5 4
3
Closeness
uses the reciprocal of the average
shortest distance to other nodes
0.4
0.46
0.53 0.39
0.51 0.54
0.54 0.43
0.53
0.53 0.56
0.45
0.5
0.4 0.46
0.46
Betweenness
many variations:
shortest paths
vs. flow
maximization
vs. all paths vs.
random paths
Eigenvector Centrality
use eigenvector of A corresponding to the largest
eigenvalue (Bonacich); more generally, power centrality