0% found this document useful (0 votes)

18 views47 pages

lecture13

The document discusses Minimum Spanning Trees (MST) and algorithms such as Prim's and Kruskal's for finding them in weighted graphs. It explains the concepts of spanning trees, their applications in network construction, clustering, and the Traveling Salesman Problem. Additionally, it covers the Union-Find data structure used for efficient cycle detection in Kruskal's algorithm.

Uploaded by

cglssc787

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views47 pages

lecture13

Uploaded by

cglssc787

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 47

Lecture 13:

Minimum Spanning Trees

Steven Skiena

Department of Computer Science

State University of New York
Stony Brook, NY 11794–4400

http://www.cs.stonybrook.edu/˜skiena
Topic: Problem of the Day
Problem of the Day
Your job is to arrange n rambunctious children in a straight
line, facing front. You are given a list of m statements of
the form “i hates j”. If i hates j, then you do not want put
i somewhere behind j, because then i is capable of throwing
something at j.
1. Give an algorithm that orders the line, (or says that it is
not possible) in O(m + n) time.
2. Suppose instead you want to arrange the children in
rows, such that if i hates j then i must be in a lower
numbered row than j. Give an efficient algorithm to find
the minimum number of rows needed, if it is possible.
Questions?
Topic: Minimum Spanning Trees
Weighted Graph Algorithms
Beyond DFS/BFS exists an alternate universe of algorithms
for edge-weighted graphs.
Our adjacency list representation quietly supported these
graphs:
typedef struct edgenode {
int y; /* adjacency info */
int weight; /* edge weight, if any */
struct edgenode *next; /* next edge in list */
} edgenode;

typedef struct {
edgenode *edges[MAXV+1]; /* adjacency info */
int degree[MAXV+1]; /* outdegree of each vertex */
int nvertices; /* number of vertices in the graph */
int nedges; /* number of edges in the graph */
int directed; /* is the graph directed? */
} graph;
Minimum Spanning Trees
A tree is a connected graph with no cycles. A spanning tree is
a subgraph of G which has the same set of vertices of G and
is a tree.
A minimum spanning tree of a weighted graph G is the
spanning tree of G whose edges sum to minimum weight.
There can be more than one minimum spanning tree in a
graph → consider a graph with identical weight edges.
Find the Minimum Spanning Tree

(b) (c)
(a)
Why Minimum Spanning Trees?
The minimum spanning tree problem has a long history – the
first algorithm dates back to 1926!
MST is taught in algorithm courses because:
• It arises in many graph applications.
• It is problem where the greedy algorithm always gives the
optimal answer.
• Clever data structures are necessary to make it work.
Greedy algorithms make the decision of what next to do by
selecting the best local option from all available choices.
Applications of Minimum Spanning Trees
Minimum spanning trees are useful in constructing networks,
by describing the way to connect a set of sites using the
smallest total amount of wire.
Minimum spanning trees provide a reasonable way for
clustering points in space into natural groups.
What are natural clusters in the friendship graph?
Minimum Spanning Trees and Net Partitioning
One of the war stories in the text describes how to partition
a graph into compact subgraphs by deleting large edges from
the minimum spanning tree.

(a) (b) (c) (d)

Minimum Spanning Trees and TSP
For points in the Euclidean plane, MST yield a good heuristic
for the traveling salesman problem:
1

2 4
3
7
5
6
10 11
8 9

The optimum traveling salesman tour is at most twice the

length of the minimum spanning tree.
Questions?
Topic: Prim’s Algorithm
Prim’s Algorithm
Prim’s algorithm starts from one vertex and grows the rest of
the tree an edge at a time.
As a greedy algorithm, which edge should we pick?
The cheapest edge with which can grow the tree by one vertex
without creating a cycle.
Prim’s Algorithm in Action

5
7 6
2 5 1
4 2 4 2
9 3 3 3
5 7 4 7 1 2 6 5 4
A 12 A A
G Prim(G,A) Kruskal(G)

https://upload.wikimedia.org/wikipedia/
en/3/33/Prim-algorithm-animation-2.gif
Prim’s Algorithm (Pseudocode)
During execution each vertex v is either in the tree, fringe
(meaning there exists an edge from a tree vertex to v) or
unseen (meaning v is more than one edge away).
Prim-MST(G)
Select an arbitrary vertex s to start the tree from.
While (there are still non-tree vertices)
Pick min cost edge between tree/non-tree vertices
Add the selected edge and vertex to the tree Tprim.
This creates a spanning tree, since no cycle can be introduced.

But is it minimum?
Why is Prim Correct? (Proof by Contradiction)

• If Prim’s algorithm is not correct, these must be some

graph G where it does not give the minimum cost
spanning tree.
• If so, there must be a first edge (x, y) Prim adds, such that
the partial tree V 0 cannot be extended into a MST.

(a) (b)
The Contradiction

• But if (x, y) is not in M ST (G), then there must be a path

in M ST (G) from x to y, because the tree is connected.
• Let (v1, v2) be the other edge on this path with one end in
V 0.
• Replacing (v1, v2) with (x, y) we get a spanning tree. with
smaller weight, since W (v, w) > W (x, y). Thus you did
not have the MST!!
• If W (v, w) = W (x, y), then the tree is the same weight,
but we couldn’t have made a fatal mistake picking (x, y).
Thus Prim’s algorithm is correct!
How Fast is Prim’s Algorithm?
That depends on what data structures are used. In the simplest
implementation, we can simply mark each vertex as tree and
non-tree and search always from scratch:
Select an arbitrary vertex to start.
While (there are non-tree vertices)
select minimum weight edge between tree and fringe
add the selected edge and vertex to the tree
This can be done in O(nm) time, by doing a DFS or BFS to
loop through all edges, with a constant time test per edge, and
a total of n iterations.
Prim’s Implementation
To do it faster, we must identify fringe vertices and the
minimum cost edge associated with it fast.
int prim(graph *g, int start) {
int i; /* counter */
edgenode *p; /* temporary pointer */
bool intree[MAXV+1]; /* is the vertex in the tree yet? */
int distance[MAXV+1]; /* cost of adding to tree */
int v; /* current vertex to process */
int w; /* candidate next vertex */
int dist; /* cheapest cost to enlarge tree */
int weight = 0; /* tree weight */

for (i = 1; i <= g->nvertices; i++) {

intree[i] = false;
distance[i] = MAXINT;
parent[i] = -1;
}

distance[start] = 0;
v = start;

while (!intree[v]) {
intree[v] = true;
if (v != start) {
printf("edge (%d,%d) in tree \n",parent[v],v);
weight = weight + dist;
}
p = g->edges[v];
while (p != NULL) {
w = p->y;
if ((distance[w] > p->weight) && (!intree[w])) {
distance[w] = p->weight;
parent[w] = v;
}
p = p->next;
}

dist = MAXINT;
for (i = 1; i <= g->nvertices; i++) {
if ((!intree[i]) && (dist > distance[i])) {
dist = distance[i];
v = i;
}
}
}

return(weight);
}
Prim’s Analysis
Finding the minimum weight fringe-edge takes O(n) time,
because we iterate through the distance array to find the
minimum
After adding a vertex v to the tree, by running through its
adjacency list in O(n) time we check whether it provides a
cheaper way to connect its neighbors to the tree. If so, update
the distance value.
The total time is n × O(n) = O(n2).
Questions?
Topic: Kruskal’s Algorithm
Kruskal’s Algorithm
Since an easy lower bound argument shows that every edge
must be looked at to find the minimum spanning tree, and the
number of edges m = O(n2), Prim’s algorithm is optimal on
dense graphs.
The complexity of Prim’s algorithm is independent of the
number of edges. Kruskal’s algorithm is faster on sparse
graphs
Kruskal’s algorithm is also greedy. It repeatedly adds the
smallest edge to the spanning tree that does not create a cycle.
Kruskal’s Algorithm in Action

5
7 6
2 5 1
4 2 4 2
9 3 3 3
5 7 4 7 1 2 6 5 4
A 12 A A
G Prim(G,A) Kruskal(G)
Kruskal is Correct (Proof by Contradiction)
• If Kruskal’s algorithm is not correct, these must be
some graph G where it does not give the minimum cost
spanning tree.
• If so, there must be a first edge (x, y) Kruskal adds such
that the set of edges cannot be extended into a minimum
spanning tree.
• When we added (x, y) there no path between x and y, or
it would have created a cycle. Thus adding (x, y) to the
optimal tree it must create a cycle.
• But at least one edge in this cycle must have been added
after (x, y), so it must have heavier.
The Contradiction
Deleting this heavy edge leaves a better MST than the optimal
tree, yielding a contradiction!
x y x y

(a) (b)

Thus Kruskal’s algorithm is correct!

How fast is Kruskal’s algorithm?
What is the simplest implementation?
• Sort the m edges in O(m lg m) time.
• For each edge in order, test whether it creates a cycle the
forest we have thus far built – if so discard, else add to
forest. With a BFS/DFS, this can be done in O(n) time
(since the tree has at most n edges).
The total time is O(mn), but can we do better?
Fast Component Tests Give Fast MST
Kruskal’s algorithm builds up connected components. Any
edge where both vertices are in the same connected compo-
nent create a cycle. Thus if we can maintain which vertices
are in which component fast, we do not have test for cycles!
• Same component(v1, v2) – Do vertices v1 and v2 lie in the
same connected component of the current graph?
• Merge components(C1, C2) – Merge the given pair of
connected components into one component.
Fast Kruskal Implementation

Put the edges in a heap

count = 0
while (count < n − 1) do
get next edge (v, w)
if (component (v) 6= component(w))
add to T
component (v)=component(w)
If we can test components in O(log n), we can find the MST
in O(m log m)!
Question: Is O(m log n) better than O(m log m)?
Questions?
Topic: The Union-Find Data Structure
Union-Find Programs
We need a data structure for maintaining sets which can test
if two elements are in the same and merge two sets together.
These can be implemented by union and find operations,
where
• Find(i) – Return the label of the root of tree containing
element i, by walking up the parent pointers until there is
no where to go.
• Union(i,j) – Link the root of one of the trees (say
containing i) to the root of the tree containing the other
(say j) so f ind(i) now equals f ind(j).
Union-Find “Trees”
We are interested in minimizing the time it takes to execute
any sequence of unions and finds.
A simple implementation is to represent each set as a tree,
with pointers from a node to its parent. Each element is
contained in a node, and the name of the set is the key at
the root:
4

3 1 2 3 4 5 6 7
1 6 2 1 4 3 4 3 4 2
5
7
(l) (r)
Union-Find Data Structure
typedef struct {
int p[SET_SIZE+1]; /* parent element */
int size[SET_SIZE+1]; /* number of elements in subtree i */
int n; /* number of elements in set */
} union_find;

void union_find_init(union_find *s, int n) {

int i; /* counter */

for (i = 1; i <= n; i++) {

s->p[i] = i;
s->size[i] = 1;
}
s->n = n;
}
Worst Case for Union Find
In the worst case, these structures can be very unbalanced:
For i = 1 to n/2 do
Union(i,i+1)
For i = 1 to n/2 do
Find(1)
Who’s The Daddy?
We want the limit the height of our trees which are affected
by union’s.
When we union, we can make the tree with fewer nodes the
child.
UNION (s, t)
FIND(S)

s S

Since the number of nodes is related to the height, the height

of the final tree will increase only if both subtrees are of equal
height!
If U nion(t, v) attaches the root of v as a subtree of t iff the
number of nodes in t is greater than or equal to the number in
v, after any sequence of unions, any tree with h/4 nodes has
height at most blg hc.
Proof
By induction on the number of nodes k, k = 1 has height 0.
Let di be the height of the tree ti

T1 T2 d2

k2 nodes
k1 nodes

k = k1+ k2 nodes

d is the height

If (d1 > d2) then d = d1 ≤ blog k1c ≤ blg(k1 + k2)c = blog kc

If (d1 ≤ d2), then k1 ≥ k2.
d = d2 +1 ≤ blog k2c+1 = blog 2k2c ≤ blog(k1 +k2)c = log k
Can we do better?
We can do unions and finds in O(log n), good enough for
Kruskal’s algorithm. But can we do better?
The ideal Union-Find tree has depth 1:

... ...

N-1 leaves

On a find, if we are going down a path anyway, why not

change the pointers to point to the root?
1
10 1
2 14

3 13 FIND(4) 14
7 11 2 4 3 7 10 13
12
12
4 5 6 8 9 5 6 8 9 11

This path compression will let us do better than O(n log n)

for n union-finds.
O(n)? Not quite . . . Difficult analysis shows that it takes
O(nα(n)) time, where α(n) is the inverse Ackerman function
and α(number of atoms in the universe)= 5.
Same Component Test

bool same_component(union_find *s, int s1, int s2) {

return (find(s, s1) == find(s, s2));
}

int find(union_find *s, int x) {

if (s->p[x] == x) {
return(x);
}
return(find(s, s->p[x]));
}
Merge Components Operation

void union_sets(union_find *s, int s1, int s2) {

int r1, r2; /* roots of sets */

r1 = find(s, s1);
r2 = find(s, s2);

if (r1 == r2) {
return; /* already in same set */
}

if (s->size[r1] >= s->size[r2]) {

s->size[r1] = s->size[r1] + s->size[r2];
s->p[r2] = r1;
} else {
s->size[r2] = s->size[r1] + s->size[r2];
s->p[r1] = r2;
}
}
Questions?

Cole G. Botin
No ratings yet
Cole G. Botin
11 pages
Mth601 Final Term Solved Mcqs
No ratings yet
Mth601 Final Term Solved Mcqs
27 pages
Final Quiz 1
No ratings yet
Final Quiz 1
3 pages
Graphs MST
No ratings yet
Graphs MST
46 pages
Prim's and Kruskal's Algorithm
No ratings yet
Prim's and Kruskal's Algorithm
58 pages
Spanning Trees: Introduction To Algorithms
No ratings yet
Spanning Trees: Introduction To Algorithms
69 pages
08 Minumum Spanning Tree
No ratings yet
08 Minumum Spanning Tree
29 pages
Minimum Spanning Tree Tutorials & Notes - Algorithms - HackerEarth
No ratings yet
Minimum Spanning Tree Tutorials & Notes - Algorithms - HackerEarth
10 pages
Greedy MST
No ratings yet
Greedy MST
30 pages
Module 3 and 4 Notes
No ratings yet
Module 3 and 4 Notes
18 pages
Min Spanning Trees
No ratings yet
Min Spanning Trees
32 pages
4.5 Minimum Spanning Tree
No ratings yet
4.5 Minimum Spanning Tree
10 pages
Data Structures and Algorithms: Minimum Spanning Trees
No ratings yet
Data Structures and Algorithms: Minimum Spanning Trees
41 pages
M5_Chapter 9 1
No ratings yet
M5_Chapter 9 1
18 pages
3.5 Minimum cost spanning trees Kruskal and Prim’s algorithms
No ratings yet
3.5 Minimum cost spanning trees Kruskal and Prim’s algorithms
48 pages
Assignment2 Last
No ratings yet
Assignment2 Last
49 pages
MODULE4-Greedy_methods[1]
No ratings yet
MODULE4-Greedy_methods[1]
17 pages
Mit Quiz Solutions 2
No ratings yet
Mit Quiz Solutions 2
12 pages
19. W-11_L-1_Minimum Spanning Tree (MST), MST Kruskal’s Algorithm, MST Prim’s Algorithm.pptx
No ratings yet
19. W-11_L-1_Minimum Spanning Tree (MST), MST Kruskal’s Algorithm, MST Prim’s Algorithm.pptx
41 pages
Minimum Spanning Tree (Prim's and Kruskal's Algorithms)
No ratings yet
Minimum Spanning Tree (Prim's and Kruskal's Algorithms)
17 pages
CS124 Spring 2011
No ratings yet
CS124 Spring 2011
6 pages
Minimum Spanning Tree
No ratings yet
Minimum Spanning Tree
10 pages
Session+7+and+8
No ratings yet
Session+7+and+8
31 pages
Greedy Technique Definition:: On Each Step, The Choice Made Must Be
No ratings yet
Greedy Technique Definition:: On Each Step, The Choice Made Must Be
14 pages
Minimum Spanning Tree
No ratings yet
Minimum Spanning Tree
16 pages
5thAOAEXP
No ratings yet
5thAOAEXP
10 pages
UNIT 3.3 MST Spanning Tree
No ratings yet
UNIT 3.3 MST Spanning Tree
46 pages
Prims and Kruskal
No ratings yet
Prims and Kruskal
8 pages
dwvm unit3 (2)
No ratings yet
dwvm unit3 (2)
23 pages
13_MST
No ratings yet
13_MST
36 pages
18-Minimum Cost Spanning Tree_ Kruskal's Algorithm
No ratings yet
18-Minimum Cost Spanning Tree_ Kruskal's Algorithm
71 pages
Chapter11C
No ratings yet
Chapter11C
44 pages
Lecture_12.1
No ratings yet
Lecture_12.1
47 pages
Lecture 12.1
No ratings yet
Lecture 12.1
45 pages
UNIT 4
No ratings yet
UNIT 4
175 pages
LatexAssignment
No ratings yet
LatexAssignment
4 pages
Minimum Spanning trees.pptx
No ratings yet
Minimum Spanning trees.pptx
20 pages
(Les08) Minimalspanningtrees
No ratings yet
(Les08) Minimalspanningtrees
27 pages
Minimum Spanning Tree, Kruskal's and Prim's Algorithms, Applications in Networking
No ratings yet
Minimum Spanning Tree, Kruskal's and Prim's Algorithms, Applications in Networking
9 pages
Showclassmst
No ratings yet
Showclassmst
17 pages
Dsa CH 6 Problems
No ratings yet
Dsa CH 6 Problems
45 pages
Lecture 24
No ratings yet
Lecture 24
27 pages
Greedy Algo
No ratings yet
Greedy Algo
31 pages
Prim's and Kruskal's Algorithm
No ratings yet
Prim's and Kruskal's Algorithm
31 pages
Algorithm 8
No ratings yet
Algorithm 8
16 pages
Greedy Algorithms 3
No ratings yet
Greedy Algorithms 3
36 pages
Kruskal's Algorithm
No ratings yet
Kruskal's Algorithm
4 pages
Experiment No. 4 Design and Analysis Spanning Tree: Solve Minimum Cost Spanning Tree Problem Using Greedy Method
No ratings yet
Experiment No. 4 Design and Analysis Spanning Tree: Solve Minimum Cost Spanning Tree Problem Using Greedy Method
4 pages
MST PDF
No ratings yet
MST PDF
3 pages
GrahamHell HistoryMST
No ratings yet
GrahamHell HistoryMST
15 pages
Ada MTE Presentation
No ratings yet
Ada MTE Presentation
20 pages
Assignment Latex
No ratings yet
Assignment Latex
5 pages
Minimum Spanning Tree: Presented By: Hinal Lunagariya
No ratings yet
Minimum Spanning Tree: Presented By: Hinal Lunagariya
30 pages
Minimum Spanning Tree
No ratings yet
Minimum Spanning Tree
25 pages
L12_trees2-1
No ratings yet
L12_trees2-1
45 pages
12 - Minimum Spanning Tree
No ratings yet
12 - Minimum Spanning Tree
5 pages
Prims and Kruskal - ET - C2 - Roll No - 26
No ratings yet
Prims and Kruskal - ET - C2 - Roll No - 26
8 pages
Minimum Spanning Tree001
No ratings yet
Minimum Spanning Tree001
19 pages
Graph Algorithms 10 271709207487730
No ratings yet
Graph Algorithms 10 271709207487730
21 pages
Min Spanning Trees
No ratings yet
Min Spanning Trees
26 pages
Algorithm
No ratings yet
Algorithm
13 pages
NU-Lec - 20 - MST and Algos
No ratings yet
NU-Lec - 20 - MST and Algos
52 pages
Mathematical Functions
From Everand
Mathematical Functions
Oliver Linton
No ratings yet
3.4. Sharpening Spatial Filtering
No ratings yet
3.4. Sharpening Spatial Filtering
45 pages
Wavelet Basics PDF
No ratings yet
Wavelet Basics PDF
16 pages
JIT10203 Chapter2.0 - Problem Solving
No ratings yet
JIT10203 Chapter2.0 - Problem Solving
23 pages
Neural Network Learning Rules
No ratings yet
Neural Network Learning Rules
33 pages
Dcs Assignment No 2
No ratings yet
Dcs Assignment No 2
6 pages
Huffman Coding Technique For Image Compression: ISSN:2320-0790
No ratings yet
Huffman Coding Technique For Image Compression: ISSN:2320-0790
3 pages
Cluster Analysis
No ratings yet
Cluster Analysis
9 pages
Lab 7
No ratings yet
Lab 7
5 pages
Algorithms & Flowcharts
100% (1)
Algorithms & Flowcharts
37 pages
DSA - TT2 - Practice Questions
No ratings yet
DSA - TT2 - Practice Questions
3 pages
Advance Analysis of Algorithm: Depth First Search & Breadth First Search
No ratings yet
Advance Analysis of Algorithm: Depth First Search & Breadth First Search
46 pages
Spline Interpolation
No ratings yet
Spline Interpolation
8 pages
20C23027 Prac6
No ratings yet
20C23027 Prac6
20 pages
Application of Numerical Methods in Chemical Engineering
100% (3)
Application of Numerical Methods in Chemical Engineering
11 pages
Lecture2 Methods of Determining Stability-Rhc
No ratings yet
Lecture2 Methods of Determining Stability-Rhc
30 pages
02 Systems of Linear Equations
No ratings yet
02 Systems of Linear Equations
42 pages
Search Applications - Games: This Unit Has Two Main Sections Planning Learning Adaptation and Heuristics
No ratings yet
Search Applications - Games: This Unit Has Two Main Sections Planning Learning Adaptation and Heuristics
53 pages
dsp2020-21
No ratings yet
dsp2020-21
4 pages
Tenambit Ps Maths Key Ideas Ass Yr5 t2
No ratings yet
Tenambit Ps Maths Key Ideas Ass Yr5 t2
3 pages
Set Cse Ca 1
No ratings yet
Set Cse Ca 1
13 pages
Magicindicator
No ratings yet
Magicindicator
3 pages
Assignment Problems
100% (1)
Assignment Problems
22 pages
MM Unit-III - 0
No ratings yet
MM Unit-III - 0
22 pages
Week 4
No ratings yet
Week 4
27 pages
Dsa Lecture 14 Graphs
No ratings yet
Dsa Lecture 14 Graphs
39 pages
Dsa Lab 13 064 BSCS
No ratings yet
Dsa Lab 13 064 BSCS
5 pages
Simplex Method (Minimization Example) : Object Function
No ratings yet
Simplex Method (Minimization Example) : Object Function
6 pages

lecture13

Uploaded by

lecture13

Uploaded by

Lecture 13:

Minimum Spanning Trees

Department of Computer Science

(a) (b) (c) (d)

The optimum traveling salesman tour is at most twice the

• If Prim’s algorithm is not correct, these must be some

• But if (x, y) is not in M ST (G), then there must be a path

for (i = 1; i <= g->nvertices; i++) {

Thus Kruskal’s algorithm is correct!

Put the edges in a heap

void union_find_init(union_find *s, int n) {

for (i = 1; i <= n; i++) {

Since the number of nodes is related to the height, the height

If (d1 > d2) then d = d1 ≤ blog k1c ≤ blg(k1 + k2)c = blog kc

On a find, if we are going down a path anyway, why not

This path compression will let us do better than O(n log n)

bool same_component(union_find *s, int s1, int s2) {

int find(union_find *s, int x) {

void union_sets(union_find *s, int s1, int s2) {

if (s->size[r1] >= s->size[r2]) {

You might also like