Cs5002 Lect9 Fall18 Notes
Lecture 9: November 8, 2018
Instructors: Adrienne Slaughter, Tamara Bonaci
Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications.
They may be distributed outside this class only with the permission of the Instructor.
Introduction to Algorithms
9.1 Overview
The goal of asymptotic analysis is to find a bound, or an asymptote, of a function. This allows
us to come up with an “ordering” of functions, such that one function is definitely bigger than another,
in order to compare two functions. We do this by considering the value of the functions as n goes to
infinity, that is, for very large values of n, rather than the small values of n that are easy to calculate
directly. Once we have this ordering, we can introduce some terminology to describe the relationship
between two functions.
Growth of Functions

[Figure: growth of common functions, plotted on a log scale for n = 1 to 8: √n, log n, n, n log(n), n², 2ⁿ, n!, nⁿ. Faster-growing functions dominate as n increases.]

1 ≺ log n ≺ √n ≺ n ≺ n log(n) ≺ n² ≺ 2ⁿ ≺ n! ≺ nⁿ (in increasing order of growth) (9.1)
Complexity Terminology
Θ(1) Constant
Θ(log n) Logarithmic
Θ(n) Linear
Θ(n log n) Linearithmic
Θ(n^b) Polynomial
Θ(bn ) (where b > 1) Exponential
Θ(n!) Factorial
Definition 9.1 (Big-O: Upper Bound) f (n) = O(g(n)) means there exists some constant c such
that f (n) ≤ c · g(n), for large enough n (that is, as n → ∞).
We say f (n) = O(g(n)) or “f of n is Big-O of g of n”.
Example: I claim 3n2 − 100n + 6 = O(n2 ). I can prove this using the definition of big-O:
Proving 9.7: Choose c = 3 and n₀ = 1. For all n ≥ 1 we have 100n ≥ 6, so 3n² − 100n + 6 ≤ 3n² = c · n². Hence 3n² − 100n + 6 = O(n²).
We also know this to be true because order is transitive: if f (n) = O(g(n)), and g(n) = O(h(n)), then
f (n) = O(h(n)). Since n2 = O(n3 ), then any f (n) = O(n2 ) is also O(n3 ).
Proving 9.8: Since n² = O(n³) (take c = 1, n₀ = 1), transitivity gives 3n² − 100n + 6 = O(n³) as well.
Definition 9.2 (Big-Omega: Lower Bound) f (n) = Ω(g(n)) means there exists some constant c
such that f (n) ≥ c · g(n), for large enough n (that is, as n → ∞).
We say f (n) = Ω(g(n)) or “f of n is Big Omega of g of n”
Example: I claim 3n2 − 100n + 6 = Ω(n2 ). I can prove this using the definition of big-Omega:
Proving 9.21: Choose c = 2 and n₀ = 100. For all n ≥ 100 we have n² ≥ 100n, so 3n² − 100n + 6 = 2n² + (n² − 100n) + 6 ≥ 2n² = c · n². Hence 3n² − 100n + 6 = Ω(n²).
Proving 9.22: On the other hand, 3n² − 100n + 6 is not Ω(n³): for any constant c > 0, we have c · n³ > 3n² once n > 3/c, so no constant works for large n.
Definition 9.3 (Big-Theta: “Tight” Bound) f (n) = Θ(g(n)) means there exist constants c₁
and c₂ such that f (n) ≤ c₁ g(n) and f (n) ≥ c₂ g(n), for large enough n.
We say f (n) = Θ(g(n)) or “f of n is Big-Theta of g of n”.
Definition 9.4 (Theta and “order of ”) When f (x) = Θ(g(x)), it is the same as saying f (x) is the
order of g(x), or that f (x) and g(x) are the same order.
3n² − 100n + 6 = Θ(n²), since both the O(n²) and Ω(n²) bounds apply.
(log_b n)^c is O(n^d), but n^d is not O((log_b n)^c)
(9.30)
This tells us that every positive power of the logarithm of n to the base b, where b > 1, is big-O of every
positive power of n, but the reverse relationship never holds. In Example 7, we also showed that n is
O(2^n). More generally, whenever d is positive and b > 1, we have

n^d is O(b^n), but b^n is not O(n^d)

This tells us that every power of n is big-O of every exponential function of n with a base that is greater
than one, but the reverse relationship never holds. Furthermore, when c > b > 1, we have

b^n is O(c^n), but c^n is not O(b^n)

This tells us that if we have two exponential functions with different bases greater than one, one of these
functions is big-O of the other if and only if its base is smaller or equal.
These statements express the notion that the largest term of an expression is the dominant one. For
example, n³ + 2n² + 3 = O(n³).
Example: Prove that n² = O(2ⁿ). (Take c = 1 and n₀ = 4: at n = 4 we have 4² = 16 ≤ 2⁴ = 16, and for n ≥ 4 the ratio (n + 1)²/n² is at most 2 while 2ⁿ⁺¹/2ⁿ = 2, so n² ≤ 2ⁿ for all n ≥ 4.)
Example: Prove that if f₁(n) = O(g₁(n)) and f₂(n) = O(g₂(n)), then f₁(n) + f₂(n) = O(g₁(n) + g₂(n)). (Sketch: if f₁ ≤ c₁ g₁ for n ≥ n₁ and f₂ ≤ c₂ g₂ for n ≥ n₂, then f₁ + f₂ ≤ max(c₁, c₂) · (g₁ + g₂) for all n ≥ max(n₁, n₂).)
Example: If f (n) = n + log n + √n, find a simple function g such that f (n) = Θ(g(n)). (Here g(n) = n works: both log n and √n are at most n for n ≥ 1, so n ≤ f (n) ≤ 3n.)
Summary
• f (n) = O(g(n)) means c · g(n) is an upper bound on f (n). Thus there exists some constant c such
that f (n) is always ≤ c · g(n), for large enough n (i.e., n ≥ n0 for some constant n0 ).
• f (n) = Ω(g(n)) means c · g(n) is a lower bound on f (n). Thus there exists some constant c such
that f (n) is always ≥ c · g(n), for all n ≥ n0 .
• f (n) = Θ(g(n)) means c1 · g(n) is an upper bound on f (n) and c2 · g(n) is a lower bound on f (n),
for all n ≥ n0. Thus there exist constants c1 and c2 such that f (n) ≤ c1 · g(n) and f (n) ≥ c2 · g(n).
This means that g(n) provides a nice, tight bound on f (n).
• Correct
• Efficient in time
• Efficient in space
• Example algorithms
– Binary Search
– Selection Sort
• Algorithm Analysis
– Proving Correctness (briefly)
– Run time: How long does it take for an algorithm to run?
– Run space: How much extra memory/storage does an algorithm require?
• Asymptotic Analysis and Growth of Functions
Expressing Algorithms
We need some way to express the sequence of steps in an algorithm.
• English
• Graphically
• Pseudocode
• Real programming languages (C, Java, Python, etc.)
An algorithm is an idea. If the idea is not clear when you express the algorithm, then you are
expressing it at too low a level.
Searching
Input: A set of N values: n1 , n2 , n3 , . . . , nN and a target value t
Imagine...
A (sub) roster of athletes on the USA Olympic Ski & Snowboard team for 2018:
1 Andy Newell
2 Bryan Fletcher
3 Chloe Kim
4 Jessie Diggins
5 Lindsey Vonn
6 Sadie Bjornsen
7 Sophie Caldwell
8 Taylor Fletcher
1 Binary-Search(A, I, 1, numElems)
Quick Review
1. I stated the problem
2. I described a solution in English (using a metaphor)
3. I described the solution with pseudocode
4. I provided a graphical solution
Selection-Sort(A, B)
1 for i = 1 to A.length
2     min ind = 1
3     for j = 1 to A.length
4         if A[j] < A[min ind]
5             min ind = j
6     B[i] = A[min ind]
7     A[min ind] = Inf
The C code

void selection_sort(int a[], int b[], int len) {
    for (int i = 0; i < len; i++) {
        int min_ind = 0;
        for (int j = 0; j < len; j++) {
            if (a[j] < a[min_ind]) {
                min_ind = j;
            }
        }
        b[i] = a[min_ind];
        a[min_ind] = INT_MAX;   /* sentinel (from <limits.h>); safer than a magic 10000 */
    }
}
9.4 Analysis
What is Algorithm Analysis?
When we analyze algorithms, we are analyzing for 3 things:
1. Correctness
2. Run time
3. Run space
9.4.1 Correctness
Proving Correctness
How to prove that an algorithm is correct?
For any algorithm, we must prove that it always returns the desired output for all legal instances of the
problem.
For sorting, this means even if the input is already sorted or it contains repeated elements.
Proof by:
• Induction
• Counterexample
• Loop Invariant
Proof by Counterexample Searching for counterexamples is the best way to disprove the correctness of a heuristic.
• Think about small examples
• Think about examples on or around your decision points
• Think about extreme examples (big or small)
Proof by Induction
Failure to find a counterexample to a given algorithm does not mean “it is obvious” that the algorithm
is correct.
Mathematical induction is a very useful method for proving the correctness of recursive algorithms.
1. Prove base case
2. Assume true for arbitrary value n
3. Prove true for case n + 1
Proof by Loop Invariant
Built off proof by induction.
Useful for algorithms that loop.
1. Find p, a loop invariant
2. Show the base case for p
3. Use induction to show the rest.
• Best Case
– Given an optimal input and all the best decisions that can be made, how many steps until the
algorithm terminates?
– In a sort problem, it’s usually a sorted input.
• Worst Case
– Given the worst possible input and all the worst decisions that can be made, how many steps
until the algorithm terminates?
– In a search problem, it’s usually the last item looked at.
• Average Case
– Somewhere between the two; frequently an averaging of best & worst.
Run Time of Selection Sort
Selection-Sort(A, B)
1 for i = 1 to A.length // n
2     min ind = 1 // 1
3     for j = 1 to A.length // n
4         if A[j] < A[min ind] // 1
5             min ind = j // 1
6     B[i] = A[min ind] // 1
7     A[min ind] = Inf // 1
Actually, for Selection Sort, there’s no difference in run time between the Best/Worst/Average cases.
In all cases, we still iterate through all the elements: the outer loop runs n times, and each outer
iteration runs the inner loop n times.
⇒ O(n²)
Run Time of Binary Search
Best case: the target is the first middle element examined, e.g.
Binary-Search({1, 2, 3}, 2, 1, 3)
which finds 2 on the first probe: O(1).
Worst case: each step does a constant amount of work and then recurses on half the range:
R(n) = 1 + 1 + 1 + R(n/2)
Unrolling gives R(n) = 3 lg n + R(1), so
R(n) = O(lg n)
PrintFoobar(n)
1 for i = 1 to n/2
2     for j = i to n − 1
3         for k = 1 to j
4             Print(’foobar’)
Assume n is even. Let T (n) denote the number of times ‘foobar’ is printed as a function of n.
Number of Instructions / Instructions Per Second = Number of Seconds    (9.38)
The number of instructions is measured in terms of n, the size of the input to the algorithm. While (for
the most part) the number of instructions is about the same from machine to machine, what varies is
the number of instructions per second that are run. This gives us 3 variables: number of instructions,
instructions per second and number of seconds to run the algorithm. Therefore, if we know 2 we can
calculate the third.
Here’s an example: Let’s say I’ve implemented Selection-Sort on my MacBook Pro. I know Selection-
Sort takes n2 instructions to run. I choose to run it on an input length of 10,000 items. It takes 2 clock
seconds to run (this is a number I’m choosing for illustration purposes; that’s way too long!).
Number of Instructions / Instructions per Second = Number of Seconds    (9.39)
n² / (x Instructions per Second) = 2 seconds    (9.40)
n = 10,000:   10,000² / x = 2 seconds    (9.41)
x = 10,000² instructions / 2 seconds = 50 MIPS [2]    (9.42)
[2] Millions of Instructions per Second. For reference, the iPhone 6 was probably around 25,000 MIPS (in 2014).
Thus, if we know our algorithm has n2 instructions, and we measure that it takes 2 seconds to run on
our machine with an input of 10,000 items, then our machine runs at about 50 MIPS.
Further, now that we know our machine runs at 50 MIPS, we can use that to estimate how long it will
take to run a different algorithm (that is, different run time) or different input size.
Let’s say we have one million items as input to the same algorithm: (10⁶)² = 10¹² instructions, and
10¹² instructions / (5 × 10⁷ instructions per second) = 20,000 seconds, or roughly 5.5 hours.
Let’s take two relatively recent machines. One is powered by the Intel Core i7 500U, which runs at 49,360
MIPS (roughly 50,000 MIPS), and the other is an Intel Core i7 2600K at 117,160 MIPS (roughly 117,000 MIPS).
Runtime   Size of Input   Intel i7 A (50,000 MIPS)              Intel i7 B (117,000 MIPS)
n         1,000           1,000 / 50,000 MIPS = 0.02 µs         1,000 / 117,000 MIPS = 0.009 µs      (linear)
n         1,000,000       1,000,000 / 50,000 MIPS = 20 µs       1,000,000 / 117,000 MIPS = 8.5 µs
n log n   1,000           3,000 / 50,000 MIPS = 0.06 µs         3,000 / 117,000 MIPS = 0.03 µs       (linearithmic)
n log n   1,000,000       6,000,000 / 50,000 MIPS = 120 µs      6,000,000 / 117,000 MIPS = 51.3 µs
n²        1,000           1,000² / 50,000 MIPS = 20 µs          1,000² / 117,000 MIPS = 8.5 µs       (quadratic)
2ⁿ        50              2⁵⁰ / 50,000 MIPS ≈ 22,518 s          2⁵⁰ / 117,000 MIPS ≈ 9,623 s         (exponential)

(Here log is log₁₀, so 1,000 log 1,000 = 3,000. Note the exponential row is measured in seconds, not microseconds: 2⁵⁰ ≈ 1.1 × 10¹⁵ instructions.)
Run Time, Summary
• We count up the number of statements that are run.
• Consider whether there’s a difference in how long it takes a function to run given different inputs.
• Selection sort is O(n2 ), pretty much all the time.
• Binary search can be either O(1) or O(lg n), depending on what the input looks like.
Run Space, Summary

Selection sort: needs a whole other array! O(n) extra space.
Note: this is just how it was implemented here, in this discussion. If we chose not to use that other
array, it wouldn’t be O(n).
• A graph G = (V, E) is bipartite if the nodes V can be partitioned into sets X and Y in such a way
that every edge has one end in X and the other in Y .
• It’s just a graph, but we tend to depict bipartite graphs in two columns to emphasize the bipartite-
ness.
[Figure: a bipartite graph drawn in two columns, nodes 1–6 in X on the left and nodes 1–6 in Y on the right; every edge crosses between the columns.]
Bipartite Matching
• Bipartite Matching is relevant when we want to match one set of things to another set of things.
– Nodes could be Jobs and Machines; Edges indicate that a given machine can do the job.
– Nodes could be men and women; Edges indicate that a given man is married to a given woman.
(Okay, in the real world it’s more complex, but this is a classic “problem” I feel required to
present...)
Solution: use backtracking and augmenting paths to solve the problem; this approach leads into
network flow problems.
Independent Set
Independent Set is a very general problem:
• Given a graph G = (V, E), a set of nodes S ⊆ V is independent if no two nodes in S are joined
by an edge.
• Goal: Find an independent set that is as large as possible.
• Applicable to any problem where you are choosing a collection of objects and there are pairwise
conflicts.
[Figure: an example graph on seven nodes, 1–7, with conflict edges between some pairs.]
• Example: Each node is a friend, and each edge indicates a conflict between those two friends. Use
Independent Set to find the largest group of people you can invite to a party with no conflicts.
• Interval Scheduling is a special case of Independent Set:
– Define graph G = (V, E) where V is the set of requests or intervals, and E is the set of edges
that indicate conflicts between two requests.
• Bipartite Matching is a special case of Independent Set:
– A little more complex than I want to explain in class; see the book and we’ll cover it later.
• Solution: No efficient algorithm is known to solve this problem.
• However: If we’re given an independent set for a given graph G, we can easily check that it’s a
correct answer.
Competitive Facility Location
This time, we have a two-player game.
• Dunkin Donuts puts a café at one location.
• Then Starbucks does.
• BUT! Cafés can’t be too close (zoning requirement).
• Goal: Place your shops in the most convenient locations possible.
• Model the problem:
– Consider each location as a zone (rather than a point) that has an estimated value or revenue.
– Model the problem as a graph: G = (V, E), where V is the set of zones as noted above, and E
represents whether two zones are adjacent.
– The zoning requirement says that the set of cafés is an independent set in G.
[Figure: a path of six zones with estimated values 10, 1, 15, 5, 1, 15.]
9.7 Logarithms
9.7.1 Definition
What is a logarithm?
b^x = y  ⟺  log_b(y) = x
log₇(49) = ?
⇒ 7^? = 49
⇒ 7² = 49
⇒ log₇(49) = 2
9.7.2 Properties
Special Logs
• Base b = 2: binary logarithm, also referred to as lg x
• Base b = e: natural logarithm, also referred to as ln x, where e ≈ 2.71828 . . .
– The inverse of ln x is exp(x) = e^x
⇒ exp(ln x) = x
• Base b = 10: The common logarithm, also referred to as log x.
• If it’s not one of these, the base is specified.
Restrictions
logb (a) is only defined when b > 1 and a > 0.
Practice: Use what you know about exponents to convince yourself why this is true.
The Product Rule
loga (xy) = loga (x) + loga (y)
3 + log₇ x = 8 − 4 log₇ x
5 log₇ x = 5
log₇ x = 1    (9.45)
x = 7¹ = 7
K = {7}
(b) 2^((5 + log x)/(3 − log x)) = 8,  x > 0

(5 + log x)/(3 − log x) = 3
5 + log x = 9 − 3 log x
4 log x = 4    (9.46)
log x = 1
x = 10¹ = 10
K = {10}
(c) log₄(x² − 9) − log₄(x + 3) = 3

log₄((x² − 9)/(x + 3)) = log₄ 64
(x² − 9)/(x + 3) = 64
(x − 3)(x + 3)/(x + 3) = 64
x − 3 = 64
x = 67 ∈ (3, ∞)
K = {67}
Summary