Algorithms for Data Science

CSOR W4246

Eleni Drinea
Computer Science Department

Columbia University

Randomized quicksort
Outline

1 Randomized Quicksort
Pseudocode for randomized Quicksort

Randomized-Quicksort(A, left, right)

    if left > right then return //A[left..right] is empty
    end if
    split = Randomized-Partition(A, left, right)
    Randomized-Quicksort(A, left, split − 1)
    Randomized-Quicksort(A, split + 1, right)

Randomized-Partition(A, left, right)

    b = random(left, right)
    swap(A[b], A[right])
    return Partition(A, left, right)

Subroutine random(i, j) returns an integer chosen uniformly at random between i and j inclusive.
Expected running time of randomized Quicksort

- Let T(n) be the expected running time of Randomized-Quicksort.
- We want to bound T(n).
- Randomized-Quicksort differs from Quicksort only in how it selects its pivot elements.

⇒ We will analyze Randomized-Quicksort based on Quicksort and Partition.
Pseudocode for Partition

Partition(A, left, right)

    pivot = A[right]                       line 1
    split = left − 1                       line 2
    for j = left to right − 1 do           line 3
        if A[j] ≤ pivot then               line 4
            swap(A[j], A[split + 1])       line 5
            split = split + 1              line 6
        end if
    end for
    swap(A[right], A[split + 1])           line 7
    return split + 1                       line 8
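
The three routines above can be sketched in Python as follows. This is a minimal illustration rather than the course's reference code: the function names, the 0-based indexing, and the optional `rng` parameter (added here so that runs can be made reproducible) are assumptions of this sketch.

```python
import random

def partition(a, left, right):
    """Partition a[left..right] around the pivot a[right]; return its final index."""
    pivot = a[right]
    split = left - 1
    for j in range(left, right):              # j = left, ..., right - 1
        if a[j] <= pivot:
            a[j], a[split + 1] = a[split + 1], a[j]
            split += 1
    # Move the pivot element itself (not a copy) into its final position.
    a[right], a[split + 1] = a[split + 1], a[right]
    return split + 1

def randomized_partition(a, left, right, rng=random):
    b = rng.randint(left, right)              # uniform integer in [left, right]
    a[b], a[right] = a[right], a[b]
    return partition(a, left, right)

def randomized_quicksort(a, left=0, right=None, rng=random):
    if right is None:
        right = len(a) - 1
    if left > right:                          # empty subarray
        return
    split = randomized_partition(a, left, right, rng)
    randomized_quicksort(a, left, split - 1, rng)
    randomized_quicksort(a, split + 1, right, rng)
```

Calling `randomized_quicksort(data)` sorts the list `data` in place. Passing `rng=random.Random(seed)` fixes the sequence of pivot choices.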
Few observations

1. How many times is Partition called?

At most n: every call places its pivot in its final position, and the pivot is excluded from both recursive calls.

2. Further, each Partition call spends some work

   1. outside the for loop
      - every Partition call spends constant work outside the for loop
      - there are at most n calls to Partition
      ⇒ total work outside the for loop in all calls to Partition is O(n)
   2. inside the for loop
      - let X be the total number of comparisons performed at line 4 in all calls to Partition
      - each comparison may require some further constant work (lines 5 and 6)
      ⇒ total work inside the for loop in all calls to Partition is O(X)
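
The two quantities in these observations can be measured directly. The sketch below is an illustration under assumptions: `count_work` and its `stats` dictionary are hypothetical names, and the recursion stops on an empty subarray. It counts the Partition calls and the line-4 comparisons made while sorting a copy of the input.

```python
import random

def count_work(values, seed=0):
    """Sort a copy of `values`; return (partition_calls, comparisons)."""
    rng = random.Random(seed)
    a = list(values)
    stats = {"calls": 0, "comparisons": 0}

    def partition(left, right):
        stats["calls"] += 1
        b = rng.randint(left, right)          # random pivot, moved to the end
        a[b], a[right] = a[right], a[b]
        pivot = a[right]
        split = left - 1
        for j in range(left, right):
            stats["comparisons"] += 1         # the comparison at line 4
            if a[j] <= pivot:
                a[j], a[split + 1] = a[split + 1], a[j]
                split += 1
        a[right], a[split + 1] = a[split + 1], a[right]
        return split + 1

    def sort(left, right):
        if left > right:                      # empty subarray
            return
        split = partition(left, right)
        sort(left, split - 1)
        sort(split + 1, right)

    sort(0, len(a) - 1)
    return stats["calls"], stats["comparisons"]
```

With this base case every element serves as a pivot exactly once, so the number of calls equals n, while the number of comparisons varies with the random pivot choices.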
Towards a bound for T(n)

X = the total number of comparisons in all Partition calls.

The running time of Randomized-Quicksort is O(n + X).

Since X is a random variable, we need E[X] to bound T(n).

Fact 1.
Fix any two input items. During the execution of the algorithm, they are compared at most once.

Proof.
Comparisons are only performed with the pivot of each Partition call. After Partition returns, the pivot is in its final location in the output and will not be part of the input to any future recursive call.
Simplifying the analysis

- There are n numbers in the input, hence $\binom{n}{2} = \frac{n(n-1)}{2}$ distinct (unordered) pairs of input numbers.
- From Fact 1, the algorithm will perform at most $\binom{n}{2}$ comparisons.
- What is the expected number of comparisons?
Simplifying the analysis

To simplify the analysis

- relabel the input as $z_1, z_2, \ldots, z_n$, where $z_i$ is the i-th smallest number.
- assume that all input numbers are distinct; thus $z_i < z_j$ for $i < j$.
Writing X as the sum of indicator random variables

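Let $X_{ij}$ be an indicator random variable such that

$$X_{ij} = \begin{cases} 1, & \text{if } z_i \text{ and } z_j \text{ are ever compared} \\ 0, & \text{otherwise} \end{cases}$$

The total number of comparisons is given by $X = \sum_{1 \le i < j \le n} X_{ij}$.

By linearity of expectation

$$E[X] = E\Big[\sum_{1 \le i < j \le n} X_{ij}\Big] = \sum_{1 \le i < j \le n} E[X_{ij}] = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \Pr[X_{ij} = 1]$$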

Goal: compute $\Pr[X_{ij} = 1]$, that is, the probability that two fixed items $z_i$ and $z_j$ are ever compared.
Fix two items $z_i$ and $z_j$. When are they compared?

Notation: let $Z_{ij} = \{z_i, z_{i+1}, \ldots, z_j\}$

Consider the initial call Partition(A, 1, n). Assume it picks $z_k$ outside $Z_{ij}$ as pivot (see figure below).

[Figure: the sorted order $z_1 < z_2 < \ldots < z_i < z_{i+1} < \ldots < z_j < \ldots < z_k < \ldots < z_n$, with the block $Z_{ij}$ bracketed and the pivot $z_k$ marked outside it.]

1. $z_i$ and $z_j$ are not compared in this call (why?).

2. All items in $Z_{ij}$ will be greater (or smaller) than $z_k$, so they will all be input to the same subproblem after Partition(A, 1, n) returns.
In the first Partition with pivot ∈ $Z_{ij} = \{z_i, \ldots, z_j\}$
The first Partition call that picks its pivot from $Z_{ij}$ determines if $z_i$, $z_j$ are ever compared. Three possibilities:

1. pivot = $z_i$

   $z_i$ is compared with every element in $Z_{ij} - \{z_i\}$, thus with $z_j$ too. $z_i$ is placed in its final location in the output and will not appear in any future calls to Partition.

2. pivot = $z_j$

   $z_j$ is compared with every element in $Z_{ij} - \{z_j\}$, thus with $z_i$ too. $z_j$ is placed in its final location in the output and will not appear in any future recursive calls.

3. pivot = $z_\ell$, for some $i < \ell < j$

   $z_i$ and $z_j$ are never compared (why?)

So $z_i$ and $z_j$ are compared exactly when either of them is chosen as pivot in that first Partition call that chooses its pivot element from $Z_{ij}$.

Now we can compute $\Pr[X_{ij} = 1]$:

$$\begin{aligned}
\Pr[X_{ij} = 1] = \Pr[\, & z_i \text{ is chosen as pivot by the first Partition that picks its pivot from } Z_{ij}, \text{ or} \\
& z_j \text{ is chosen as pivot by the first Partition that picks its pivot from } Z_{ij} \,] \qquad (1)
\end{aligned}$$
The union bound

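Suppose we are given a set of events $\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$, and we are interested in the probability that at least one of them happens.

Union bound: Given events $\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_n$, we have

$$\Pr\Big[\bigcup_{i=1}^{n} \varepsilon_i\Big] \le \sum_{i=1}^{n} \Pr[\varepsilon_i].$$

Union bound for mutually exclusive events: Suppose that $\varepsilon_i \cap \varepsilon_j = \emptyset$ for each pair of distinct events. Then

$$\Pr\Big[\bigcup_{i=1}^{n} \varepsilon_i\Big] = \sum_{i=1}^{n} \Pr[\varepsilon_i].$$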
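
A small exact example may make the two statements concrete. The sketch below is an illustration only: the fair six-sided die and the helper `pr` are assumptions chosen for this example. Events are modelled as sets of outcomes and probabilities are computed as exact fractions.

```python
from fractions import Fraction

OUTCOMES = frozenset(range(1, 7))             # one roll of a fair six-sided die

def pr(event):
    """Probability of an event (a set of outcomes) under the uniform distribution."""
    return Fraction(len(event & OUTCOMES), len(OUTCOMES))

even = {2, 4, 6}
at_least_four = {4, 5, 6}
one = {1}

# Overlapping events: the union bound is a strict inequality here.
overlapping = (pr(even | at_least_four), pr(even) + pr(at_least_four))

# Mutually exclusive events: the bound holds with equality.
disjoint = (pr(even | one), pr(even) + pr(one))
```

Here `overlapping` pairs the probability 2/3 of the union with the larger sum 1, while both entries of `disjoint` equal 2/3.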
Computing the probability that $z_i$ and $z_j$ are compared

Since the two events in equation (1) are mutually exclusive, we obtain

$$\begin{aligned}
\Pr[X_{ij} = 1] &= \Pr[z_i \text{ is chosen as pivot by the first Partition call that picks its pivot from } Z_{ij}] \\
&\quad + \Pr[z_j \text{ is chosen as pivot by the first Partition call that picks its pivot from } Z_{ij}] \\
&= \frac{1}{j-i+1} + \frac{1}{j-i+1} = \frac{2}{j-i+1}, \qquad (2)
\end{aligned}$$

since the set $Z_{ij}$ contains $j - i + 1$ elements, each equally likely to be the first one chosen as pivot.
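
Equation (2) can be verified exactly for small inputs without simulation. The sketch below is a hypothetical helper, not part of the original slides. It conditions on the rank of the pivot chosen for the current subproblem and recurses into whichever side still contains both items, using exact fractions.

```python
from fractions import Fraction
from functools import lru_cache

@lru_cache(maxsize=None)
def pr_compared(lo, hi, i, j):
    """Exact Pr[z_i and z_j are compared] when sorting ranks lo..hi (lo <= i < j <= hi)."""
    size = hi - lo + 1
    total = Fraction(0)
    for k in range(lo, hi + 1):               # pivot rank, each with probability 1/size
        if k == i or k == j:
            total += 1                        # compared in this very call
        elif k < i:
            total += pr_compared(k + 1, hi, i, j)   # both go to the right subproblem
        elif k > j:
            total += pr_compared(lo, k - 1, i, j)   # both go to the left subproblem
        # i < k < j: the two items are separated and never compared (adds 0)
    return total / size
```

For example, `pr_compared(1, 3, 1, 3)` evaluates to 2/3, matching $2/(j-i+1)$ with $i = 1$ and $j = 3$.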
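From $\Pr[X_{ij} = 1]$ to E[X]

$$E[X] = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \Pr[X_{ij} = 1] = \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \frac{2}{j-i+1} = 2 \sum_{i=1}^{n-1} \sum_{\ell=2}^{n-i+1} \frac{1}{\ell} \qquad (3)$$

where the last step substitutes $\ell = j - i + 1$.

Note that $\sum_{\ell=1}^{k} \frac{1}{\ell} = H_k$ is the k-th harmonic number, such that

$$\ln k \le H_k \le \ln k + 1 \qquad (4)$$

Hence $\sum_{\ell=2}^{n-i+1} \frac{1}{\ell} = H_{n-i+1} - 1 \le \ln(n-i+1)$. Substituting in (3), we get

$$E[X] \le 2 \sum_{i=1}^{n-1} \ln(n-i+1) \le 2 \sum_{i=1}^{n-1} \ln n = O(n \ln n)$$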
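
As a numerical sanity check, the double sum in equation (3) can be evaluated exactly and compared with the upper bound $2(n-1)\ln n$ derived above. The function name below is an assumption of this sketch.

```python
from fractions import Fraction
from math import log

def expected_comparisons(n):
    """Exact E[X] from equation (3): 2 * sum_{i=1}^{n-1} sum_{l=2}^{n-i+1} 1/l."""
    total = Fraction(0)
    for i in range(1, n):                     # i = 1, ..., n - 1
        for l in range(2, n - i + 2):         # l = 2, ..., n - i + 1
            total += Fraction(1, l)
    return 2 * total

def upper_bound(n):
    """The bound 2 * (n - 1) * ln n obtained by substituting (4) into (3)."""
    return 2 * (n - 1) * log(n)
```

For n = 3 the exact value is 8/3: the two adjacent pairs are always compared, and the pair $(z_1, z_3)$ is compared with probability 2/3.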
From E[X] to T(n)

- Equations (3), (4) also yield a lower bound of Ω(n ln n) for E[X] (show this!).
- Hence E[X] = Θ(n ln n). Then the expected running time of Randomized-Quicksort is

T(n) = Θ(n ln n)
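
One possible route to the lower bound, offered as a sketch rather than as the intended solution, and assuming $n \ge 2$: apply the left inequality of (4) to each inner sum in (3), then use the fact that the largest $\lceil n/2 \rceil$ factors of $n!$ are each at least $n/2$.

```latex
\begin{aligned}
E[X] &= 2 \sum_{i=1}^{n-1} \left( H_{n-i+1} - 1 \right)
      \ge 2 \sum_{i=1}^{n-1} \left( \ln(n-i+1) - 1 \right) \\
     &= 2 \ln(n!) - 2(n-1)
      \ge n \ln\frac{n}{2} - 2(n-1)
      = \Omega(n \ln n)
\end{aligned}
```

The middle equality uses $\sum_{i=1}^{n-1} \ln(n-i+1) = \sum_{m=2}^{n} \ln m = \ln(n!)$, and the last inequality uses $n! \ge (n/2)^{n/2}$.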
