
Algorithms for Data Science

CSOR W4246

Eleni Drinea
Computer Science Department

Columbia University

Insertion sort, efficient algorithms


Outline

1 Overview

2 A first algorithm: insertion sort

3 Analysis of algorithms

4 Efficiency of algorithms
Algorithms

- An algorithm is a well-defined computational procedure that transforms the input (a set of values) into the output (a new set of values).

- The desired input/output relationship is specified by the statement of the computational problem for which the algorithm is designed.

- An algorithm is correct if, for every input, it halts with the correct output.
Efficient Algorithms

- In this course we are interested in algorithms that are correct and efficient.

- Efficiency is related to the resources an algorithm uses: time and space.
  - How much time/space is used?
  - How do they scale as the input size grows?

We will primarily focus on efficiency in running time.


Running time

Running time = number of primitive computational steps performed; typically these are

1. arithmetic operations on fixed-size integers: add, subtract, multiply, divide
2. data movement operations: load, store, copy
3. control operations: branching, subroutine call and return

We will use pseudocode for our algorithm descriptions.


Sorting

- Input: A list A of n integers x_1, ..., x_n.

- Output: A permutation x'_1, x'_2, ..., x'_n of the n integers sorted in non-decreasing order, i.e., x'_1 ≤ x'_2 ≤ ... ≤ x'_n.

Example
- Input: n = 6, A = {9, 3, 2, 6, 8, 5}
- Output: A = {2, 3, 5, 6, 8, 9}

What data structure should we use to represent the list?

Array: a collection of items of the same data type that
- allows for random access
- is zero-indexed in C++ and Java
Main idea of insertion sort

[Figure: the array split into a sorted prefix A[1, i − 1] and an unsorted suffix A[i, n], with key = A[i] the next element to insert.]

1. Start with a (trivially) sorted subarray of size 1 consisting of A[1].

2. Increase the size of the sorted subarray by 1, by inserting the next element of A, call it key, in the correct position in the sorted subarray to its left. How?
   - Compare key with every element x in the sorted subarray to the left of key, starting from the right.
   - If x > key, move x one position to the right.
   - If x ≤ key, insert key after x.

3. Repeat Step 2 until the sorted subarray has size n.

Example of insertion sort: n = 6, A = {9, 3, 2, 6, 8, 5}

  9 3 2 6 8 5    beginning of iteration i=2 (sorted: A[1])
  3 9 2 6 8 5    beginning of iteration i=3 (sorted: A[1, 2])
  2 3 9 6 8 5    beginning of iteration i=4 (sorted: A[1, 3])
  2 3 6 9 8 5    beginning of iteration i=5 (sorted: A[1, 4])
  2 3 6 8 9 5    beginning of iteration i=6 (sorted: A[1, 5])
  2 3 5 6 8 9    end of iteration i=6 (sorted)
Pseudocode

Let A be an array of n integers.

insertion-sort(A)
  for i = 2 to n do
    key = A[i]
    // Insert A[i] into the sorted subarray A[1, i − 1]
    j = i − 1
    while j > 0 and A[j] > key do
      A[j + 1] = A[j]
      j = j − 1
    end while
    A[j + 1] = key
  end for
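The pseudocode translates directly to Python; note the shift to 0-indexing. This is an illustrative sketch, not the course's official code:

```python
def insertion_sort(A):
    """Sort list A in place in non-decreasing order; also return it."""
    for i in range(1, len(A)):
        key = A[i]
        # Insert A[i] into the sorted subarray A[0 .. i-1]
        j = i - 1
        while j >= 0 and A[j] > key:
            A[j + 1] = A[j]   # shift the larger element one position right
            j -= 1
        A[j + 1] = key
    return A
```

For example, `insertion_sort([9, 3, 2, 6, 8, 5])` reproduces the trace above, ending in `[2, 3, 5, 6, 8, 9]`.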
Analysis of algorithms

- Correctness: formal proof, often by induction.

- Running time: number of primitive computational steps.
  - Not the same as the time it takes to execute the algorithm.
  - We want a measure that is independent of hardware.
  - We want to know how running time scales with the size of the input.

- Space: how much space is required by the algorithm.
Analysis of insertion sort

Notation: A[i, j] is the subarray of A that starts at position i and ends at position j.

- Correctness: follows from the key observation that after loop i, the subarray A[1, i] is sorted.

- Running time: number of primitive computational steps.

- Space: in-place algorithm (at most a constant number of elements of A are stored outside A at any time).
Example of induction

Fact 1.
For all n ≥ 1, Σ_{i=1}^{n} i = n(n+1)/2.

Proof.
- Base case: n = 1; both sides equal 1.
- Inductive hypothesis: assume that the statement is true for some n ≥ 1, that is, Σ_{i=1}^{n} i = n(n+1)/2.
- Inductive step: we show that the statement is then true for n + 1, that is, Σ_{i=1}^{n+1} i = (n+1)(n+2)/2. (Show this!)
- Conclusion: it follows that the statement is true for all n ≥ 1, since starting from the base case we can apply the inductive step for n = 1, 2, 3, ....
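Fact 1 can also be spot-checked numerically (this is a sanity check, not a proof; the function name is illustrative):

```python
def gauss_sum(n):
    """Closed form for 1 + 2 + ... + n."""
    return n * (n + 1) // 2

# compare the closed form against a direct summation for small n
for n in range(1, 200):
    assert sum(range(1, n + 1)) == gauss_sum(n)
```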
Correctness of insertion-sort

Notation: A[i, j] is the subarray of A that starts at position i and ends at position j.

Minor change in the pseudocode: in line 1, start from i = 1 rather than i = 2. How does this change affect the algorithm?

Claim 1.
Let n ≥ 1 be a positive integer. For all 1 ≤ i ≤ n, after the i-th loop, the subarray A[1, i] is sorted.

Correctness of insertion-sort follows if we show Claim 1 (why?).
Proof of Claim 1

By induction on i.

- Base case: i = 1, trivial.
- Induction hypothesis: assume that the statement is true for some 1 ≤ i < n.
- Inductive step: show that it is true for i + 1.

In loop i + 1, element key = A[i + 1] is inserted into A[1, i]. By the induction hypothesis, A[1, i] is sorted. Since
1. key is inserted after the last element A[ℓ] such that 0 ≤ ℓ ≤ i and A[ℓ] ≤ key (with ℓ = 0 meaning key is placed at the front);
2. all elements in A[ℓ + 1, i] are shifted one position to the right with their order preserved,
the statement is true for i + 1.
Visual proof of the inductive step

[Figure: A[ℓ] is the rightmost element of A[1, i] such that A[ℓ] ≤ key. End of the i-th iteration: A[1, i] is sorted and key = A[i+1] is unexamined. End of the (i+1)-st iteration: key occupies position ℓ+1, the elements formerly in A[ℓ+1, i] occupy A[ℓ+2, i+1], and A[1, i+1] is sorted.]
Running time T(n) of insertion-sort

- How many primitive computational steps are executed by the algorithm?
- Equivalently, what is the running time T(n)? Bounds on T(n)?
Running time T(n) of insertion-sort

  for i = 2 to n do                                        line 1
    key = A[i]                                             line 2
    // Insert A[i] into the sorted subarray A[1, i − 1]
    j = i − 1                                              line 3
    while j > 0 and A[j] > key do                          line 4
      A[j + 1] = A[j]                                      line 5
      j = j − 1                                            line 6
    end while
    A[j + 1] = key                                         line 7
  end for

- For 2 ≤ i ≤ n, let t_i = # times line 4 is executed.

Running time T(n) of insertion-sort

- For 2 ≤ i ≤ n, let t_i = # times line 4 is executed. Then

  T(n) = n + 3(n − 1) + Σ_{i=2}^{n} t_i + 2 Σ_{i=2}^{n} (t_i − 1) = 3 Σ_{i=2}^{n} t_i + 2n − 1

- Which input yields the smallest (best-case) running time?
- Which input yields the largest (worst-case) running time?
Running time T(n) of insertion-sort

- For 2 ≤ i ≤ n, let t_i = # times line 4 is executed. Then

  T(n) = 3 Σ_{i=2}^{n} t_i + 2n − 1

- Best-case running time (already-sorted input, so t_i = 1): 5n − 4
- Worst-case running time (reverse-sorted input, so t_i = i): 3n²/2 + 7n/2 − 4
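These counts can be checked empirically. The sketch below instruments a 0-indexed Python version with a step counter that follows the line-by-line accounting above (one unit per execution of lines 1–7, including failed loop tests); the function name and counter placement are illustrative, not part of the lecture:

```python
def insertion_sort_steps(A):
    """Sort A in place and return the number of primitive steps,
    counted line-by-line as in the lecture's accounting."""
    n = len(A)
    steps = 1                                # final (failing) test of the for loop
    for i in range(1, n):
        steps += 1                           # line 1: loop test
        key = A[i]; steps += 1               # line 2
        j = i - 1;  steps += 1               # line 3
        while True:
            steps += 1                       # line 4: while test (success or failure)
            if j >= 0 and A[j] > key:
                A[j + 1] = A[j]; steps += 1  # line 5
                j -= 1;          steps += 1  # line 6
            else:
                break
        A[j + 1] = key; steps += 1           # line 7
    return steps

n = 6
assert insertion_sort_steps(list(range(n))) == 5 * n - 4               # best case
assert insertion_sort_steps(list(range(n, 0, -1))) == (3*n*n + 7*n)//2 - 4  # worst case
```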
Worst-case analysis

Definition 2.
Worst-case running time: the largest possible running time of the algorithm over all inputs of a given size n.

Why worst-case analysis?
- It gives well-defined, computable bounds.
- Average-case analysis can be tricky: how do we generate a "random" instance?

The worst-case running time of insertion-sort is quadratic. Is insertion-sort efficient?
Efficiency of insertion-sort and the brute force solution

Compare to the brute force solution:
- At each step, generate a new permutation of the n integers.
- If sorted, stop and output the permutation.

Worst-case analysis: generate n! permutations. Is the brute force solution efficient?
- Efficiency relates to the performance of the algorithm as n grows.
- Stirling's approximation formula: n! ≈ (n/e)^n.
- For n = 10, generate about 3.67^10 ≥ 2^10 permutations.
- For n = 50, generate about 18.3^50 ≥ 2^200 permutations.
- For n = 100, generate about 36.7^100 ≥ 2^500 permutations!

⇒ The brute force solution is not efficient.
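The brute force solution sketched above can be written in a few lines (using itertools; this is an illustration, not the lecture's code), and the factorial blow-up is easy to see:

```python
import math
from itertools import permutations

def permutation_sort(A):
    """Brute force: try permutations of A until a sorted one is found."""
    for p in permutations(A):
        if all(p[k] <= p[k + 1] for k in range(len(p) - 1)):
            return list(p)

assert permutation_sort([9, 3, 2, 6, 8, 5]) == [2, 3, 5, 6, 8, 9]

# n! grows explosively: already for n = 20 there are more than 2 * 10^18
# permutations to try in the worst case
assert math.factorial(20) > 2 * 10**18
```

Insertion sort handles n = 20 in a few hundred steps; permutation_sort may need quintillions.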


Efficient algorithms – Attempt 1

Definition 3 (Attempt 1).
An algorithm is efficient if it achieves better worst-case performance than brute-force search.

Caveat: this fails to address the scaling properties of the algorithm; if the input size grows by a constant factor, we would like the running time T(n) of the algorithm to increase by a constant factor as well.

Polynomial running times: on an input of size n, T(n) is at most c · n^d for constants c, d > 0.
- Polynomial running times scale well!
- The smaller the exponent of the polynomial, the better.
Efficient algorithms

Definition 4.
An algorithm is efficient if it has a polynomial running time.

Caveat
- What about huge constants in front of the leading term, or large exponents?

However
- Low-degree polynomial running times exist for most problems that can be solved in polynomial time.
- Conversely, problems for which no polynomial-time algorithm is known tend to be very hard in practice.
- So we can distinguish between easy and hard problems.

Remark 1.
Today's big data: even low-degree polynomials might be too slow!
Are we done with sorting?

Insertion sort is efficient. Are we done with sorting?

1. Can we do better?
2. And what is better?
   - E.g., is T(n) = n² better than 3n²/2 + 7n/2 − 4?
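A quick numeric comparison suggests the answer: the two functions differ only by a constant factor, so neither is "better" in any scaling sense. (An illustrative sketch; t1 and t2 are hypothetical names for the two running times.)

```python
def t1(n):
    return n * n                      # a hypothetical algorithm with T(n) = n^2

def t2(n):
    return (3*n*n + 7*n) // 2 - 4     # insertion sort's worst-case count

# the ratio t1/t2 settles near 2/3 as n grows:
# both functions grow quadratically
for n in (100, 1000, 10000):
    assert 0.6 < t1(n) / t2(n) < 0.7
```

This is exactly why the next step is to compare running times by their rate of growth rather than exact step counts.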
Running time in terms of # primitive steps

To discuss this, we need a coarser classification of running times of algorithms; exact characterizations
- are too detailed;
- do not reveal similarities between running times in an immediate way as n grows large;
- are often meaningless: pseudocode steps will expand by a constant factor that depends on the hardware.
Asymptotic notation

A framework that will allow us to compare the rates of growth of different running times as the input size n grows.

- We will express the running time as a function of the number of primitive steps, which is a function of the size of the input n.
- To compare functions expressing running times, we will ignore their low-order terms and focus solely on the highest-order term.

Next: a faster algorithm for sorting, using the divide-and-conquer principle.