CSI 2110 Summary, Fall 2012
Analysis of Algorithms
Algorithm - a step by step procedure for solving a problem in a finite amount of time. Analyzing an
algorithm means determining its efficiency.
Primitive operations - low-level computations independent from the programming language that can be
identified in the pseudocode.
Big-Oh - given two functions f(n) and g(n), we say f(n) is O(g(n)) if and only if there are positive
constants c and n₀ such that f(n) ≤ c·g(n) for all n ≥ n₀. It means that an algorithm has the complexity of
AT MOST g(n).
Just replace every term by the highest power of n, add the coefficients, and you have your c (the coefficient) and
g(n) (the function).
Logarithms always in base 2 in this class unless otherwise stated
Always want the lowest possible bound, approximation should be as tight as possible
Drop lower order terms and constant factors
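Worked example (illustrative numbers): f(n) = 3n³ + 2n + 5 is O(n³), since 3n³ + 2n + 5 ≤ 3n³ + 2n³ + 5n³ = 10n³ for all n ≥ 1, i.e. c = 10 and n₀ = 1.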
Big-Omega - f(n) is Ω(g(n)) if there are positive constants c and n₀ such that f(n) ≥ c·g(n) for all n ≥ n₀. It means an algorithm has
the complexity of AT LEAST g(n).
Big-Theta - f(n) is Θ(g(n)) if it is both O(g(n)) and Ω(g(n)). It means the complexity IS EXACTLY g(n).
Extendable Arrays
Growth function is f(N) = 2N (double the array size each time it fills).
Regular push: 1
Special push: cost of creating the new array (2N) + copying the old elements into the new array
(length of old array, N) + 1
Phase: starts at creation of the new array and ends with the last element being pushed. The one pushed
after that is a special push, and starts a new phase.
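A minimal sketch of the doubling strategy described above; the class and method names are illustrative, not from the course code:

```java
// Extendable array that doubles its capacity on a special push.
public class ExtendableArray {
    private Object[] data = new Object[1];
    private int size = 0;

    public void push(Object e) {
        if (size == data.length) {            // special push: grow first
            Object[] bigger = new Object[2 * data.length];
            for (int i = 0; i < size; i++)    // copy old elements: O(n)
                bigger[i] = data[i];
            data = bigger;
        }
        data[size++] = e;                     // regular push: O(1)
    }
}
```

A regular push costs 1; a special push additionally pays for allocating the new array and copying the old elements, which is exactly what the phase analysis above accounts for.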
Complexities
Method             ArrayList  LinkedList  Unsorted Seq  Sorted Seq
size               O(1)       O(1)        O(1)          O(1)
isEmpty            O(1)       O(1)        O(1)          O(1)
get                O(1)       O(n)        -             -
replace            O(1)       O(n)        -             -
insert             O(n)       O(1)        O(1)          O(n)
remove             O(n)       O(1)        O(n)          O(n)
minKey/minElement  -          -           O(n)          O(1)
removeMin          -          -           O(n)          O(1)
Selection Sort
Using an external data structure - insert your elements into the first data structure, find the smallest
one, add it into your second data structure, and repeat. Takes O(n²) time.
In place (Not using an extra data structure) - search through the array, find the smallest, add it to the
front of the array. Repeat until it's sorted.
Complexity: Average and worst case: performs O(n²) operations regardless of input, since the search for
the minimum is always executed in O(n).
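A minimal in-place selection sort sketch in Java (assuming int arrays for simplicity):

```java
// In-place selection sort, ascending order.
public static void selectionSort(int[] a) {
    for (int i = 0; i < a.length - 1; i++) {
        int min = i;
        for (int j = i + 1; j < a.length; j++)       // find the smallest in a[i..]
            if (a[j] < a[min]) min = j;
        int tmp = a[i]; a[i] = a[min]; a[min] = tmp; // move it to the front
    }
}
```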
Insertion Sort
Using an external data structure - insert the elements of the first data structure into the other one,
one at a time, adding each element before or after the existing elements according to size. Then
removeMin repeatedly and add the results back into the old ADT.
In place - Take the second element and switch it with the first if needed. Take the third element and
switch it with the second, and then the first if need be, and so on.
Complexity: Average and worst case is O(n²).
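A minimal in-place insertion sort sketch under the same assumptions:

```java
// In-place insertion sort, ascending order.
public static void insertionSort(int[] a) {
    for (int i = 1; i < a.length; i++) {
        int key = a[i];                    // next element to place
        int j = i - 1;
        while (j >= 0 && a[j] > key) {     // shift larger elements right
            a[j + 1] = a[j];
            j--;
        }
        a[j + 1] = key;                    // drop into its sorted position
    }
}
```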
Trees
Graph - consists of vertices and edges.
Tree - a connected graph that contains no cycles.
Root - a node without a parent.
Internal node - a node with at least one child.
External node - a node without any children.
Ancestor - parent, grandparent, great-grandparent, etc.
Descendant - child, grandchild, great-grandchild, etc.
Subtree - a tree consisting of a node and its descendants.
Distance - number of edges between two nodes.
Binary Trees
Each node has at most two children.
Examples: decision trees (yes/no), arithmetic expressions.
Full binary tree - each node is either a leaf, or has two children.
In the book, children are completed with dummy nodes, and all trees are considered full.
Perfect binary tree - a full binary tree with all the leaves at the same level.
Complete binary tree - perfect until level h − 1, then one or more leaves at level h (filled from the left).
Properties of Height
Binary: log₂(n+1) − 1 ≤ h ≤ n − 1
Binary (Full): log₂(n+1) − 1 ≤ h ≤ (n−1)/2
Binary (Complete): h = ⌊log₂ n⌋ (integer part of log₂ n)
Binary (Perfect): h = log₂(n+1) − 1
Heaps
Removal -
Remove the top element
Replace with last key in the heap.
Begin the downheap.
Downheap
Compares the parent with the smallest child.
If the child is smaller, switch the two.
Keep going.
Stops when the key is smaller than or equal to the keys of both its children, or the bottom of the heap is
reached.
Insertion
Add key into the next available position.
Begin upheap.
Upheap
Similar to downheap.
Swap parent-child keys that are out of order.
Regular Heap Construction - We could insert the n items one at a time with a sequence of heap insertions,
taking ∑ᵢ O(log i) = O(n log n) time. But we can do better with bottom-up heap construction, which takes O(n).
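A minimal array-based min-heap sketch illustrating insertion with upheap and removal with downheap; the class and method names are illustrative:

```java
import java.util.ArrayList;

// Array-based min-heap: parent of index i is (i-1)/2, children are 2i+1, 2i+2.
public class MinHeap {
    private final ArrayList<Integer> a = new ArrayList<>(); // a.get(0) is the min

    public void insert(int key) {           // add at next free position, then upheap
        a.add(key);
        int i = a.size() - 1;
        while (i > 0 && a.get((i - 1) / 2) > a.get(i)) {  // parent larger: swap
            swap(i, (i - 1) / 2);
            i = (i - 1) / 2;
        }
    }

    public int removeMin() {                // assumes the heap is non-empty
        int min = a.get(0);
        a.set(0, a.get(a.size() - 1));      // replace root with last key
        a.remove(a.size() - 1);
        int i = 0;
        while (2 * i + 1 < a.size()) {      // downheap
            int c = 2 * i + 1;                                    // left child
            if (c + 1 < a.size() && a.get(c + 1) < a.get(c)) c++; // smallest child
            if (a.get(i) <= a.get(c)) break;                      // order restored
            swap(i, c);
            i = c;
        }
        return min;
    }

    private void swap(int i, int j) {
        int t = a.get(i); a.set(i, a.get(j)); a.set(j, t);
    }
}
```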
Unordered Sequence
Searching and removing take O(n) time
Inserting takes O(1) time.
Applications to log files (frequent insertions, rare searches and removals)
Binary Search Trees
Complexity:
o Worst case: O(n), where all keys are to the right or to the left.
o Best case: O(log n), where leaves are on the same level or on an adjacent level.
Insertion
Always insert at the bottom of the search tree, where a search for the key ends up, based on the correct order
If you're adding a duplicate key, always insert to the right, not to the left of the tree (see the sketch below).
Deletion
If it’s at the end, you can just remove it.
If it's not at the end, replace it with the next in the inorder traversal.
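A minimal (unbalanced) BST insertion sketch, with duplicates sent to the right as described above; names are illustrative:

```java
// Node of a binary search tree over int keys.
class Node {
    int key; Node left, right;
    Node(int k) { key = k; }
}

class BST {
    Node root;

    void insert(int key) {          // walk down to a null child, attach there
        if (root == null) { root = new Node(key); return; }
        Node cur = root;
        while (true) {
            if (key < cur.key) {
                if (cur.left == null) { cur.left = new Node(key); return; }
                cur = cur.left;
            } else {                // duplicates go to the right
                if (cur.right == null) { cur.right = new Node(key); return; }
                cur = cur.right;
            }
        }
    }
}
```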
AVL Trees
AVL trees are balanced.
They are binary search trees such that, for every internal node v of T, the heights of the children of v can
differ by at most 1.
The height of an AVL tree storing n keys is always O(log n).
Insertion
Balanced - if for every node v, the heights of v's children differ by at most 1.
If the tree becomes unbalanced, we need to rebalance.
Rebalance
o Identify three nodes (grandparent, parent, child) and the 4 subtrees attached to them.
o Find the node whose grandparent is unbalanced. This node (child) is x, the parent is y, and the
grandparent is z.
Choice of x is not unique
o Identify the four subtrees, left to right, as T1, T2, T3, and T4
o The first of x, y, z that comes in the inorder traversal is a, the second is b, and the third is c. Then b
becomes the root of the restructured subtree, with a and c as its children and T1–T4 reattached in order.
Removal
Remove the same way as in a binary search tree. However, this may cause an imbalance.
Complexity
Searching, inserting, and removing are all O(log n); that's what makes AVL trees so nice.
(2,4) Trees
A (2,4) tree is a multi-way search tree with the following properties:
o Node size property: every internal node has at most four children
o Depth property: all external nodes have the same depth
Can't have more than four children or fewer than two children.
Depending on the number of children, an internal node of a (2,4) tree is called a 2-node, 3-node,
or 4-node.
Searching in a (2,4) tree with n items takes O(log n) time
Min number of items: when all internal nodes have 1 key and 2 children: n = 2^h − 1
Maximum number of items: when all internal nodes have three keys and four children:
n = ∑_{i=0}^{h−1} 3·4^i = 4^h − 1
Insertion
Insert similar to a binary search tree. Insert at the end after (binary) searching where it goes.
May cause overflow, since you're only allowed three elements in one node.
o Take the third key, send it up to the parent, and make new nodes out of the first two keys (one
node) and the fourth key (a second node).
Insertion takes O(log n) time
Deletion
Replace the deleted key with the inorder successor
Can cause underflow; we might need to fuse nodes together to fix this. To handle an underflow at
node v with parent u, we consider two cases:
o Case 1: the adjacent siblings of v are 2-nodes.
Fusion operation: we merge v with an adjacent sibling w and move an item from u to
the merged node v'.
After a fusion, the underflow may propagate to the parent u.
o Case 2: an adjacent sibling w of v is a 3-node or a 4-node.
Transfer operation:
1. We move a child from w to v
2. We move an item from u to v
3. We move an item from w to u
After a transfer, no underflow occurs.
Deleting takes O(log n) time. Note that searching, inserting, and deleting all take O(log n) time.
Hash Tables
Problem A / Address Generation: Construction of the hash function h. It needs to be simple to calculate,
and must uniformly distribute the elements in the table. For all keys k, h(k) is the position of k in the
table. This position is an integer. Also, h(k₁) = h(k₂) if k₁ = k₂. Searching for a key and inserting a key
(all dictionary ADT operations) takes O(1) expected time. We have the function h(k) = h₂(h₁(k)), where we
have the two following sub-functions:
1. Hash code map
o They reinterpret a key as an integer. They need to:
i. Give the same result for the same key
ii. Provide a good "spread"
o Polynomial accumulation
We partition the bits of the key into a sequence of components of fixed length,
a₀, a₁, …, a_{n−1}.
Then we evaluate the polynomial p(z) = a₀ + a₁z + a₂z² + … + a_{n−1}z^{n−1} at a fixed value
z, ignoring overflows.
Especially suitable for strings.
o Examples
Memory address (we reinterpret the memory address of the key object as an integer,
this is the default hash code for all Java objects)
Integer cast (Reinterpret the bits of the key as an integer)
Component sum (We partition the bits of the key into components of fixed length and
we sum the components).
2. Compression map
o They take the output of the hash code and compress into the desired range.
o If the result of the hash code was the same, the result of the compression map should be
the same.
o Compression maps should maximize "spread" so as to minimize collisions
o Examples
Division: h₂(y) = y mod N, where N is usually chosen to be a prime number (number
theory).
Multiply Add Divide (MAD): h₂(y) = (ay + b) mod N, where a and b are nonnegative
integers such that a mod N ≠ 0.
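A small sketch combining a polynomial hash code with a MAD compression map; the constants (z = 31, a, b, N) are illustrative choices, not prescribed by the course:

```java
// Polynomial hash code + MAD compression, on String keys.
public class HashDemo {
    static final int N = 109;           // table size, a prime
    static final int A = 33, B = 7;     // MAD parameters, A mod N != 0

    // Polynomial accumulation over the characters of s at fixed z = 31,
    // ignoring overflows (Java int arithmetic wraps around).
    static int hashCode(String s) {
        int h = 0;
        for (int i = 0; i < s.length(); i++)
            h = 31 * h + s.charAt(i);
        return h;
    }

    // MAD compression map: (a*y + b) mod N, folded into [0, N-1].
    static int compress(int y) {
        return Math.floorMod(A * y + B, N);
    }

    public static void main(String[] args) {
        System.out.println(compress(hashCode("stack")));  // a slot in [0, 108]
    }
}
```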
Problem B / Collision Resolution: What strategy do we use if two keys map to the same location i?
Load factor of a Hash table: λ = n/N, where n is the number of elements and N is the number of cells. The
smaller the load factor, the better.
Linear Probing: We have h(k, i) = (h(k) + i) mod N, for i = 0, 1, …, N − 1. Consider a hash table A that uses linear probing.
In order to search, we do the following:
get(k): We start at cell h(k), and we probe consecutive locations until one of the
following occurs:
o An item with key k is found
o An empty cell is found
o N cells have been unsuccessfully probed
To handle insertions and deletions, we introduce a special object, called AVAILABLE, which
replaces deleted elements.
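A minimal linear-probing lookup sketch with an AVAILABLE sentinel; the Entry layout and names are illustrative:

```java
// One cell of an open-addressing hash table.
class Entry {
    int key; Object value;
    Entry(int k, Object v) { key = k; value = v; }
}

class ProbingTable {
    static final Entry AVAILABLE = new Entry(-1, null); // marks deleted cells
    Entry[] table = new Entry[109];                     // N = 109 cells

    Object get(int k) {
        int start = Math.floorMod(Integer.hashCode(k), table.length);
        for (int i = 0; i < table.length; i++) {        // at most N probes
            int cell = (start + i) % table.length;
            Entry e = table[cell];
            if (e == null) return null;                 // empty cell: not present
            if (e != AVAILABLE && e.key == k) return e.value; // found
            // AVAILABLE or a different key: keep probing
        }
        return null;                                    // N cells probed
    }
}
```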
Quadratic Probing: We have h(k, i) = (h(k) + i²) mod N. The problem with this is that i² mod N is harder
to calculate and the probe sequence only visits half of the table, but it's not a big deal. Similar problem: you avoid linear
clustering, but every key that's mapped to the same cell will follow the same probe path. There's a more
distributed form of clustering, which should also be avoided, called secondary clustering.
Bubble Sort
You literally bubble up the largest element. Move from the front to the end, bubbling the largest
value to the end using pairwise comparisons and swapping. Once you go through the whole array,
you start from the first element again and go through it the same way.
In order to detect that an array is already sorted so we don't have to go through it again
unnecessarily, we can use a boolean flag. If no swaps occurred, we know that the collection is
already sorted. The flag needs to be reset after each "bubble up".
Complexity is O(n²).
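A minimal bubble sort sketch with the early-exit flag described above:

```java
// Bubble sort with a flag that stops early once a pass makes no swaps.
public static void bubbleSort(int[] a) {
    boolean swapped = true;
    for (int end = a.length - 1; end > 0 && swapped; end--) {
        swapped = false;                       // reset before each bubble-up pass
        for (int i = 0; i < end; i++) {
            if (a[i] > a[i + 1]) {             // pairwise compare and swap
                int t = a[i]; a[i] = a[i + 1]; a[i + 1] = t;
                swapped = true;
            }
        }
    }                                          // no swaps in a pass: already sorted
}
```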
Recursive Sorts
Divide and Conquer paradigm:
Divide: divide one large problem into two smaller problems of the same type.
Recur: solve the subproblems.
Conquer: combine the two solutions into a solution to the larger problem.
Merge Sort
Merge sort on an input sequence S with n elements consists of three steps. It is based on the divide-and-
conquer paradigm:
Divide: partition S into two sequences S1 and S2 of about n/2 elements each.
Recur: recursively sort S1 and S2.
Conquer: merge S1 and S2 into a unique sorted sequence.
The conquer step merges the two sorted sequences S1 and S2 into one sorted sequence by repeatedly comparing
the lowest element of each of S1 and S2 and inserting whichever is smaller. Merging two sorted sequences,
each with n/2 elements and implemented by means of a doubly linked list, takes O(n) time.
The execution is usually depicted in a binary-tree style.
The height h of the merge-sort tree is O(log n), since at each recursive call we divide the sequence in
half. The overall amount of work done at the nodes of depth i is O(n), since we partition and merge
2^i sequences of size n/2^i, and we make 2^(i+1) recursive calls. From this we get:
Complexity: O(n log n)
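A minimal top-down merge sort sketch on int arrays (copying subarrays rather than using linked lists, for brevity):

```java
import java.util.Arrays;

// Divide-and-conquer merge sort; returns a new sorted array.
public class MergeSortDemo {
    public static int[] mergeSort(int[] s) {
        if (s.length <= 1) return s;                            // base case
        int mid = s.length / 2;                                 // divide
        int[] s1 = mergeSort(Arrays.copyOfRange(s, 0, mid));    // recur
        int[] s2 = mergeSort(Arrays.copyOfRange(s, mid, s.length));
        return merge(s1, s2);                                   // conquer
    }

    // Merge two sorted arrays by repeatedly taking the smaller front element.
    static int[] merge(int[] a, int[] b) {
        int[] out = new int[a.length + b.length];
        int i = 0, j = 0, k = 0;
        while (i < a.length && j < b.length)
            out[k++] = (a[i] <= b[j]) ? a[i++] : b[j++];
        while (i < a.length) out[k++] = a[i++];
        while (j < b.length) out[k++] = b[j++];
        return out;
    }
}
```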
Quick Sort
Quick sort is also based on the divide-and-conquer paradigm.
Divide: pick an element x, called the pivot, and partition S into
o L, elements less than x
o E, elements equal to x
o G, elements greater than x
Recur: sort L and G
Conquer: join L, E, and G.
Pivot can always be chosen randomly, or we can decide always to choose the first element of the array,
or the last.
Complexity:
o Worst case: O(n²)
o Average case: O(n log n)
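A minimal quick sort sketch using the L/E/G three-way partition above, with the first element as the pivot:

```java
import java.util.ArrayList;
import java.util.List;

// Quick sort by three-way partition into L (less), E (equal), G (greater).
public class QuickSortDemo {
    public static List<Integer> quickSort(List<Integer> s) {
        if (s.size() <= 1) return s;
        int pivot = s.get(0);                   // could also be chosen at random
        List<Integer> l = new ArrayList<>(), e = new ArrayList<>(), g = new ArrayList<>();
        for (int x : s) {                       // divide into L, E, G
            if (x < pivot) l.add(x);
            else if (x == pivot) e.add(x);
            else g.add(x);
        }
        List<Integer> out = new ArrayList<>(quickSort(l));  // recur on L and G
        out.addAll(e);                          // conquer: join L, E, G
        out.addAll(quickSort(g));
        return out;
    }
}
```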
Summary of Sorts
Sort       Complexity
Selection  O(n²)
Insertion  O(n²)
Bubble     O(n²)
Merge      O(n log n)
Quick      O(n log n) average, O(n²) worst
Radix-Sort
Crucial point of this whole idea is the stable sorting algorithm, which is a sorting algorithm that
preserves the relative order of items with identical keys.
Question: the best sorts that we have seen so far have been O(n log n), and there is no way to beat
that unless we're under certain circumstances, in which case we can reach O(n).
Bucket Sort
Let S be a sequence of n (key, element) items with keys in the range [0, N − 1]. Bucket sort uses the
keys as indices to an auxiliary array B of sequences (buckets).
Phase 1: Empty sequence S by moving each item (k, o) into its bucket B[k].
Phase 2: For i = 0, …, N − 1, move the items of bucket B[i] to the end of sequence S.
Takes O(n + N) time.
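A minimal bucket sort sketch; for brevity it stores bare integer keys rather than (key, element) pairs:

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Bucket sort for integer keys in the range [0, N-1].
public class BucketSortDemo {
    public static void bucketSort(Deque<Integer> s, int N) {
        @SuppressWarnings("unchecked")
        Deque<Integer>[] buckets = new ArrayDeque[N];
        for (int i = 0; i < N; i++) buckets[i] = new ArrayDeque<>();
        while (!s.isEmpty()) {              // Phase 1: empty S into the buckets
            int k = s.removeFirst();
            buckets[k].addLast(k);
        }
        for (int i = 0; i < N; i++)         // Phase 2: concatenate buckets back
            while (!buckets[i].isEmpty())
                s.addLast(buckets[i].removeFirst());
    }
}
```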
Lexicographic Order
A d-tuple is a sequence of d keys (k1, k2, …, kd), where ki is said to be the i-th dimension of the tuple.
The lexicographic order of two d-tuples is recursively defined as
(x1, x2, …, xd) < (y1, y2, …, yd) ⇔ x1 < y1 ∨ (x1 = y1 ∧ (x2, …, xd) < (y2, …, yd)), i.e.
the tuples are compared by the first dimension, then the second, etc.
Lexicographic sort: Let Ci be the comparator that compares two tuples by their i-th dimension, i.e.
Ci says x < y if xi < yi. Let stableSort(S, C) be any stable sorting
algorithm that uses comparator C. Lexicographic sort sorts a sequence of d-tuples in lexicographic
order by executing algorithm stableSort d times, once per dimension.
You do it by starting from dimension d, putting those in order, then moving to dimension d − 1 and putting
them in order, then continuing this way until you reach dimension 1; put that in order, and you're
done.
Lexicographic sort runs in O(d·T(n)), where T(n) is the running time of the stable-sort algorithm.
Radix Sort Variation 1: This one uses bucket-sort as the stable sorting algorithm. Applicable to
tuples where the keys in each dimension are integers in the range [0, N − 1].
This one runs in O(d(n + N)) time.
Radix Sort Variation 2: Consider a sequence of n b-bit integers. We represent each
element as a b-tuple of integers in the range [0, 1] and apply radix sort with N = 2. It sorts Java integers
(32 bits) in linear time.
This one runs in O(bn) time.
Radix Sort Variation 3: The keys are integers in the range [0, N^d − 1]. We represent a key as a d-tuple
of digits in the range [0, N − 1] and apply variation 1, i.e. write it in base N notation.
This means write a number in this notation: (a_{d−1}, …, a1, a0), where the ai are the coefficients of the
following equation:
x = a_{d−1}·N^{d−1} + … + a1·N + a0, where x is the number you're putting in base N.
Examples
o If N = 10, write 345 as (3, 4, 5), since 345 = 3·10² + 4·10 + 5
o If N = 2, write 13 as (1, 1, 0, 1), since 13 = 1·2³ + 1·2² + 0·2 + 1
This one runs in O(d(n + N)) time.
Graph Traversals
Subgraphs
Subgraph: A subgraph S of a graph G is a graph such that the vertices of S are a subset of the vertices of G
and the edges of S are a subset of the edges of G.
Spanning subgraph: A subgraph that contains all the vertices of G.
Connected: When there is a path between every pair of vertices.
Connected component: A maximal connected subgraph of G.
(Free) Tree: An undirected graph T such that T is connected and T has no cycles.
Forest: A collection of trees. The connected components of a forest are trees.
Spanning tree: A spanning subgraph that is a tree.
Spanning forest: A spanning subgraph that is a forest.
Traversal: A traversal of a graph G visits all vertices and edges of G, determines whether G is connected,
computes the connected components of G, computes a spanning forest of G, and builds a spanning tree in a
connected graph.
With a stack: Start at a vertex, add it to your visited set V. Push all its incident edges onto the stack, then
pop the first one. Add the vertex it brings you to to V. Push that vertex's incident edges, then pop, visit
the vertex you reach if it hasn't been visited yet, push its incident edges, etc., until there are no edges left in the stack.
With recursion: DFS(v): mark v visited; for all vertices w that are adjacent to v, if w hasn't been
visited yet, DFS(w).
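A minimal recursive DFS sketch on an adjacency-list graph; the adjacency layout is illustrative:

```java
import java.util.List;

// Recursive DFS where adj.get(v) lists the neighbours of vertex v.
public class DFSDemo {
    static void dfs(List<List<Integer>> adj, boolean[] visited, int v) {
        visited[v] = true;                   // mark v visited
        for (int w : adj.get(v))             // for all vertices adjacent to v
            if (!visited[w])
                dfs(adj, visited, w);        // visit w if not yet visited
    }
}
```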
Properties of DFS:
DFS(G, v) visits all the vertices and edges in the connected component of v
The discovery edges labeled by DFS(G, v) form a spanning tree of the connected component of
v.
Setting/getting a vertex/edge label takes O(1) time.
Each vertex is labeled twice, once as unexplored and once as visited.
Each edge is labeled twice, once as unexplored and once as discovery (or back).
Complexity:
o Adjacency List
Average case is O(n + m)
Worst case is when m = n(n − 1)/2, so O(n²).
o Adjacency Matrix: O(n²)
Applications:
Path Finding: We can specialize the DFS algorithm to find a path between two given vertices u and
z using the template method pattern. We call DFS(G, u) with u as the starting vertex, using a
stack S to keep track of the path between the start vertex and the current vertex. As soon as we
reach z, we return the path, which is the contents of stack S.
Cycle Finding: We can specialize the DFS algorithm to find a simple cycle using the template
method pattern. We use a stack S to keep track of the path between the start vertex and the
current vertex. As soon as a back edge (v, w) is encountered, we return the cycle as the portion
of the stack from the top to vertex w.
Breadth-First Search
BFS is a graph traversal that can be further extended to solve other problems and, on a graph with n
vertices and m edges, takes O(n + m) time (with an adjacency list implementation).
Idea: Visit a vertex, then visit all unvisited vertices that are adjacent to it before visiting a vertex which is
two steps away from it.
With a queue: Add your starting vertex to the visited set. Enqueue its adjacent vertices, then dequeue and
visit that vertex if it hasn't been visited already. Enqueue its adjacent vertices. Dequeue the next one,
visit, enqueue adjacent… and so on until everything has been visited.
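A minimal BFS sketch in the same adjacency-list style; vertices are marked when enqueued so each is visited once:

```java
import java.util.ArrayDeque;
import java.util.List;
import java.util.Queue;

// Queue-based BFS from a start vertex.
public class BFSDemo {
    static void bfs(List<List<Integer>> adj, int start) {
        boolean[] visited = new boolean[adj.size()];
        Queue<Integer> q = new ArrayDeque<>();
        visited[start] = true;
        q.add(start);
        while (!q.isEmpty()) {
            int v = q.remove();              // visit the next vertex
            for (int w : adj.get(v))         // enqueue unvisited neighbours
                if (!visited[w]) {
                    visited[w] = true;
                    q.add(w);
                }
        }
    }
}
```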
Properties of BFS:
Notation: G_s is the connected component of s.
BFS(G, s) visits all the vertices and edges of G_s
The discovery edges labeled by BFS(G, s) form a spanning tree T_s of G_s
For each vertex v in level L_i, the path of T_s from s to v has i edges, and every path from s to v in
G_s has at least i edges.
Setting/getting a vertex/edge label takes O(1) time.
Each vertex is labeled twice (once as unexplored, once as visited).
Each edge is labeled twice (once as unexplored, once as discovery or cross)
Runs in O(n + m) time given the graph is represented by an adjacency list.
Applications: Using the template method pattern, we can specialize the BFS traversal of a graph G to
solve the following problems in O(n+m) time:
Compute connected components of G
Compute spanning forest of G
Find a simple cycle in G or report that G is a forest
Given two vertices of G, find a path between them with the minimum number of edges, or report
that no such path exists.
DFS vs BFS
Application                                     DFS        BFS
Spanning forest                                  X          X
Connected components                             X          X
Paths                                            X          X
Cycles                                           X          X
Shortest paths                                              X
Biconnected components (if the removal of any    X
single vertex, and all edges incident on that
vertex, cannot disconnect the graph)
Edges that lead to an already visited vertex     Back edge  Cross edge
Shortest Path
A shortest path is a path of minimum total edge weight between the starting vertex and another vertex.
Properties:
A subpath of a shortest path is itself a shortest path.
There is a tree of shortest paths from a start vertex to all other vertices.
Dijkstra's Algorithm
The distance of a vertex v from a vertex s is the length of a shortest path between s and v.
Dijkstra's algorithm computes the distances of all the vertices from a given start vertex s.
Assumptions: the graph is connected, the edges are undirected, and the edge weights are nonnegative.
We grow a cloud of vertices, beginning with s and eventually covering all the vertices. At each
vertex v, we store d(v), which is the best distance of v from s in the subgraph consisting of the
cloud and its adjacent vertices.
At each step, we add to the cloud the vertex u outside the cloud with the smallest distance label,
and we update the labels of the vertices adjacent to u, i.e. we change the labels if there's a better
way to get to those vertices through the newly augmented cloud.
We use a priority queue to store the vertices not in the cloud, where d(v) is the key of a vertex v
in the priority queue.
Using a heap: Add the starting vertex to the cloud (its distance in the priority queue is 0). Then removeMin
and update. Whatever removeMin returned is the new vertex in your cloud.
Updating means removing old keys and putting in new ones. Again, removeMin, add to cloud,
update, etc., until the heap is empty.
Complexity:
o In a heap, it is O((n + m) log n).
o In an unsorted sequence, it is O(n²)
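A minimal Dijkstra sketch using java.util.PriorityQueue; since that queue has no decrease-key operation, updated (distance, vertex) pairs are re-inserted and stale entries skipped — a common workaround rather than the key-update scheme described above. The Edge record (Java 16+) and the adjacency layout are illustrative:

```java
import java.util.Arrays;
import java.util.List;
import java.util.PriorityQueue;

// Dijkstra's algorithm over an adjacency list of weighted edges.
public class DijkstraDemo {
    record Edge(int to, int weight) {}

    static int[] dijkstra(List<List<Edge>> adj, int s) {
        int[] d = new int[adj.size()];
        Arrays.fill(d, Integer.MAX_VALUE);    // all labels start at infinity
        d[s] = 0;
        PriorityQueue<int[]> pq =             // entries are {distance, vertex}
            new PriorityQueue<>((a, b) -> Integer.compare(a[0], b[0]));
        pq.add(new int[] {0, s});
        while (!pq.isEmpty()) {
            int[] top = pq.remove();          // vertex with smallest label
            int dist = top[0], u = top[1];
            if (dist > d[u]) continue;        // stale entry: skip
            for (Edge e : adj.get(u))         // relax edges out of u
                if (d[u] + e.weight() < d[e.to()]) {
                    d[e.to()] = d[u] + e.weight();
                    pq.add(new int[] {d[e.to()], e.to()});
                }
        }
        return d;
    }
}
```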
Prim-Jarnik Algorithm
We assume the graph is connected.
We pick an arbitrary vertex s, and we grow the minimum spanning tree as a cloud of vertices,
starting from s. We store with each vertex v a label d(v), which is the smallest weight of an edge
connecting v to any vertex in the cloud (not the distance from the root as in Dijkstra).
At each step, we add to the cloud the vertex u outside the cloud with the smallest distance label.
We update the labels of the vertices adjacent to u.
We use a priority queue whose keys are labels, and whose elements are vertex-edge pairs.
Key: distance, Element: vertex. Any vertex can be the starting vertex. We still initialize all the d(v)
values to infinity, and we also initialize P(v) (the edge associated with v) to null. It returns the
minimum spanning tree T.
It is an application of the cycle property.
Complexity is O((n + m) log n).
Kruskal's Algorithm
Each vertex is initially stored as its own cluster.
At each iteration, the minimum weight edge is added to the spanning tree if it joins two distinct
clusters.
The algorithm ends when all the vertices are in the same cluster.
Application of the partition property.
A priority queue stores the edges outside the cloud. Key: weight, Element: edge. At the end of the
algorithm, we are left with one cloud that encompasses the minimum spanning tree, and with a
tree which is our minimum spanning tree.
Essentially: start with the edge with the lowest weight, add it to the tree. Continue with the next
lowest weight, and add it to the tree (if it does not form a cycle with existing edges of the tree).
Continue until you've gone through all the edges. The resulting tree is your minimum spanning
tree.
Complexity is O(m log n)
Pattern Matching
Brute-Force
Compares the pattern P with the text T for each possible shift of P relative to T, until either a
match is found or all placements of the pattern have been tried.
Compare, shift over one, compare, shift over one…
Worst case: highly repetitive strings, e.g. T = aaaa…a and P = aaab.
Complexity O(nm), where n is the size of the text and m is the size of the pattern that we're
trying to find.
Boyer-Moore
Based on two heuristics
o Looking-glass heuristic: Compare P with a subsequence of T moving backwards.
o Character-jump heuristic: When a mismatch occurs at T[i] = c, where c is the character in T
at which the mismatch occurs (with P[j] ≠ c):
If P contains c, shift P to align the last occurrence of c in P with T[i].
If P does not contain c, shift P completely past T[i] to align P[0] with T[i + 1].
If a match occurs, compare the previous two characters. If they match, keep comparing
right to left. If a mismatch occurs, apply the character-jump heuristic above.
Worst case: e.g. T = aaaa…a and P = baaa.
Complexity is O(nm + |Σ|), where |Σ| is the size of the alphabet you're using.
KMP Algorithm
Compares the pattern to the text left to right, but shifts the pattern more intelligently than
brute force.
Compare each letter of P, left to right, to T. When you find a mismatch, look at the word to the
left of the mismatch. Find the largest prefix of it that is also a suffix, and shift P by
matching up the suffix with the prefix.
Failure function: F(j) is defined as the size of the largest prefix of P[0..j] that is also a suffix of
P[1..j]. Usually organized into a table. It is computed in O(m) time.
The complexity of this algorithm is O(n + m).
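A minimal sketch of computing the KMP failure function as a table:

```java
// f[j] = length of the longest prefix of P[0..j] that is also a suffix of P[1..j].
public static int[] failureFunction(String p) {
    int m = p.length();
    int[] f = new int[m];                  // f[0] = 0
    int j = 1, k = 0;                      // k = length of current matched prefix
    while (j < m) {
        if (p.charAt(j) == p.charAt(k)) {  // prefix extends by one
            f[j] = k + 1;
            j++; k++;
        } else if (k > 0) {
            k = f[k - 1];                  // fall back to a shorter prefix
        } else {
            f[j] = 0;                      // no prefix matches here
            j++;
        }
    }
    return f;
}
```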