4th_Sem_DAA_Module_4

The document discusses various string matching algorithms, including the Naive pattern searching, Rabin-Karp, and Knuth-Morris-Pratt (KMP) algorithms, detailing their methodologies and complexities. It also briefly covers problems like the N-Queen problem, Hamiltonian Circuit problem, and Subset Sum problem, explaining their approaches and solutions. Each algorithm and problem is illustrated with examples to clarify their workings and applications.

Uploaded by

Subhransu Behera

Module – IV :

String Matching Algorithms :

What is String matching ?
Ans: Finding all occurrences of a pattern in a given text(or body of text).

• Naive pattern searching is the simplest of the pattern searching algorithms. It compares the pattern against the text at every possible starting position.

• The naive algorithm is an exact string matching algorithm, i.e. it finds one or all exact occurrences of a pattern in a text.
• It is suitable for smaller texts. It needs no pre-processing phase and no extra space: the text is scanned once, and the pattern is tested at each position.

• The naive approach tests every possible placement of the pattern P[1..m] relative to the text T[1..n]. It tries the shifts s = 0, 1, ..., n-m in succession and, for each shift s, compares T[s+1..s+m] with P[1..m]. It returns all the valid shifts found.

NAIVE-STRING-MATCHER (T, P)
1. n ← length [T]
2. m ← length [P]
3. for s ← 0 to n - m
4. do if P [1..m] = T [s + 1..s + m]
5. then print "Pattern occurs with shift" s

Analysis: The for loop on lines 3 to 5 executes n-m+1 times (the last window must still contain m characters), and each iteration makes up to m character comparisons. So the total complexity is O((n-m+1)*m).

• The test on line 4 determines whether the current shift is valid or not; this test involves an implicit loop that checks corresponding character positions until all positions match or a mismatch is found.

• Line 5 prints out each valid shift s.
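The pseudocode above can be sketched in Python as follows. This is a minimal illustration rather than a reference implementation; the function name `naive_search` is chosen here for illustration, and shifts are reported 0-based, as in the pseudocode.

```python
def naive_search(text, pattern):
    """Return every shift s at which pattern occurs in text (0-based)."""
    n, m = len(text), len(pattern)
    shifts = []
    # Try each shift s = 0 .. n-m; the range is empty when m > n.
    for s in range(n - m + 1):
        # The slice comparison is the implicit character loop of line 4.
        if text[s:s + m] == pattern:
            shifts.append(s)
    return shifts


print(naive_search("ABAAABCDBBABCDDEBCABC", "ABC"))  # [4, 10, 18]
```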

Working of Naive String Matching

The naive-string-matching procedure can be interpreted graphically as sliding a “template” containing the pattern over the text, noting for which shifts all of the characters on the template equal the corresponding characters in the text.

Example 1: (figure not reproduced)

Example 2:

Input: txt[] = "THIS IS STRING MATCHING ALGORITHM"

pat[] = "STRING"
Output: Pattern found at position 8 (positions are counted from 0)

Example 3:
Input:
Main String: “ABAAABCDBBABCDDEBCABC”
pattern: “ABC”
Output:
Pattern found at position: 4
Pattern found at position: 10
Pattern found at position: 18

What is the best case?

→The best case occurs when the first character of the pattern is not present in text at all.

txt[] = "BBACCAADDEE";

pat[] = "HBB";

The number of comparisons in best case is O(n).

What is the worst case ?

→The worst case of Naive Pattern Searching occurs in following scenarios.
1) When all characters of the text and pattern are same.

txt[] = "DDDDDDDDDDDD";

pat[] = "DDDDD";

2) Worst case also occurs when only the last character is different.

txt[] = "VVVVVVVVVVVVK";

pat[] = "VVVK";
The number of comparisons in the worst case is O(m*(n-m+1)).
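To make the best and worst cases concrete, the sketch below counts the character comparisons the naive method performs on the example strings above. The helper name `count_comparisons` is hypothetical and exists only for this illustration.

```python
def count_comparisons(text, pattern):
    """Count character comparisons made by the naive matcher."""
    n, m = len(text), len(pattern)
    comparisons = 0
    for s in range(n - m + 1):
        for j in range(m):
            comparisons += 1
            if text[s + j] != pattern[j]:
                break  # mismatch: abandon this shift
    return comparisons


# Best case: the first pattern character never matches,
# so each of the n-m+1 shifts costs a single comparison.
best = count_comparisons("BBACCAADDEE", "HBB")

# Worst cases: every shift needs all m comparisons,
# giving m*(n-m+1) comparisons in total.
worst_same = count_comparisons("DDDDDDDDDDDD", "DDDDD")
worst_last = count_comparisons("VVVVVVVVVVVVK", "VVVK")
print(best, worst_same, worst_last)
```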

Problem with Naive Algorithm

Suppose T=cabababcd and P=ababc

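• Whenever a mismatch occurs after several characters have matched, the naive algorithm goes back in the text and restarts the comparison from the character that follows the starting position of the failed attempt, so characters that were already matched are compared again.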

Rabin Karp Algorithm :

The Rabin-Karp algorithm is a pattern-matching algorithm that uses hashing to compare the pattern with substrings of the text. Here, the term hashing refers to the process of mapping a larger input value to a smaller output value, called the hash value. Comparing hash values first avoids most unnecessary character-by-character comparisons. As a result, the Rabin-Karp algorithm runs in O(n + m) time on average, where n is the length of the text and m is the length of the pattern; in the worst case, when many hash values collide, it degrades to O(n*m).

How does Rabin Karp Algorithm work?

• The Rabin-Karp algorithm searches for the pattern by sliding a window over the text one position at a time. Instead of comparing all characters at every position, it computes the hash value of the pattern and compares it with the hash value of each substring of the text that has the same length as the pattern.
• If the hash values match, then there is a possibility that the pattern and the substring are equal, and we verify it by comparing them character by character. If the hash values do not match, we skip the substring and move on to the next one. In the next section, we will understand how to calculate hash values.
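Calculating hash value in Rabin Karp Algorithm
The steps to calculate hash values are as follows −

Step 1: Assign modulus and a base value

Suppose we have a text Txt = "DAACABCDBA" and a pattern Ptrn = "CAB". We first assign numerical values (ranks) to the characters of the text. In this illustration the rank of a character is simply its position in the text: the leftmost character has rank 1 and the rightmost has rank 10. A practical implementation would instead derive the value from the character itself (for example its character code), so that equal substrings always produce equal hash values. We use base b = 10 (the number of characters in the text) and modulus q = 11 for our hash function (written q so that it is not confused with the pattern length m). The modulus should be a prime number, which helps reduce hash collisions; taking every result modulo q also keeps the values small and avoids overflow.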

Step 2: Calculate hash value of Pattern

The equation to calculate the hash value of the pattern is as follows −

hash value(Ptrn) = Σ (r * b^(l-i-1)) mod 11, summed over i = 0, 1, ..., l-1

where, r: ranking of the character at index i

l: length of Pattern

i: index of character within the pattern

Therefore, the hash value of Ptrn is −

h(Ptrn) = ((4 * 10^2) + (5 * 10^1) + (6 * 10^0)) mod 11

= 456 mod 11
= 5

Step 3: Calculate hash value of first Text window

Start calculating the hash value of each window of the text by sliding over it. We start with the first substring, as shown below −

h(DAA) = ((1 * 10^2) + (2 * 10^1) + (3 * 10^0)) mod 11

= 123 mod 11
= 2

Now, compare the hash value of the pattern with that of the substring. If they match, check whether the characters also match. If they do, we have found a match; otherwise, move to the next window.
In the above example, the hash values did not match. Hence, we move to the next character.
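The arithmetic in Steps 2 and 3 can be checked with a short sketch. It evaluates the polynomial hash with Horner's rule, taking the ranks, base 10 and modulus 11 from the example; the function name `poly_hash` is an assumption made for this illustration.

```python
def poly_hash(ranks, base, modulus):
    """Evaluate sum(r * base**(l-i-1)) mod modulus using Horner's rule."""
    value = 0
    for r in ranks:
        # Shift the running value one "digit" left, then add the next rank.
        value = (value * base + r) % modulus
    return value


print(poly_hash([4, 5, 6], 10, 11))  # pattern "CAB": 456 mod 11 = 5
print(poly_hash([1, 2, 3], 10, 11))  # window "DAA":  123 mod 11 = 2
```

Reducing modulo 11 after every step gives the same result as reducing once at the end, but keeps intermediate values small.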

Step 4: Updating the hash value

Now we remove the leading character of the window and bring in the next character of the text. The hash value is updated in the same step, from the previous hash rather than from scratch, and the process repeats until we find a match.
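The four steps can be combined into a rolling-hash search, sketched below. Unlike the worked example, this sketch hashes character codes (`ord`) rather than text positions, and the base 256 and prime modulus 101 are arbitrary choices made for illustration, not values from the notes.

```python
def rabin_karp(text, pattern, base=256, modulus=101):
    """Return all 0-based positions where pattern occurs in text."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    # Weight of the leading character of a window: base**(m-1) mod modulus.
    high = pow(base, m - 1, modulus)
    p_hash = w_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % modulus
        w_hash = (w_hash * base + ord(text[i])) % modulus
    matches = []
    for s in range(n - m + 1):
        # Equal hashes only suggest a match; confirm character by character.
        if p_hash == w_hash and text[s:s + m] == pattern:
            matches.append(s)
        if s < n - m:
            # Drop text[s], shift left by one digit, append text[s + m].
            w_hash = ((w_hash - ord(text[s]) * high) * base
                      + ord(text[s + m])) % modulus
    return matches


print(rabin_karp("DAACABCDBA", "CAB"))  # [3]
```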

Knuth Morris Pratt String Matching Algorithm :

The KMP algorithm solves the pattern matching problem, which is the task of finding all the occurrences of a given pattern in a text. It is very useful when it comes to finding multiple occurrences of a pattern. For instance, if the text is "aabbaaccaabbaadde" and the pattern is "aabbaa", then the pattern occurs twice in the text, at indices 0 and 8.

The naive solution to this problem is to compare the pattern with every possible
substring of the text, starting from the leftmost position and moving rightwards. This
takes O(n*m) time, where 'n' is the length of the text and 'm' is the length of the
pattern.

When we work with long text documents, the brute force and naive approaches may
result in redundant comparisons. To avoid such redundancy, Knuth, Morris, and Pratt
developed a linear sequence-matching algorithm named the KMP pattern matching
algorithm. It is also referred to as Knuth Morris Pratt pattern matching algorithm.

How does KMP Algorithm work?
The KMP algorithm scans the text from left to right. It uses a prefix function to avoid unnecessary comparisons while searching for the pattern. For each position in the pattern, this function stores the length of the longest proper prefix of the pattern that is also a suffix of the part ending at that position; this is known as the LPS value. The following steps are involved in the KMP algorithm −

• Define a prefix function.

• Slide the pattern over the text for comparison.

• If all the characters match, we have found a match.

• If not, use the prefix function to skip the unnecessary comparisons. Look up the LPS value of the pattern character just before the mismatched one. If it is 0, restart the comparison from index 0 of the pattern, keeping the current position in the text (or moving to the next text character if the mismatch was already at index 0). If it is greater than 0, resume the comparison from the pattern index equal to that LPS value.
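The prefix function from the first step can be sketched as below. The name `compute_lps` is chosen for illustration; the table it returns holds, for each index i, the length of the longest proper prefix of the pattern that is also a suffix of pattern[0..i].

```python
def compute_lps(pattern):
    """Build the LPS (longest proper prefix that is also a suffix) table."""
    lps = [0] * len(pattern)
    k = 0  # length of the current matched prefix
    for i in range(1, len(pattern)):
        # On a mismatch, fall back to the next shorter candidate prefix.
        while k > 0 and pattern[i] != pattern[k]:
            k = lps[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        lps[i] = k
    return lps


print(compute_lps("AAABC"))  # [0, 1, 2, 0, 0]
print(compute_lps("ababc"))  # [0, 0, 1, 2, 0]
```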

The KMP algorithm takes O(n + m) time and O(m) space. It is faster than the naive solution because it skips the redundant comparisons: it never moves backwards in the text, and the search makes at most 2n character comparisons in total.

Let's understand the input-output scenario of a pattern matching problem with an example −

Input:

main String: "AAAABCAAAABCBAAAABC"

pattern: "AAABC"

Output:

Pattern found at position: 1

Pattern found at position: 7

Pattern found at position: 14
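The input-output scenario above can be reproduced with the following sketch of the complete search. It is one common way to write KMP, not necessarily the formulation the notes have in mind; `kmp_search` is a name chosen for illustration, and the LPS table is built inline so that the block is self-contained.

```python
def kmp_search(text, pattern):
    """Return all 0-based positions where pattern occurs in text."""
    m = len(pattern)
    if m == 0:
        return []
    # Build the LPS table for the pattern.
    lps = [0] * m
    k = 0
    for i in range(1, m):
        while k > 0 and pattern[i] != pattern[k]:
            k = lps[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        lps[i] = k
    # Scan the text once; j is the number of pattern characters matched.
    matches = []
    j = 0
    for i, ch in enumerate(text):
        while j > 0 and ch != pattern[j]:
            j = lps[j - 1]  # reuse the matched prefix, never move back in text
        if ch == pattern[j]:
            j += 1
        if j == m:
            matches.append(i - m + 1)
            j = lps[j - 1]  # continue, allowing overlapping matches
    return matches


print(kmp_search("AAAABCAAAABCBAAAABC", "AAABC"))  # [1, 7, 14]
```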

What is N Queen Problem?
In the N-Queen problem, we are given an NxN chessboard and have to place N queens on the board in such a way that no two queens attack each other. A queen attacks another queen if both lie in the same row, the same column, or on the same diagonal. The most popular approach for solving the N Queen puzzle is Backtracking.

Input Output Scenario

Suppose the given chessboard is of size 4x4 and we have to arrange exactly 4 queens
in it. The solution arrangement is shown in the figure below −

The final solution matrix will be −

0 0 1 0
1 0 0 0
0 0 0 1
0 1 0 0

Backtracking Approach to solve N Queens Problem

In the naive method of solving the N Queen problem, the algorithm generates all possible arrangements of queens and examines them one by one. If an arrangement satisfies the constraints of the problem, it prints that solution. Backtracking avoids this exhaustive generation by abandoning a partial arrangement as soon as it violates a constraint.

Follow the below steps to solve the N Queen problem using the backtracking approach −
• Place the first queen in the top-left cell of the chessboard.

• After placing a queen in a cell, mark the position as part of the solution and recursively check whether this placement leads to a solution.

• If placing the queen does not lead to a solution, remove it (backtrack) and place the queen in another cell. Repeat until all cells are tried.
• If placing the queen leads to a solution, return TRUE.

• If all queens are placed, return TRUE.

• If all rows are tried and no solution is found, return FALSE.
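The steps above can be sketched as a column-by-column backtracking search. The function name `solve_n_queens` and the choice to place one queen per column, trying rows from top to bottom, are assumptions made for this illustration; with that order, the first solution found for N = 4 is the matrix shown earlier.

```python
def solve_n_queens(n):
    """Return the first solution as an n x n 0/1 matrix, or None."""
    rows = [-1] * n  # rows[c] = row of the queen placed in column c

    def is_safe(col, row):
        # Check the new queen against every queen already placed.
        for c in range(col):
            r = rows[c]
            if r == row or abs(r - row) == abs(c - col):
                return False  # same row or same diagonal
        return True

    def place(col):
        if col == n:
            return True  # all queens are placed
        for row in range(n):
            if is_safe(col, row):
                rows[col] = row
                if place(col + 1):
                    return True
                rows[col] = -1  # backtrack and try the next row
        return False  # all rows tried, no solution from this state

    if not place(0):
        return None
    board = [[0] * n for _ in range(n)]
    for c, r in enumerate(rows):
        board[r][c] = 1
    return board


for line in solve_n_queens(4):
    print(*line)
```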

Hamiltonian Circuit Problem :
• A Hamiltonian cycle is a cycle that contains all vertices in a graph. If a graph has a Hamiltonian cycle, then the graph is said to be Hamiltonian.

• A Hamiltonian cycle, also called a Hamiltonian circuit, Hamilton cycle, or Hamilton circuit, is a graph cycle (i.e., closed loop) through a graph that visits each node exactly once. A graph possessing a Hamiltonian cycle is said to be a Hamiltonian graph.

• The Hamiltonian cycle problem is a special case of the travelling salesman problem, obtained by setting the distance between two cities to one if they are adjacent and two otherwise, and verifying that the total distance travelled is equal to n (if so, the route is a Hamiltonian circuit; if there is no Hamiltonian circuit then the shortest route will be longer).

Example:-

• Solution: First, we start our search with vertex 'a'. This vertex becomes the root of our implicit tree.

• Next, we choose vertex 'b' adjacent to 'a', as it comes first in lexicographical order (b, c, d).

• Next, we select 'c' adjacent to 'b'.

• Next, we select 'd' adjacent to 'c'.

• Next, we select 'e' adjacent to 'd'.

• Next, we select vertex 'f' adjacent to 'e'. The vertices adjacent to 'f' are 'd' and 'e', but they have already been visited. Thus we reach a dead end, so we backtrack one step and remove vertex 'f' from the partial solution.

• After backtracking, the vertices adjacent to 'e' are b, c, d and f, of which 'f' has already been checked and b, c, d have already been visited. So we backtrack one more step. Now the unvisited vertices adjacent to 'd' are 'e' and 'f', of which 'e' has already been checked. Choosing 'f' leads only to 'e' (the vertices adjacent to 'f' are 'd' and 'e'), which again gives a dead state. So we backtrack one step further.

• Now we select 'e' adjacent to 'c', then 'f' adjacent to 'e', then 'd' adjacent to 'f', and finally 'a' adjacent to 'd'. This gives a Hamiltonian cycle, since every vertex other than the start vertex 'a' is visited exactly once: (a - b - c - e - f - d - a).
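The walk-through can be sketched as a backtracking search. Since the figure of the graph is not available here, the adjacency sets below are a reconstruction from the adjacencies mentioned in the steps and should be read as an assumption; `hamiltonian_cycle` is likewise a name chosen for illustration.

```python
def hamiltonian_cycle(graph, start):
    """Return a Hamiltonian cycle as a vertex list, or None if none exists."""
    path = [start]
    visited = {start}

    def extend():
        current = path[-1]
        if len(path) == len(graph):
            # Every vertex is on the path: it must close back to the start.
            return start in graph[current]
        for nxt in sorted(graph[current]):  # lexicographical order
            if nxt not in visited:
                visited.add(nxt)
                path.append(nxt)
                if extend():
                    return True
                path.pop()  # dead end: backtrack one step
                visited.remove(nxt)
        return False

    if extend():
        return path + [start]
    return None


# Adjacency reconstructed from the walk-through (an assumption).
graph = {
    "a": {"b", "c", "d"},
    "b": {"a", "c", "e"},
    "c": {"a", "b", "d", "e"},
    "d": {"a", "c", "e", "f"},
    "e": {"b", "c", "d", "f"},
    "f": {"d", "e"},
}
print(hamiltonian_cycle(graph, "a"))  # ['a', 'b', 'c', 'e', 'f', 'd', 'a']
```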

Subset Sum Problem :
The subset sum problem is to find a subset of elements, selected from a given set, whose sum adds up to a given number X. We assume that the set contains non-negative values.

In computer science, the subset sum problem is an important decision problem in complexity theory and cryptography. There are several equivalent formulations of the problem.

The Subset-Sum Problem is to find a subset s' of the given set S = (S1, S2, S3, ..., Sn), where the elements of S are n positive integers, such that s' ⊆ S and the sum of the elements of s' is equal to some positive integer X.

The Subset-Sum Problem can be solved by using the backtracking approach. Here the implicit tree is a binary tree. The root of the tree represents the state in which no decision has yet been taken on any input. We assume that the elements of the given set are arranged in increasing order:

S1 ≤ S2 ≤ S3 ... ≤ Sn

The left child of the root node indicates that we include S1 from the set S, and the right child of the root indicates that we exclude S1. Each node stores the total of the partial solution elements. If at any stage the sum equals X, then the search is successful and terminates.
A dead end in the tree appears only when either of the two inequalities holds (here s' denotes the sum of the elements chosen so far):

The sum of s' is too large, i.e.

s' + S_{i+1} > X

The sum of s' is too small, i.e.

s' + S_{i+1} + S_{i+2} + ... + S_n < X

Example: Given a set S = (3, 4, 5, 6) and X =9. Obtain the subset sum using
Backtracking approach.

Solution:

Initially S = (3, 4, 5, 6) and X =9.

S'= (∅)

The implicit binary tree for the subset sum problem is shown as fig:

The number inside a node is the sum of the partial solution elements at a particular level.

Thus, if the sum of our partial solution elements equals the positive integer X, the search terminates at that point; it continues only if all the possible solutions need to be obtained.
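The backtracking search for this example can be sketched as follows. It assumes positive integers, sorts them in increasing order, and prunes a node when the running sum is too large or too small; the function name `subset_sum` is chosen for illustration, and the sketch collects all solutions rather than stopping at the first.

```python
def subset_sum(values, target):
    """Return every subset of positive integers that sums to target."""
    values = sorted(values)  # increasing order, as the notes assume
    n = len(values)
    solutions = []
    chosen = []

    def explore(i, total, remaining):
        # total: sum chosen so far; remaining: sum of values[i:].
        if total == target:
            solutions.append(list(chosen))
            return
        if i == n:
            return
        if total + values[i] > target:
            return  # too large: every later value is at least as big
        if total + remaining < target:
            return  # too small: even taking everything left falls short
        chosen.append(values[i])  # left child: include values[i]
        explore(i + 1, total + values[i], remaining - values[i])
        chosen.pop()  # right child: exclude values[i]
        explore(i + 1, total, remaining - values[i])

    explore(0, 0, sum(values))
    return solutions


print(subset_sum([3, 4, 5, 6], 9))  # [[3, 6], [4, 5]]
```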
