unit5_trie

Uploaded by

ganeshpriyanmohan

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views

unit5_trie

Uploaded by

ganeshpriyanmohan

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

Unit 5

 Balanced Tree
 AVL Tree
 Red Black Tree
 Multi way search Tree
 B- Tree
 Binary Trie
 Multi-way Trie
 Suffix Tree
Trie data structure
What is a Trie data structure?

 The word "Trie" is an excerpt from the word "retrieval".

 Trie is a sorted tree-based data-structure that stores the set of strings.
 It has the number of pointers equal to the number of characters of the
alphabet in each node.
 It can search a word in the dictionary with the help of the word's prefix.
 For example, if we assume that all strings are formed from the letters 'a' to
'z' in the English alphabet, each trie node can have a maximum
of 26 points.

 Trie is also known as the digital tree or prefix tree. The position of a node in
the Trie determines the key with which that node is connected.
Properties of the Trie for a set of the
string:
 The root node of the trie always represents the null node.
 Each child of nodes is sorted alphabetically.
 Each node can have a maximum of 26 children (A to Z).
 Each node (except the root) can store one letter of the alphabet.
Basic operations of Trie
 There are three operations in the Trie:
 Insertion of a node
 Searching a node
 Deletion of a node

Insert of a node in the Trie

 The first operation is to insert a new node into the trie.
 Every letter of the input key (word) is inserted as an individual in the Trie_node.
 Note that children point to the next level of Trie nodes.

 The key character array acts as an index of children.

 If the present node already has a reference to the present letter, set the present
node to that referenced node. Otherwise, create a new node, set the letter to be
equal to the present letter, and even start the present node with this new node.
 The character length determines the depth of the trie.
Basic operations of Trie

 Searching a node in Trie

 The second operation is to search for a node in a Trie. The searching operation is
similar to the insertion operation. The search operation is used to search a key in
the trie.

 Deletion of a node in the Trie

 The Third operation is the deletion of a node in the Trie. Before we begin the
implementation, it is important to understand some points:
 If the key is not found in the trie, the delete operation will stop and exit it.
 If the key is found in the trie, delete it from the trie.
Applications of Trie
1. Spell Checker
 Spell checking is a three-step process. First, look for that word in a dictionary, generate
possible suggestions, and then sort the suggestion words with the desired word at the
top.
 Trie is used to store the word in dictionaries. The spell checker can easily be applied in
the most efficient way by searching for words on a data structure. Using trie not only
makes it easy to see the word in the dictionary, but it is also simple to build an algorithm
to include a collection of relevant words or suggestions.
2. Auto-complete
 Auto-complete functionality is widely used on text editors, mobile applications, and the
Internet. It provides a simple way to find an alternative word to complete the word for the
following reasons.
 It provides an alphabetical filter of entries by the key of the node.
 We trace pointers only to get the node that represents the string entered by the user.
 As soon as you start typing, it tries to complete your input.

3. Browser history
 It is also used to complete the URL in the browser. The browser keeps a history of the
URLs of the websites you've visited.
Trie

Advantages of Trie
 It can be insert faster and search the string than hash tables and binary
search trees.
 It provides an alphabetical filter of entries by the key of the node.

Disadvantages of Trie
 It requires more memory to store the strings.
 It is slower than the hash table.
Multiway tries
 A binary trie uses radix search with radix 2; a multiway trie uses radix search with
radix R > 2
 multiway tries are sometimes called R-ary tries

 If each digit in a key has r bits, the radix is R = 2 r , and if keys have at most B bits,
the worst-case number of comparisons would be only B/r
 However, to implement this idea, a node in the trie must be able to have as many
as R children
 Examples:
 Keys are words made up of lower-case letters in English. There are 26 different lower-
case letters in English, so a R-ary trie with R=26 could hold these keys. (This specific
variant is sometimes called an “alphabet trie”)
 Keys are decimal integers made up of decimal digits. There are 10 different decimal
digits, so a R-ary trie with R=10 could hold these keys
 Keys are 128-bit IEEE high precision floating point numbers. Consider each as made up of
32 4-bit nybbles. There are 2 4 = 16 different nybble values, so a R-ary trie with R=16
could hold these keys (note that lexicographic ordering of such keys is not the same as
their numeric ordering)
Suffix tree
 In algorithms for string processing and pattern matching, a suffix tree is a type of
data structure. It allows for quick pattern searching and other string-related
activities by compactly representing all the suffixes of a given string
 . It was first introduced by Ukkonen in 1995 and is now a key idea in bioinformatics
and computer science.
 Trie is simply an expanded version of the suffix tree.
 It is a trie that has all of a string's suffixes compressed into it.
 Suffix trees can be used to address several string-related issues.
 Pattern matching, spotting distinctive substrings within a string, and figuring out
the longest palindrome are a few of these issues.
 A suffix is a substring that consists of all the characters in the string from a
particular location to the very end.
 For instance, the suffixes for the string "banana" are "banana," "nana," "nana," "ana,"
"na," and "a." These suffixes are all stored in a tree-like data structure called a suffix tree.
An ordered tree data structure called a trie is effective at storing a dynamic set of strings.
Each edge of a suffix tree corresponds to a single character, and the pathways from the
root to the leaves make up the suffixes of the starting string.
And a compressed trie for the given set of strings
will look like:
What is a B Tree?

 The B Tree is a special type of multiway search

tree, commonly known as the M-way tree, which
balances itself. B Tree with order 3
 Because of their balanced structure, these trees are
commonly utilized to operate and manage immense
databases and simplify searches.
 In a B Tree, each node can have at most n child
nodes.
 B Tree is an example of Multilevel Indexing in a
Database Management System (DBMS). Leaf and
Internal nodes will both have record references.
 B Tree is known as Balanced Stored Tree because all
the leaf nodes are at the same level.
Rules of the B Tree
1. All the leaf nodes are at the same level.
2. The B Tree data structure is defined by the term minimum
degree 'd'. The value of 'd' depends on the size of the disk
block.
3. Every node, excluding the root, must consist of at least d-
1 keys. The root node may consist of a minimum of 1 key.
4. All nodes (including the root node) may consist of at
most (2d-1) keys.
5. The number of children of a node is equal to the addition
of the number of keys present in it and .
6. All keys of a node are sorted in ascending order. The child
between two keys, k1 and k2, consists of all the keys
ranging between k1 and k2, respectively.
7. Unlike the Binary Search Tree, the B Tree data structure
grows and shrinks from the root. Whereas the Binary
Search Tree grows downwards and shrinks downward.
B Tree of order 5
8. Similar to other Self-Balanced Binary Search Trees, the
Time complexity of the B Tree data structure for the
operations like searching, insertion, and deletion
is O(log?n).
9. The Insertion of a Node in the B Tree happens only at the
Leaf Node.
Rules of the B Tree
Every B Tree depends upon a positive constant integer known
as MINIMUM, which is utilized in order to determine the number
of data elements that can be held in a single node.
Rule 1: The root can have as few as only one data element (or
even no data elements if it is also no children); every other node
has at least MINIMUM data elements.
Rule 2: The maximum number of data elements stored in a
node is twice the value of MINIMUM.
Rule 3: The data elements of each node of the B Tree are stored
in a partially filled array, sorted from the smallest data element
(at index 0) to the largest data element (at the final utilized
position of the array).
Rule 4: The total number of subtrees below a non-leaf node is
always one more than the number of data elements in that node.
subtree 0,subtree 1,...
Rule 5: With respect to any non-leaf node:
A data element at index is greater than all the data B Tree of order 5
elements in subtree number i of the node, and
 A data element at index is less than all the data elements in
subtree number i+1 of the node.
Rule 6: Every leaf in a B Tree has the same depth. Thus, it
ensures that a B Tree prevents the problem of an unbalanced
tree.
Operations on a B Tree data structure

 In order to ensure that none of the properties of a B Tree data structure are
violated during the operations, the B Tree may be split or joined. The
following are some operations that we can perform on a B Tree:
 Searching a data element in B Tree
 Insertion of a data element in B Tree
 Deletion of a data element in B Tree
Searching Operation on a B
Tree
 Step 1: The search begins from the root node. Compare the search
element, k, with the root.
 Step 1.1: If the root node consists of the element k, the search will be complete.
 Step 1.2: If the element k is less than the first value in the root, we will move to
the leftmost child and search the child recursively.
 Step 1.3.1: If the root has only two children, we will move to the rightmost child
and recursively search the child nodes.
 Step 1.3.2: If the root has more than two keys, we will search the rest.

 Step 2: If the element k is not found after traversing the whole tree, then
the search element is not present in the B Tree.
Let us visualize the above steps with the help of an example.
Suppose that we wanted to search for a key k=34 in the following B Tree:
Let us visualize the above steps with the help of an example.
Suppose that we wanted to search for a key k=34 in the following B Tree:
Let us visualize the above steps with the help of an example.
Suppose that we wanted to search for a key k=34 in the following B Tree:

We compared the key with four different values in the above example until we found
it. Thus, the time complexity required for the search operation in a B Tree is O(log?n).

Lab 1 - Accessing and Preparing Data Steps
No ratings yet
Lab 1 - Accessing and Preparing Data Steps
28 pages
Correct Answer: 10: 1. The Default Value of "Target Scope" For Static Route Is
100% (1)
Correct Answer: 10: 1. The Default Value of "Target Scope" For Static Route Is
4 pages
Decoding PS2 Wired and Wireless Controller For Interfacing With PIC Micro Controller
100% (4)
Decoding PS2 Wired and Wireless Controller For Interfacing With PIC Micro Controller
17 pages
Week11 1
No ratings yet
Week11 1
11 pages
Data Structures (1)-73-94
No ratings yet
Data Structures (1)-73-94
22 pages
Tries and Convex Hull
No ratings yet
Tries and Convex Hull
5 pages
Data Structure
No ratings yet
Data Structure
5 pages
Hashing Refers To The Process of Generating A Fixed-Size Output From An Input of Variable Size
No ratings yet
Hashing Refers To The Process of Generating A Fixed-Size Output From An Input of Variable Size
10 pages
This Is My Technical Interview
No ratings yet
This Is My Technical Interview
13 pages
Unit 4
No ratings yet
Unit 4
25 pages
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Trees
No ratings yet
Trees
78 pages
Unit-4 Complete Notes
No ratings yet
Unit-4 Complete Notes
30 pages
Unit 3 Storage Strategies Indices B-Trees Hashing
No ratings yet
Unit 3 Storage Strategies Indices B-Trees Hashing
12 pages
Data Structure Unit-5-Tree-Data-Structure
No ratings yet
Data Structure Unit-5-Tree-Data-Structure
17 pages
DSA UNIT 4
No ratings yet
DSA UNIT 4
9 pages
Lesson 1 Interview Question: 1.why Do We Need Pointers?
No ratings yet
Lesson 1 Interview Question: 1.why Do We Need Pointers?
19 pages
My Research Paper On Data Structures
No ratings yet
My Research Paper On Data Structures
15 pages
DSA UNIT - 3
No ratings yet
DSA UNIT - 3
23 pages
Data Structure
No ratings yet
Data Structure
6 pages
1-INTRODUCTION TO DATA STRUCTURE
No ratings yet
1-INTRODUCTION TO DATA STRUCTURE
8 pages
Implementing Bubble Sort Algorithm
No ratings yet
Implementing Bubble Sort Algorithm
6 pages
Binary Tree
No ratings yet
Binary Tree
105 pages
Group4 Binary Search Tree
No ratings yet
Group4 Binary Search Tree
5 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
Unit 5
No ratings yet
Unit 5
37 pages
External Searching: B-Trees: Dr. Jicheng Fu
No ratings yet
External Searching: B-Trees: Dr. Jicheng Fu
66 pages
Chapter 5
No ratings yet
Chapter 5
5 pages
Algorithm Design Unit 2
No ratings yet
Algorithm Design Unit 2
35 pages
Studying For A Tech Interview Sucks
No ratings yet
Studying For A Tech Interview Sucks
8 pages
Data Structure BST
No ratings yet
Data Structure BST
37 pages
Unit4
No ratings yet
Unit4
21 pages
DSA & DAA Fast Study
No ratings yet
DSA & DAA Fast Study
6 pages
Chapter 05
No ratings yet
Chapter 05
27 pages
Unit 3 Tree Structure
100% (1)
Unit 3 Tree Structure
19 pages
Introduction to Tree (2)
No ratings yet
Introduction to Tree (2)
15 pages
Binary Tree (Part 1) - Chapter 6
No ratings yet
Binary Tree (Part 1) - Chapter 6
30 pages
R23_DS_Unit V-1
No ratings yet
R23_DS_Unit V-1
10 pages
Unit-6 Storage Strategies
No ratings yet
Unit-6 Storage Strategies
43 pages
Trie
No ratings yet
Trie
6 pages
Btree DB
No ratings yet
Btree DB
22 pages
Unit 4
No ratings yet
Unit 4
5 pages
Unit-5 Ada
No ratings yet
Unit-5 Ada
7 pages
DS UNIT 4
No ratings yet
DS UNIT 4
40 pages
Unit 03 - Non Linear Data Structure
No ratings yet
Unit 03 - Non Linear Data Structure
34 pages
Introduction To Tree
No ratings yet
Introduction To Tree
15 pages
UNIT-4
No ratings yet
UNIT-4
9 pages
Unit III – Topic 1 – TREE ADT & TREE TRAVERSAL
No ratings yet
Unit III – Topic 1 – TREE ADT & TREE TRAVERSAL
39 pages
B+ Tree Rules
No ratings yet
B+ Tree Rules
9 pages
datatstructure 4
No ratings yet
datatstructure 4
54 pages
Introduction To Binary Tree
No ratings yet
Introduction To Binary Tree
6 pages
Binary Trees Complete Lesson
No ratings yet
Binary Trees Complete Lesson
29 pages
Tries.pptx
No ratings yet
Tries.pptx
33 pages
Trab 1
No ratings yet
Trab 1
22 pages
Advantages Relative To Other Search Algorithms
No ratings yet
Advantages Relative To Other Search Algorithms
7 pages
DS Trees Short Notes
No ratings yet
DS Trees Short Notes
12 pages
Practice Problems Ans
No ratings yet
Practice Problems Ans
16 pages
UNIT-3 Data Structures Questions (Odd)
No ratings yet
UNIT-3 Data Structures Questions (Odd)
15 pages
5.4. ADS_Tries_Standard Tries
No ratings yet
5.4. ADS_Tries_Standard Tries
34 pages
DS_Unit 4
No ratings yet
DS_Unit 4
26 pages
Data Structure
No ratings yet
Data Structure
172 pages
Prefix Tree and String Matching
No ratings yet
Prefix Tree and String Matching
18 pages
Data STR
No ratings yet
Data STR
10 pages
261 SIMS Strategy and Insight Development March 30 2010
No ratings yet
261 SIMS Strategy and Insight Development March 30 2010
14 pages
YourName YourStudentID BSBWOR203 Assessment 1
No ratings yet
YourName YourStudentID BSBWOR203 Assessment 1
19 pages
Service Manual Service Manual: Integrated Amplifier Model
No ratings yet
Service Manual Service Manual: Integrated Amplifier Model
10 pages
Java/J2Ee Developer Devops Certified: Shivank
No ratings yet
Java/J2Ee Developer Devops Certified: Shivank
3 pages
Iso 15031 6 2015
No ratings yet
Iso 15031 6 2015
11 pages
U4 - 4 01 Outline
No ratings yet
U4 - 4 01 Outline
11 pages
Veeam Availability Suite v10: Configuration and Management: (VAS10CM)
No ratings yet
Veeam Availability Suite v10: Configuration and Management: (VAS10CM)
3 pages
Geo Informatics and Nano Technology
No ratings yet
Geo Informatics and Nano Technology
63 pages
Computer Fundamentals Questions and Answers
No ratings yet
Computer Fundamentals Questions and Answers
51 pages
Service Manual Printer Canon S600
No ratings yet
Service Manual Printer Canon S600
87 pages
SBI - Computer Aptitude
No ratings yet
SBI - Computer Aptitude
6 pages
Audio Amplifiers
No ratings yet
Audio Amplifiers
63 pages
Gis Interview Question
No ratings yet
Gis Interview Question
4 pages
CI867K01 Datasheet
No ratings yet
CI867K01 Datasheet
3 pages
Drops
No ratings yet
Drops
2 pages
Notebook Comparison
No ratings yet
Notebook Comparison
3 pages
CUACA Troubleshooting Guide
No ratings yet
CUACA Troubleshooting Guide
38 pages
CH 14
100% (3)
CH 14
4 pages
MIT TechnologyReview-March - April 2023
100% (1)
MIT TechnologyReview-March - April 2023
92 pages
ESACCI LC Ph2 PUGv2 - 2.0
No ratings yet
ESACCI LC Ph2 PUGv2 - 2.0
105 pages
House Judiciary Committee Discussion Draft
No ratings yet
House Judiciary Committee Discussion Draft
22 pages
CI 555 Datasheet - RevA3
No ratings yet
CI 555 Datasheet - RevA3
8 pages
Volvo Wheel Loader L150e, L180e, L220e - PDF
No ratings yet
Volvo Wheel Loader L150e, L180e, L220e - PDF
1 page
KKW Unit 3 Interactive SQL and Advanced SQL (Part A)
No ratings yet
KKW Unit 3 Interactive SQL and Advanced SQL (Part A)
12 pages
Chapter 6 - Programming Counters
No ratings yet
Chapter 6 - Programming Counters
23 pages
Advertising Objectives and Branding
100% (1)
Advertising Objectives and Branding
192 pages
Barriers To Appropriate Technology Growt
No ratings yet
Barriers To Appropriate Technology Growt
12 pages

unit5_trie

Uploaded by

unit5_trie

Uploaded by

Unit 5

 The word "Trie" is an excerpt from the word "retrieval".

Insert of a node in the Trie

 The key character array acts as an index of children.

 Searching a node in Trie

 Deletion of a node in the Trie

 The B Tree is a special type of multiway search

You might also like