DSA G5 Hashing Handouts

The document provides an overview of hashing, hash tables, and their key components, including hash functions and collision resolution techniques. It explains how hash tables operate, the importance of load factors, and various methods for handling collisions such as separate chaining and open addressing. Additionally, it outlines common applications of hashing in areas like database indexing and password verification.

Uploaded by

Rivicca Castillo


Data Structures and Algorithms

Group 5
Hashing
Definition
- Hashing refers to the process of transforming a given key into another value. It involves mapping data
to a specific index in a hash table using a hash function, which enables fast retrieval of
information based on its key. The value obtained from the hash function is called the hash code.

Hash Tables

A hash table is a data structure used to insert, look up, and remove key-value pairs quickly. It operates
on the hashing concept: a hash function translates each key into an index in an array. Ideally every key
gets its own index, but two different keys can map to the same index (a collision).
Each index identifies a specific location, or "bucket," in memory where the matching value is stored.
Hash tables are widely used in programming for their efficiency in performing quick
insertions, deletions, and lookups.
In simple words, a hash table maps keys to values.

What is Load factor?

A hash table’s load factor is the number of elements stored in the table divided by the number of
slots in the table. If the load factor is high, the table becomes crowded, collisions become more
frequent, and searches take longer.
A reasonable load factor can be maintained by resizing the table as it fills, while a good hash
function keeps the stored elements spread evenly across the slots.
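As a small illustration of the definition above, the sketch below computes a load factor and applies a resize rule. The function names and the 0.75 threshold are assumptions chosen for the example, not values taken from this handout.

```python
def load_factor(num_items: int, num_slots: int) -> float:
    """Return the load factor: stored items divided by table slots."""
    if num_slots <= 0:
        raise ValueError("num_slots must be positive")
    return num_items / num_slots


def needs_resize(num_items: int, num_slots: int, max_load: float = 0.75) -> bool:
    """Hypothetical rule: grow the table once the load factor exceeds max_load."""
    return load_factor(num_items, num_slots) > max_load


print(load_factor(6, 8))   # 0.75
print(needs_resize(6, 8))  # False, 0.75 does not exceed the threshold
print(needs_resize(7, 8))  # True, 7/8 = 0.875
```

A lower threshold wastes more slots but keeps collisions rare; a higher one saves memory at the cost of longer searches.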

Key Components of a Hash Table

Keys - unique identifiers for values. Keys are passed through a hash function to determine where to
store or retrieve the corresponding value.
Values - the data associated with each key, which can be of any data type.
Hash Function - an algorithm that takes a key as input and produces an index in the table where
the value will be stored. A good hash function distributes keys uniformly to minimize collisions.
Buckets - the individual slots in the hash table where key-value pairs are stored. Each bucket is
indexed by the hash function's output.

How a Hash Table Works

When a key-value pair is added, the key is hashed using a hash function, producing an index where
the value is stored.
For lookups, the hash table hashes the key to find the index where the value is stored.
If the hash function distributes keys well, the hash table can achieve average O(1) (constant-time)
complexity for insertion, deletion, and retrieval operations.
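The insert and lookup steps described above can be sketched as a toy table. This is only an illustration: the class name `ToyHashTable` is hypothetical, Python's built-in `hash()` stands in for the hash function, and collisions are deliberately not handled (the table simply raises an error).

```python
class ToyHashTable:
    """Toy table: one (key, value) pair per slot, collisions NOT handled."""

    def __init__(self, capacity: int = 8) -> None:
        self.capacity = capacity
        self.slots = [None] * capacity

    def _index(self, key) -> int:
        # Hash the key, then reduce the hash to a valid slot index.
        return hash(key) % self.capacity

    def put(self, key, value) -> None:
        i = self._index(key)
        occupant = self.slots[i]
        if occupant is not None and occupant[0] != key:
            raise RuntimeError("collision: slot already holds another key")
        self.slots[i] = (key, value)

    def get(self, key):
        # Hash the key again to find the slot where the value was stored.
        entry = self.slots[self._index(key)]
        if entry is None or entry[0] != key:
            raise KeyError(key)
        return entry[1]


table = ToyHashTable()
table.put(10, "ten")      # 10 % 8 = 2, stored in slot 2
table.put(3, "three")     # 3 % 8 = 3, stored in slot 3
print(table.get(10))      # ten
```

Both operations touch a single slot regardless of how many pairs are stored, which is where the constant-time average comes from.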

Hash Functions

Hash functions are a fundamental concept in computer science and play a crucial role in various
applications such as data storage, retrieval, and cryptography. They are primarily used in hash tables,
which are essential for efficient data management.
A hash function takes an input (or ‘message’) of arbitrary size and returns a fixed-size output. The output,
typically a number, is called the hash code or hash value.
The main purpose of a hash function is to efficiently map data of arbitrary size to fixed-size values,
which are often used as indexes in hash tables.

Common Types of Hash Functions

Division Method
It is one of the simplest hashing techniques.
A key or input is divided by a certain divisor (often a prime number), and the remainder is used as
the hash value. This method is easy to implement but may lead to clustering, where many keys
map to the same few hash values, if the keys share a pattern with the divisor.
Formula: Hash Value = key mod divisor
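A minimal sketch of the division method follows. The divisor 13 and the sample keys are assumptions picked for illustration.

```python
def division_hash(key: int, divisor: int = 13) -> int:
    """Division method: the remainder of key / divisor is the hash value."""
    return key % divisor


for key in (27, 40, 100):
    print(key, "->", division_hash(key))
# 27 -> 1
# 40 -> 1
# 100 -> 9
```

Keys 27 and 40 land on the same hash value here, which shows why a collision handling technique is still needed even with a prime divisor.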
Multiplication Method
It involves multiplying the key by a constant (usually a fraction between 0 and 1) and then
extracting a portion of the resulting number to use as the hash value.
This method reduces clustering by evenly distributing values across the hash table.
Formula: Hash Value = ⌊m × (k × A mod 1)⌋
where:
m is the size of the hash table,
k is the key,
A is a constant between 0 and 1 (often chosen as a fraction related to the golden ratio for
best results).
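The multiplication method can be sketched as below. The constant A = (√5 − 1)/2 ≈ 0.618 is one commonly suggested golden-ratio value and is an assumption here, as are the sample key and table size.

```python
import math

# Assumed constant: (sqrt(5) - 1) / 2, roughly 0.618.
A = (math.sqrt(5) - 1) / 2


def multiplication_hash(key: int, m: int, a: float = A) -> int:
    """Multiplication method: floor(m * fractional part of key * a)."""
    fractional = (key * a) % 1.0   # "k x A mod 1" keeps only the fraction
    return math.floor(m * fractional)


print(multiplication_hash(123456, 16384))  # 67
```

Unlike the division method, the table size m does not need to be prime here, because the spreading is done by the fractional part of k × A.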
Mid-Square Method
In the Mid-Square Method, the key is squared, and the middle portion of the resulting number is
taken as the hash value. This method often reduces collisions by creating a more diverse range of
hash values.
Steps:
i. Square the key.
ii. Extract the middle digits of the squared result as the hash value.
For example: If the key is 123, squaring it gives 123² = 15129. Taking the middle
three digits gives 512 as the hash value.
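The mid-square steps can be sketched as follows. Working on the decimal digits and leaning left when the middle is not exactly centered are assumptions of this sketch.

```python
def mid_square_hash(key: int, num_digits: int = 3) -> int:
    """Mid-square method: square the key and keep the middle decimal digits."""
    squared = str(key * key)
    if len(squared) <= num_digits:
        return int(squared)        # too short to trim, use the whole number
    start = (len(squared) - num_digits) // 2
    return int(squared[start:start + num_digits])


print(mid_square_hash(123))  # 512, the middle three digits of 15129
```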
Folding Method
The Folding Method splits the key into equal parts (often using the same number of digits) and
then adds those parts together to obtain the hash value. This method is useful when keys are large
numbers, like identification numbers.
Steps:
i. Split the key into several parts.
ii. Add those parts together to get the hash value.
For example: If the key is 123456, split it into 123 and 456. Summing these parts gives a hash
value of 123 + 456 = 579.
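A short sketch of the folding method is shown below. The optional `table_size` reduction is an assumption added so the result can fit a table; the handout's own example stops at the sum.

```python
def folding_hash(key: int, part_size: int = 3, table_size=None) -> int:
    """Folding method: split the key's digits into parts and add them."""
    digits = str(key)
    parts = [int(digits[i:i + part_size])
             for i in range(0, len(digits), part_size)]
    total = sum(parts)
    # Assumed extra step: reduce the sum to a valid index if a size is given.
    return total if table_size is None else total % table_size


print(folding_hash(123456))                  # 579, from 123 + 456
print(folding_hash(123456, table_size=100))  # 79
```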
Cryptographic Hash Functions
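Cryptographic hash functions are designed to be secure and are used in cryptography. They are often used for tasks like digital
signatures, password storage, and data integrity checks. Examples include MD5, SHA-1, and
SHA-256, although MD5 and SHA-1 are no longer considered collision-resistant and should be avoided where security matters.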
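As a brief illustration, Python's standard `hashlib` module exposes SHA-256. The messages below are arbitrary sample inputs chosen for this sketch.

```python
import hashlib

message = b"hello world"
digest = hashlib.sha256(message).hexdigest()
print(digest)
print(len(digest))  # 64 hex characters, i.e. 256 bits

# A one-character change in the input gives a completely different digest.
other = hashlib.sha256(b"hello worle").hexdigest()
print(digest == other)  # False
```

The output length stays fixed no matter how long the input is, which is the fixed-size property described earlier for hash functions in general.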
Collision Resolution Techniques

What is Collision?

Since a hash function maps a key, which may be a big integer or a string, to a small number, there is a
possibility that two keys result in the same value. The situation where a newly inserted key maps
to an already occupied slot in the hash table is called a collision, and it must be handled using some
collision handling technique.
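To make the idea concrete, the sketch below detects a collision without resolving it. The table size of 10, the `key mod size` hash, and the sample keys are assumptions for illustration.

```python
def slot_for(key: int, table_size: int) -> int:
    return key % table_size


def insert_all(keys, table_size: int = 10):
    """Place keys into a table and record collisions instead of resolving them."""
    table = [None] * table_size
    collisions = []
    for key in keys:
        i = slot_for(key, table_size)
        if table[i] is None:
            table[i] = key
        else:
            # (new key, key already in the slot, slot index)
            collisions.append((key, table[i], i))
    return table, collisions


table, collisions = insert_all([15, 42, 25])
print(collisions)  # [(25, 15, 5)]
```

Keys 15 and 25 both map to slot 5, so the second one needs a collision handling technique.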

How to handle Collisions?

There are mainly two methods to handle collision:

Separate Chaining
Open Addressing

SEPARATE CHAINING

The idea behind separate chaining is to make each slot of the array point to a linked list called a chain.
When multiple elements are hashed into the same slot index, these elements are inserted into that slot's
singly-linked list, which is known as a chain.
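A minimal sketch of separate chaining is given below. As a simplification, Python lists stand in for the singly-linked chains, and the class name `ChainedHashTable` and its method names are hypothetical.

```python
class ChainedHashTable:
    """Separate chaining sketch: each slot holds a chain of (key, value) pairs."""

    def __init__(self, num_slots: int = 8) -> None:
        self.buckets = [[] for _ in range(num_slots)]

    def _chain(self, key):
        return self.buckets[hash(key) % len(self.buckets)]

    def put(self, key, value) -> None:
        chain = self._chain(key)
        for i, (existing_key, _) in enumerate(chain):
            if existing_key == key:
                chain[i] = (key, value)   # key already present: update it
                return
        chain.append((key, value))        # new key: extend the chain

    def get(self, key):
        for existing_key, value in self._chain(key):
            if existing_key == key:
                return value
        raise KeyError(key)

    def remove(self, key) -> None:
        chain = self._chain(key)
        for i, (existing_key, _) in enumerate(chain):
            if existing_key == key:
                del chain[i]
                return
        raise KeyError(key)


table = ChainedHashTable(num_slots=8)
table.put(1, "a")
table.put(9, "b")      # 9 % 8 = 1, same slot as key 1, so it joins that chain
print(table.get(1))    # a
print(table.get(9))    # b
```

Search and delete walk one chain, so their cost grows with the chain length rather than with the total number of keys.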

Performance of Chaining

Performance of hashing can be evaluated under the assumption that each key is equally likely to be hashed
to any slot of the table (simple uniform hashing).
m = Number of slots in hash table
n = Number of keys to be inserted in hash table

Load factor α = n/m

Expected time to search = O(1 + α)
Expected time to delete = O(1 + α)

Time to insert = O(1)

Time complexity of search, insert, and delete is O(1) if α is O(1).

OPEN ADDRESSING

- Open addressing is a collision handling technique used in hash tables. Instead of using linked lists (as in separate
chaining), it keeps all elements within the hash table itself, so each slot holds
at most one key-value pair. When a collision occurs, it searches for an alternative empty slot within the table
to store the new key.

DIFFERENT WAYS OF OPEN ADDRESSING

Linear Probing: Searches sequentially from the point of collision until an empty slot is found.
Quadratic Probing: Uses a quadratic function to space out the search intervals, reducing clustering.
Double Hashing: Uses a second hash function to calculate the probe interval, so keys that collide at
the same slot usually follow different probe sequences.
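The three probing strategies can be compared with a short sketch. The function names, the `key mod m` home slot, and the second hash `1 + (key mod (m − 1))` are assumptions chosen for illustration; the table size m is assumed to be a prime of at least 2.

```python
def linear_probe(key: int, attempt: int, m: int) -> int:
    return (key % m + attempt) % m


def quadratic_probe(key: int, attempt: int, m: int) -> int:
    return (key % m + attempt * attempt) % m


def double_hash_probe(key: int, attempt: int, m: int) -> int:
    step = 1 + (key % (m - 1))      # hypothetical second hash, never zero
    return (key % m + attempt * step) % m


def insert(table, key, probe=linear_probe):
    """Store key in the first empty slot along its probe sequence."""
    m = len(table)
    for attempt in range(m):
        i = probe(key, attempt, m)
        if table[i] is None:
            table[i] = key
            return i
    raise RuntimeError("no empty slot found along the probe sequence")


table = [None] * 7
print([insert(table, k) for k in (10, 17, 24)])             # [3, 4, 5]
print([linear_probe(10, a, 7) for a in range(4)])           # [3, 4, 5, 6]
print([quadratic_probe(10, a, 7) for a in range(4)])        # [3, 4, 0, 5]
print([double_hash_probe(10, a, 7) for a in range(4)])      # [3, 1, 6, 4]
```

Keys 10, 17, and 24 all have home slot 3, so linear probing packs them into neighbouring slots, while the other two strategies spread later attempts across the table.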

Performance of Open Addressing

Load Factor α = n/m (where n is the number of keys and m is the number of slots):

α should be less than 1 for effective open addressing.

Search, insert, and delete operations take expected time ≈ 1/(1 − α), assuming uniform hashing.
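The effect of the load factor on open addressing can be tabulated with a short sketch. The function name is hypothetical, and the figures are the idealized 1/(1 − α) estimate under uniform hashing, not measurements.

```python
def expected_probes(alpha: float) -> float:
    """Idealized expected probe count 1 / (1 - alpha) for 0 <= alpha < 1."""
    if not 0 <= alpha < 1:
        raise ValueError("alpha must satisfy 0 <= alpha < 1")
    return 1 / (1 - alpha)


for alpha in (0.25, 0.5, 0.75, 0.9):
    print(f"alpha = {alpha:.2f} -> about {expected_probes(alpha):.1f} probes")
# alpha = 0.25 -> about 1.3 probes
# alpha = 0.50 -> about 2.0 probes
# alpha = 0.75 -> about 4.0 probes
# alpha = 0.90 -> about 10.0 probes
```

The cost rises sharply as α approaches 1, which is why open-addressed tables are resized well before they fill up.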

Aspect | Separate Chaining | Open Addressing
Implementation Complexity | Simpler to implement | More computation required
Table Capacity | Doesn’t fill up (can always add more elements to a chain) | Table can become full
Sensitivity to Hash Function | Less sensitive to hash function/load factors | Sensitive; requires careful tuning of load factor
Usage Scenario | Suitable for unknown key insertion/deletion frequency | Better for known key count and frequency
Cache Performance | Poorer (due to linked list traversal) | Better (data stays within the table)
Memory Efficiency | Uses extra space for links | No extra links, but requires sufficient slots
Space Utilization | Some slots might be unused | Each slot can be used, even if not directly mapped

Open Addressing has better cache efficiency and uses probing techniques (linear, quadratic, double
hashing) to resolve collisions, but it is limited by table capacity and is sensitive to load factor.

Application of Hashing
Database Indexing: Quickly retrieves data using key-value pairs.
Caches: Stores frequently accessed data for fast retrieval.
Dictionaries in Programming Languages: Implements associative arrays for key-value data storage.
Password Verification: Stores hashed passwords for secure authentication.
Memory Management: Tracks memory allocation and deallocation for efficient usage.
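The password verification use case can be sketched with Python's standard library. The salt, the PBKDF2 key-derivation function, and the iteration count are assumptions of this sketch; the handout itself only says that hashed passwords are stored.

```python
import hashlib
import hmac
import os


def hash_password(password: str, salt: bytes, iterations: int = 100_000) -> bytes:
    """Derive a hash from the password; only the salt and hash are stored."""
    return hashlib.pbkdf2_hmac("sha256", password.encode("utf-8"), salt, iterations)


def verify_password(password: str, salt: bytes, stored_hash: bytes,
                    iterations: int = 100_000) -> bool:
    candidate = hash_password(password, salt, iterations)
    # Constant-time comparison of the recomputed hash with the stored one.
    return hmac.compare_digest(candidate, stored_hash)


salt = os.urandom(16)                  # random per-user salt (assumed practice)
stored = hash_password("correct horse", salt)
print(verify_password("correct horse", salt, stored))  # True
print(verify_password("wrong guess", salt, stored))    # False
```

The plain password is never stored: at login the system hashes the submitted password again and compares the two hashes.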

Leader
Carlowe Deala
Members
Jenevive Sanchez
Joan Grace Patalinghug
Keisha Soler
Nathaniel Piraman
Ivann Jade Martel
John Marnell Asutilla
