0% found this document useful (0 votes)

15 views

11-Hash-Tables-II

Uploaded by

movieemailid9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views

11-Hash-Tables-II

Uploaded by

movieemailid9

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Hash Tables II

Data Structures and Algorithms

Andrei Bulatov
Algorithms – Hash Tables II 11-2

Hash Tables
In case of collision create a list of elements with the same hash value

h(k1 )
k1 key article next

h( k 2 )
k2 key article next key article next

k4
k3 h ( k3 )
key article next key article next
k6
k5
key article next
Algorithms – Hash Tables II 11-3

Good Hash Functions

Good hash functions are those that are as close to simple uniform
hashing as possible
It is difficult to achieve, since we do not know the distribution of keys

Note, there are two types of hash functions with absolutely different
requirements:
- hash functions to support data structures
- cryptographic hash functions

Assumption:
All keys are natural numbers
Algorithms – Hash Tables 10-4

The Division Method

Choose
Then ℎ( ) =

Should be careful with some values of

Say, no powers of 2, or powers of 10, or …

Primes is a good choice, as long as they are not close to a power of 2

Algorithms – Hash Tables 10-5

The Multiplication Method

Choose
Choose with 0 < <1

1 denotes the fractional part of , that is –

Then ℎ( ) = ( 1)
= 2 is a convenient value
If the size of a computer word is , choose to be a fraction like
for a integer
To compute ℎ( ), multiply by = ⋅ 2
The result is a 2 -bit value 2 +
Then ℎ( ) is the the most significant bits of
Algorithms – Hash Tables 10-6

Universal Hashing
To guarantee hashing even closer to simple uniform, a natural idea is to
choose hash function also at random, independent of the keys being
hashed
We use universal collection of hash functions
A collection of hash functions is called universal, if for each pair of
distinct keys and , the number of hash functions ℎ ∈ such
that ℎ( ) = ℎ( ) is no more than | |/

To construct a hash table we first select ℎ ∈ (randomly!), and then

use it
Algorithms – Hash Tables 10-7

Universal Hashing (cntd)

Lemma
Suppose a hash function is chosen at random from a universal
collection and is used to hash # keys into a table of size .
If key is not in the table, then the expected length $[#& ' ]
of the list that hashes to is at most ) = #/ .
If is in the table, then the expected length $[#& ' ] of the list
containing is at most 1 + )

Corollary
Using universal hashing and collision resolution by chaining in a table
with slots, it takes expected time Θ(#) to handle any
sequence of # table operations.
Algorithms – Hash Tables 10-8

Constructing a Universal Hashing Collection

Choose a prime such that all possible keys are in the range
{0, … , – 1}
Let / = {0, … , − 1} and / ∗ = {1, … , − 1}
For 2 ∈ / ∗ and 3 ∈ / let
ℎ4,5 = 2 +3
and
∗
,6 = {ℎ 4,5 ∶ 2 ∈ / ,3 ∈ / }

Theorem
The class ,6 of hash functions is universal
Algorithms – Hash Tables II 11-9

Open Addressing
A serious drawback of chaining: it uses a lot of pointers
The idea:
Keep all the lists inside the hash table
Instead of using pointers, compute the location of the next element

To insert or search the hash table

we successfully check or probe a
sequence of entries of the table

This sequence depends on the key being

searched or inserted
Algorithms – Hash Tables II 11-10

Probe Sequence
Hash function depends on 2 arguments and generates a probe
sequence
Formally:
ℎ: 9 0,1, … , – 1 → {0,1, … , – 1}
Probe sequence
ℎ( , 0), ℎ( , 1), … , ℎ( , – 1)
We want this sequence to be a permutation of 0,1, … , – 1, so that
every slot in the hash table can be occupied.

Clearly we cannot store more elements than the number of slots in the
table
Thus the load factor does not exceed 1
Algorithms – Hash Tables II 11-11

Insertion
Hash-Insert(;, )
set <: = 0
repeat
set =: = ℎ( , <)
if ;[=] =Nil then do
set ;[=]: =
return =
else set <: = < + 1
until < =
error “hash table overflow”
Algorithms – Hash Tables II 11-12

Search and Deletion

Hash-Search(;, )
set <: = 0
repeat
set =: = ℎ( , <)
if ;[=] = then return =
set <: = < + 1
until ; = = Nil or < =
return Nil

Deletion is difficult, as it is not possible in general to shift all elements in

a sequence, for some of them may belong to different sequences
We can write `Deleted’ instead of actual deleting
Or better use chaining
Algorithms – Hash Tables II 11-13

Probing: Linear
To generate a probe sequence we use an ordinary hash function,
called auxiliary hash function
ℎ′: 9 {0,1, … , – 1}
Linear probing:
ℎ , < = ℎ′ +<
Thus we start searching from slot ℎ′( ), then check ℎ′( ) + 1, etc.

Drawbacks:
- Primary clustering, long sequences of occupied slots build up
making the average search time too long
- Since ℎ( , 0) = ℎ( ′, 0) implies ℎ( , <) = ℎ( ′, <) for all <,
there are very few different probe sequences (m to be precise)
Algorithms – Hash Tables II 11-14

Probing: Quadratic
Quadratic probing:
ℎ( , <) = (ℎ′( ) + ? < + < )

where ℎ′ is an auxiliary hash function, ?, 0 are constants

No primary clustering

Drawbacks:
- Possible values of ?, , and are very restricted
- Secondary clustering, milder form of clustering
- Only few different probe sequences
Algorithms – Hash Tables II 11-15

Probing: Double Hashing

Double hashing uses two auxiliary hash functions
ℎ( , <) = (ℎ′( ) + < ℎ′′( ))
where ℎ′ and ℎ′′ are auxiliary hash functions
Thus the sequence depends on the value of two hash functions
It is unlikely it produces any kind of clustering
Also if ℎ′ and ℎ′′ are selected properly, we have different
probe sequences
Algorithms – Hash Tables II 11-16

Probing: Double Hashing (cntd)

Choice of ℎ′ and ℎ′′:
ℎ′′( ) should be relatively prime to to make sure we search the
entire table
Say, is a power of 2, and ℎ′′( ) is always odd
Or is prime, and ℎ′′( ) < for all
ℎ′( ) =
ℎ′′( ) = 1 + ( ′), and ′ = – 1
Algorithms – Hash Tables II 11-17

Open Addressing Analysis

Theorem
Given an open-address hash table with load factor ) = #/ < 1,
the expected number of probes in an unsuccessful search is at most
assuming uniform hashing
@A

Theorem
Given an open-address hash table with load factor ) = #/ < 1,
the expected number of probes in a successful search is at most
1 1
ln
) 1−)
assuming uniform hashing
Algorithms – Hash Tables II 11-18

Homework

Suggest how to organize a direct access table in which not all keys are
different. All operations must run in D(1) time

Show that if |9| > # (9 denotes the set of all possible keys), there
is a subset of 9 of size # consisting of keys that all hash to the
same slot, so that the worst-case searching time for hashing with
chaining is Θ(#)

Write pseudocode for Hash-Delete in the case of open addressing, and

modify Hash-Insert to handle deleted elements.

Les and Regulations
No ratings yet
Les and Regulations
3 pages
MindMap For PRINCE2
100% (3)
MindMap For PRINCE2
1 page
10 Hash Tables
No ratings yet
10 Hash Tables
19 pages
c11 Hashing
No ratings yet
c11 Hashing
9 pages
14 Hashing
No ratings yet
14 Hashing
23 pages
Chapter10_HashTables
No ratings yet
Chapter10_HashTables
49 pages
Lec 11 Hash Table
No ratings yet
Lec 11 Hash Table
43 pages
Module 5
No ratings yet
Module 5
25 pages
11-Hashing-Hong Kong (1)
No ratings yet
11-Hashing-Hong Kong (1)
25 pages
Chapter 5_Hashing _Part1
No ratings yet
Chapter 5_Hashing _Part1
28 pages
Overview of Hash Tables
No ratings yet
Overview of Hash Tables
4 pages
Hashing
No ratings yet
Hashing
38 pages
DSA2 Chapter 5 Hashing
No ratings yet
DSA2 Chapter 5 Hashing
44 pages
Perfect Hashing
No ratings yet
Perfect Hashing
6 pages
Hash Table 2010
No ratings yet
Hash Table 2010
43 pages
Lecture 8 Hashing
No ratings yet
Lecture 8 Hashing
47 pages
Hashing Updated
No ratings yet
Hashing Updated
26 pages
Hash Table PDF
No ratings yet
Hash Table PDF
25 pages
Lect1004 PDF
No ratings yet
Lect1004 PDF
7 pages
Dsa Lecture 13 Hash Tables
No ratings yet
Dsa Lecture 13 Hash Tables
15 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
25 pages
12 Hashing
No ratings yet
12 Hashing
9 pages
Hashing PPT
No ratings yet
Hashing PPT
39 pages
Lab 3
No ratings yet
Lab 3
5 pages
Hashing Important Theorems
No ratings yet
Hashing Important Theorems
26 pages
Hashing
No ratings yet
Hashing
37 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
Problem Idea of Universal Hashing
No ratings yet
Problem Idea of Universal Hashing
14 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
11 Hashing
No ratings yet
11 Hashing
60 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
Hashing - Datastructures and Algorithms
No ratings yet
Hashing - Datastructures and Algorithms
32 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
No ratings yet
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
78 pages
Hashing PDF
No ratings yet
Hashing PDF
65 pages
05-CSAI-230-COURSE-05
No ratings yet
05-CSAI-230-COURSE-05
44 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
Hashing: 15-111 Data Structures Data Structures
No ratings yet
Hashing: 15-111 Data Structures Data Structures
30 pages
CSE 326: Data Structures Hash Tables: Autumn 2007
No ratings yet
CSE 326: Data Structures Hash Tables: Autumn 2007
29 pages
Hashing: John Erol Evangelista
No ratings yet
Hashing: John Erol Evangelista
38 pages
DS 8
No ratings yet
DS 8
30 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
26 pages
unit 1 Hashing
No ratings yet
unit 1 Hashing
61 pages
IT245 - Module 8
No ratings yet
IT245 - Module 8
41 pages
11 - Hash Table
No ratings yet
11 - Hash Table
65 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
Lecture03 Hashing
No ratings yet
Lecture03 Hashing
12 pages
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
No ratings yet
Lecture 7 - Hash - Table - Direct - Adreess - Tables - Hash - Tables - Intro - Separate - Chaining
77 pages
Hash Tables - : Structure
No ratings yet
Hash Tables - : Structure
21 pages
Hashing
50% (2)
Hashing
43 pages
L21 Hashing
No ratings yet
L21 Hashing
55 pages
Lab08 - DS - Hash Tables
No ratings yet
Lab08 - DS - Hash Tables
9 pages
Chapter 8 - Searching
No ratings yet
Chapter 8 - Searching
44 pages
Algorithm Lecture6 Search
No ratings yet
Algorithm Lecture6 Search
40 pages
Week 10: Hash Table: Readings
No ratings yet
Week 10: Hash Table: Readings
18 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Hashing
No ratings yet
Hashing
10 pages
Ads-Unit I
No ratings yet
Ads-Unit I
16 pages
Dictionary ADT: Dictionaries 4/1/2003 8:43 AM
No ratings yet
Dictionary ADT: Dictionaries 4/1/2003 8:43 AM
4 pages
Unit -3
No ratings yet
Unit -3
45 pages
Dsa Merged
No ratings yet
Dsa Merged
339 pages
Hashing
From Everand
Hashing
Prakash Hegade
No ratings yet
Running in Parallel
No ratings yet
Running in Parallel
24 pages
UNIT-1
No ratings yet
UNIT-1
32 pages
Fundamentals of Relational Database: Yong Choi School of Business CSUB, Bakersfield
No ratings yet
Fundamentals of Relational Database: Yong Choi School of Business CSUB, Bakersfield
18 pages
Sequel Ize
No ratings yet
Sequel Ize
5 pages
TCP Com
100% (1)
TCP Com
1 page
Design of PID Controller For Automatic Voltage Regulator and Validation Using Hardware in The Loop Technique
No ratings yet
Design of PID Controller For Automatic Voltage Regulator and Validation Using Hardware in The Loop Technique
15 pages
Assignment 1 Tutor Marking Guidelines
No ratings yet
Assignment 1 Tutor Marking Guidelines
3 pages
(Download PDF) Advanced Data Analytics Using Python With Architectural Patterns Text and Image Classification and Optimization Techniques 2Nd Edition Sayan Mukhopadhyay Full Chapter PDF
100% (21)
(Download PDF) Advanced Data Analytics Using Python With Architectural Patterns Text and Image Classification and Optimization Techniques 2Nd Edition Sayan Mukhopadhyay Full Chapter PDF
70 pages
Active Directory: Operations Masters
No ratings yet
Active Directory: Operations Masters
25 pages
Updated MCQ On TAFLas Per AKTU Syllabus (Unit 5) )
No ratings yet
Updated MCQ On TAFLas Per AKTU Syllabus (Unit 5) )
59 pages
03 Mips
No ratings yet
03 Mips
27 pages
ILC Manual
No ratings yet
ILC Manual
110 pages
I 14229
No ratings yet
I 14229
17 pages
D100-Distribution Basic 6.0
No ratings yet
D100-Distribution Basic 6.0
377 pages
Eda Continuous Prob Distribution
No ratings yet
Eda Continuous Prob Distribution
3 pages
Wireless Hacking Report
No ratings yet
Wireless Hacking Report
3 pages
Floyd Warshall
No ratings yet
Floyd Warshall
6 pages
Corel Draw X4 Notes
No ratings yet
Corel Draw X4 Notes
12 pages
JAVA Lab Manual
No ratings yet
JAVA Lab Manual
39 pages
Nitish's CV
No ratings yet
Nitish's CV
1 page
Router eSIM v1 Faq
No ratings yet
Router eSIM v1 Faq
13 pages
The Mystery of Rahu in A Horoscope by Shiv Raj Sharma PDF
0% (3)
The Mystery of Rahu in A Horoscope by Shiv Raj Sharma PDF
2 pages
CH 2
No ratings yet
CH 2
30 pages
Laplacian of Gaussian (LoG)
No ratings yet
Laplacian of Gaussian (LoG)
4 pages
Data and File Structure Lab Manual
No ratings yet
Data and File Structure Lab Manual
8 pages
Oracle BICC
No ratings yet
Oracle BICC
5 pages
Gauss Elimination Backward
No ratings yet
Gauss Elimination Backward
14 pages
Qsys Intro
No ratings yet
Qsys Intro
62 pages

11-Hash-Tables-II

Uploaded by

11-Hash-Tables-II

Uploaded by

Hash Tables II

Data Structures and Algorithms

Good Hash Functions

The Division Method

Should be careful with some values of

Primes is a good choice, as long as they are not close to a power of 2

The Multiplication Method

1 denotes the fractional part of , that is –

To construct a hash table we first select ℎ ∈ (randomly!), and then

Universal Hashing (cntd)

Constructing a Universal Hashing Collection

To insert or search the hash table

This sequence depends on the key being

Search and Deletion

Deletion is difficult, as it is not possible in general to shift all elements in

where ℎ′ is an auxiliary hash function, ?, 0 are constants

Probing: Double Hashing

Probing: Double Hashing (cntd)

Open Addressing Analysis

Write pseudocode for Hash-Delete in the case of open addressing, and

You might also like