0% found this document useful (0 votes)

19 views

Sets Maps and Hash Tables Review

The document discusses sets, maps, and hash tables. Sets are collections with no duplicate elements that support operations like union and intersection. Maps are collections of key-value pairs where keys must be unique. Hash tables use hash functions to map keys to buckets, and must handle collisions when different keys hash to the same bucket using techniques like separate chaining or open addressing.

Uploaded by

Jenesis Escobar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views

Sets Maps and Hash Tables Review

Uploaded by

Jenesis Escobar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Sets Maps and Hash Tables

REVIEW

Set – collection that contains NO duplicate elements

• Cannot access elements by index (cannot do set[index])

Operations:

A  B = A OR B A  B = A AND B A – B = in A but NOT in B A  B = A is a SUBSET of B

l
m

A B A B A B A  B

std::set<type> s; → Ordered std::unordered_set<type> s; → Not Ordered

Methods: Methods: Same as ordered sets + 2 new:

insert(element) – adds element bucket_count() -- # buckets

erase(element) – removes element load_factor() -- # elements / # buckets
find(element) – returns iterator to element if it is found, or
returns an iterator to std::end otherwise # buckets = 4
count(element) – returns 1 if element is found or 0 otherwise # elements = 3
size() – gives number of elements in set Load factor = 3/4 = 0.75
empty() – returns if set is empty or not
Hash table
Implemented as: BST Implemented as: Hash Table
Time complexity: O(log(n)) Time complexity: O(1) (+ O(k) for hash)

std::set<type> s; std::unordered_set<type> s;

s.insert(5); s.insert(5);
s.insert(2); s.insert(2);
s.insert(4); s.insert(4);
s.insert(11); s.insert(11);
s.insert(2); // wont add 2 again s.insert(2); // wont add 2 again

// printing set using iterator yields: // printing set using iterator yields:
2, 4, 5, 11 11, 4, 5, 2

s.erase(4); s.bucket_count() // say it = 7;

s.load_factor() // 4 /7 = 0.571429
// printing set using iterator yields:
2, 5, 11
Behind the scenes

Map – collection of (key, value) pairs where key is unique

Many-to-one relationship
(Onto Mapping)
Operations:

std::map<type key, type value> m; → Ordered std::unordered_map<type key, type value> m; → Not Ordered

Methods: Methods: Same as ordered maps + 2 new:

insert (key, value) – if key already exists in map, returns false

otherwise inserts new entry with key, value pair.
map[key] = value – if key already exists in map, overwrites bucket_count() – # buckets
with new value load_factor() - # elements / # buckets
erase(key) – deletes key in map
find(key) – searches for key in map, and returns iterator to it if
found; otherwise returns iterator to map::end()
count(key) – returns 1 if key is found in map or 0 otherwise
size() – gives number of elements in map
empty() – returns if map is empty or not

Implemented as: BST Implemented as: Hash Table Clearly faster than
Time complexity: Time complexity: ordered maps :D
insert = O(log(n)), insert average case = O(1)
[] = O(log(n)) [] average case = O(1)

map<char, int> table; unordered_map<char, int> table;

table[‘b’] = 30; table[‘b’] = 30;

table[‘a’] = 10; table[‘a’] = 10;
table[‘c’] = 50; table[‘c’] = 50;
table[‘a’] = 40; // overwrites previous value of ‘a’ table[‘a’] = 40; // overwrites previous value of ‘a’

//printing using an iterator will yield: //printing using an iterator will yield:

a : 40 c : 50
b : 30 Prints in order of keys b : 30 Does not print in any sort of order
c : 50 a : 40

Hash Table – uses a hash function to compute an index (a hash code) which maps to a “bucket” containing value

Hash function: string_length % table_size

key = “macaroons” Hash code = 2

Having a good hash function is critical for hash table efficiency… value = yummy

Good hash functions will:

- Evenly distribute data (therefore minimizing the potential for
data collisions)
- Be easy to compute, (and very fast)

Bad/Invalid hash functions will:

- Produce different outputs for the same input
- Take lots of time
- Result in high potential for data collisions
7
What is a data collision? 6
5
To understand data collisions, we first must understand what load_factor is. 4
3
Load_factor = # entries / # buckets 3
2 2
It is a way to describe how “full” the hash table is becoming… 1 1
0 0
If load_factor becomes “too large”, (table becomes too full)
we should dynamically resize the table, and rehash our values to Load Factor = 3 /4 = 0.75 Load Factor = 3 /8 = 0.375
reduce the load_factor → making our table more time efficient :)

Okay so what is a Collision then?

Example]

Hash table size = 4 buckets

Julia (length = 5)
Hash function = string_length % table_size
5%4=1
Initially, our load_factor = 0 entries / 4 buckets = 0

Then let’s say we insert the key “Julia” (length = 5) John (length = 4)
4%4=0
Load factor now becomes 1 entry / 4 buckets = 0.25
888-555-1111
Then we insert “John” (length = 4)

Load factor = 2 entries / 4 buckets = 0.50 222-333-4444

Now let’s say we try to insert “Mariannae” (length = 9) Collision!

Mariannae (length = 4)
“Mariannae” hashes to the same value that “Julia” does. 9%4=1
This is a data collision.

Collision Resolution policies:

1. Separate chaining: each bucket stores a linked list; collisions are simply appended to the end of the list

Julia (length = 5)
5%4=1

John (length = 4)
4%4=0
888-555-1111 777-666-5555

222-333-4444

Mariannae (length = 4)
9%4=1

2. Open Addressing (Linear Probing): each bucket stores only one entry; if you try to add and entry and there is a
collision, move the “problem” entry (one bucket at a time) to the next available free bucket and put it there

3. Open Addressing (Quadratic Probing): same as linear probing, except you move the “problem” entry 1 bucket, then by
4 buckets, then by 9 buckets then by 16 buckets, etc.

L15 Maps and Hashes
No ratings yet
L15 Maps and Hashes
41 pages
Hash Table
No ratings yet
Hash Table
24 pages
DSA MK Lect2 PDF
No ratings yet
DSA MK Lect2 PDF
92 pages
Cse373 10 Hashing
No ratings yet
Cse373 10 Hashing
36 pages
L21 Hashing
No ratings yet
L21 Hashing
55 pages
9.map 1 HashTable
No ratings yet
9.map 1 HashTable
31 pages
Dsa Merged
No ratings yet
Dsa Merged
339 pages
09 Hashtable
No ratings yet
09 Hashtable
53 pages
Ch-2: Abstract Data Structures
No ratings yet
Ch-2: Abstract Data Structures
8 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
06 Hashing
No ratings yet
06 Hashing
6 pages
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
No ratings yet
CHAPTER 8 Hashing: Instructors: C. Y. Tang and J. S. Roger Jang
78 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
11 Hashtable-1
No ratings yet
11 Hashtable-1
48 pages
GROUP 15.Pptx Presentation
No ratings yet
GROUP 15.Pptx Presentation
29 pages
Lecture 27 - Hashing
No ratings yet
Lecture 27 - Hashing
48 pages
L5 HashTables
No ratings yet
L5 HashTables
22 pages
Hash Tables
No ratings yet
Hash Tables
35 pages
Lecture 13 - Hash Tables
No ratings yet
Lecture 13 - Hash Tables
51 pages
Chapter 8 - Hashing
No ratings yet
Chapter 8 - Hashing
78 pages
DSA Lab 11 Hashing
No ratings yet
DSA Lab 11 Hashing
9 pages
3 Hashing
No ratings yet
3 Hashing
20 pages
CH 4
No ratings yet
CH 4
58 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
43 pages
CS301 Lec41
No ratings yet
CS301 Lec41
18 pages
Maps
No ratings yet
Maps
36 pages
Hashing
No ratings yet
Hashing
20 pages
Hashing
No ratings yet
Hashing
23 pages
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
No ratings yet
MODULE 5_BCS304_HASHING_Leftisht trees_OBST_Notes
32 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Dsa Lecture 13 Hash Tables
No ratings yet
Dsa Lecture 13 Hash Tables
15 pages
Hashing
No ratings yet
Hashing
38 pages
Hash Tables
No ratings yet
Hash Tables
21 pages
unit 1 Hashing
No ratings yet
unit 1 Hashing
61 pages
Modifed Hash
No ratings yet
Modifed Hash
42 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
DSA Day 5
No ratings yet
DSA Day 5
10 pages
chap-1 ADS
No ratings yet
chap-1 ADS
5 pages
Struktur Data: By: Sri Rezeki Candra Nursari
No ratings yet
Struktur Data: By: Sri Rezeki Candra Nursari
34 pages
ADS Unit 3
No ratings yet
ADS Unit 3
14 pages
Unit IV Hashing and Set 9
No ratings yet
Unit IV Hashing and Set 9
8 pages
Lab 09 - Hashing
No ratings yet
Lab 09 - Hashing
47 pages
Hashing
No ratings yet
Hashing
9 pages
Unit 5 Data Structure
No ratings yet
Unit 5 Data Structure
12 pages
Introduction To Hashmaps
No ratings yet
Introduction To Hashmaps
16 pages
Hashing new
No ratings yet
Hashing new
48 pages
Maps and Dictionary: Data Structures and Algorithms
No ratings yet
Maps and Dictionary: Data Structures and Algorithms
50 pages
Lecture 12
No ratings yet
Lecture 12
33 pages
CS2040 Summary
No ratings yet
CS2040 Summary
16 pages
Group 15 Hash Tables
No ratings yet
Group 15 Hash Tables
42 pages
Lab08 - DS - Hash Tables
No ratings yet
Lab08 - DS - Hash Tables
9 pages
Hashing
No ratings yet
Hashing
44 pages
l26STLContainersAssociative (1)
No ratings yet
l26STLContainersAssociative (1)
15 pages
Hashing in Data Structure
No ratings yet
Hashing in Data Structure
25 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
From Everand
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Learn Programming Using C#
From Everand
Learn Programming Using C#
Taurius Litvinavicius
No ratings yet
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Cryptography and Network Security: by William Stallings
No ratings yet
Cryptography and Network Security: by William Stallings
37 pages
CSC 221 Hashing
No ratings yet
CSC 221 Hashing
2 pages
(Update) Post Test Week 6 - Attempt Review
No ratings yet
(Update) Post Test Week 6 - Attempt Review
5 pages
M.Tech JNTUK ADS UNIT-3
No ratings yet
M.Tech JNTUK ADS UNIT-3
13 pages
The Secure Hash Function (SHA) : Network Security
No ratings yet
The Secure Hash Function (SHA) : Network Security
24 pages
629314285 Hashing in Data Structure
No ratings yet
629314285 Hashing in Data Structure
23 pages
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
No ratings yet
Done DS GTU Study Material Presentations Unit-4 13032021035653AM
24 pages
Sha 256
100% (1)
Sha 256
199 pages
Hashing
No ratings yet
Hashing
75 pages
CrackStation - Online Password Hash Cracking - MD5, SHA1, Linux, Rainbow Tables, Etc
No ratings yet
CrackStation - Online Password Hash Cracking - MD5, SHA1, Linux, Rainbow Tables, Etc
2 pages
s53fds65f DSFDSFDSF
No ratings yet
s53fds65f DSFDSFDSF
17 pages
Keccak Slides at NIST
No ratings yet
Keccak Slides at NIST
71 pages
DS - Unit 5 - Notes
No ratings yet
DS - Unit 5 - Notes
8 pages
Hashing
No ratings yet
Hashing
30 pages
Student Solution Chap 12
No ratings yet
Student Solution Chap 12
10 pages
21-Birthday Attack and HMAC-16-03-2024
No ratings yet
21-Birthday Attack and HMAC-16-03-2024
39 pages
A Review Paper On Cryptographic Hash Function
No ratings yet
A Review Paper On Cryptographic Hash Function
11 pages
FDS Unit 5
No ratings yet
FDS Unit 5
22 pages
Assaignement 1
No ratings yet
Assaignement 1
7 pages
Hash Tables
No ratings yet
Hash Tables
4 pages
20. Hashing Technique
No ratings yet
20. Hashing Technique
8 pages
Implementation of Linear & Quadratic Probing
No ratings yet
Implementation of Linear & Quadratic Probing
11 pages
CO4 - Hashing in Data Structure
No ratings yet
CO4 - Hashing in Data Structure
13 pages
LBHM 3Rd All India Open Fide Rated Chess Tournament-2019 Click On Invitation For Prize List
No ratings yet
LBHM 3Rd All India Open Fide Rated Chess Tournament-2019 Click On Invitation For Prize List
10 pages
v.4.28.21 PSdZData Full
No ratings yet
v.4.28.21 PSdZData Full
2 pages
9.hash Function and Hash Table
No ratings yet
9.hash Function and Hash Table
19 pages
C
No ratings yet
C
20 pages
Assignment 2 - Group Assignment
No ratings yet
Assignment 2 - Group Assignment
6 pages
Hash Tables: Unit - III - Chapter 5 of Data Structures and Algorithm Analysis in C++ - Mark Allen Weiss
No ratings yet
Hash Tables: Unit - III - Chapter 5 of Data Structures and Algorithm Analysis in C++ - Mark Allen Weiss
60 pages

Sets Maps and Hash Tables Review

Uploaded by

Sets Maps and Hash Tables Review

Uploaded by

Sets Maps and Hash Tables

Set – collection that contains NO duplicate elements

A  B = A OR B A  B = A AND B A – B = in A but NOT in B A  B = A is a SUBSET of B

std::set<type> s; → Ordered std::unordered_set<type> s; → Not Ordered

Methods: Methods: Same as ordered sets + 2 new:

insert(element) – adds element bucket_count() -- # buckets

s.erase(4); s.bucket_count() // say it = 7;

Map – collection of (key, value) pairs where key is unique

Methods: Methods: Same as ordered maps + 2 new:

insert (key, value) – if key already exists in map, returns false

map<char, int> table; unordered_map<char, int> table;

table[‘b’] = 30; table[‘b’] = 30;

Hash function: string_length % table_size

Good hash functions will:

Bad/Invalid hash functions will:

Okay so what is a Collision then?

Hash table size = 4 buckets

Load factor = 2 entries / 4 buckets = 0.50 222-333-4444

Now let’s say we try to insert “Mariannae” (length = 9) Collision!

Collision Resolution policies:

You might also like