0% found this document useful (0 votes)

21 views

Hashing

This document discusses hashing and hash tables. Hashing allows items to be stored and searched in constant O(1) time by mapping items to slots in a hash table using a hash function. Common hash functions include the remainder method, folding method, and mid-square method. Collisions occur when multiple items map to the same slot, compromising the efficiency. The goal is to minimize collisions and evenly distribute items in the hash table.

Uploaded by

DOMINIC MUSHAYI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Hashing

Uploaded by

DOMINIC MUSHAYI

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

pythonds This Chapter Saving and Logging are Disabled    ✏ 

6.5. Hashing
In previous sections we were able to make improvements in our search
algorithms by taking advantage of
information about where items are
stored in the collection with respect to one another. For example, by
knowing that a list was ordered, we could search in logarithmic time
using a binary search. In this section
we will attempt to go one step
further by building a data structure that can be searched in
O(1) time. This
concept is referred to as hashing.

In order to do this, we will need to know even more about where the
items might be when we go to look for
them in the collection. If every
item is where it should be, then the search can use a single comparison
to
discover the presence of an item. We will see, however, that this is
typically not the case.

A hash table is a collection of items which are stored in such a way

as to make it easy to find them later.
Each position of the hash table,
often called a slot, can hold an item and is named by an integer
value
starting at 0. For example, we will have a slot named 0, a slot
named 1, a slot named 2, and so on. Initially,
the hash table contains
no items so every slot is empty. We can implement a hash table by using
a list with
each element initialized to the special Python value
None . Figure 4 shows a hash table of size m = 11.
In other words, there are m slots in the table, named 0 through 10.

Figure 4: Hash Table with 11 Empty Slots

The mapping between an item and the slot where that item belongs in the
hash table is called the hash
function. The hash function will take
any item in the collection and return an integer in the range of slot
names, between 0 and m-1. Assume that we have the set of integer items
54, 26, 93, 17, 77, and 31. Our
first hash function, sometimes referred
to as the “remainder method,” simply takes an item and divides it
by the
table size, returning the remainder as its hash value
(h(item) = item%11). Table 4 gives all of
the
hash values for our example items. Note that this remainder method
(modulo arithmetic) will typically
be present in some form in all hash
functions, since the result must be in the range of slot names.

Table 4: Simple Hash Function Using Remainders

Item Hash Value

54 10

26 4

93 5

17 6

77 0

31 9

Once the hash values have been computed, we can insert each item into
the hash table at the designated
position as shown in
Figure 5. Note that 6 of the 11 slots are now occupied. This
is referred to as the load
numberof items 6
factor, and is commonly denoted by
λ =
tablesize
. For this example,
λ =
11
.

Figure 5: Hash Table with Six Items

Now when we want to search for an item, we simply use the hash function
to compute the slot name for
the item and then check the hash table to
see if it is present. This searching operation is O(1), since
a
constant amount of time is required to compute the hash value and then
index the hash table at that
location. If everything is where it should
be, we have found a constant time search algorithm.

You can probably already see that this technique is going to work only
if each item maps to a unique
location in the hash table. For example,
if the item 44 had been the next item in our collection, it would
have a
hash value of 0 (44%11 == 0). Since 77 also had a hash
value of 0, we would have a problem.
According to the hash function, two
or more items would need to be in the same slot. This is referred to as
a collision (it may also be called a “clash”). Clearly, collisions
create a problem for the hashing technique.
We will discuss them in
detail later.

6.5.1. Hash Functions

Given a collection of items, a hash function that maps each item into a
unique slot is referred to as a
perfect hash function. If we know
the items and the collection will never change, then it is possible to
construct a perfect hash function (refer to the exercises for more about
perfect hash functions).
Unfortunately, given an arbitrary collection of
items, there is no systematic way to construct a perfect hash
function.
Luckily, we do not need the hash function to be perfect to still gain
performance efficiency.

One way to always have a perfect hash function is to increase the size
of the hash table so that each
possible value in the item range can be
accommodated. This guarantees that each item will have a unique
 slot.
Although this is practical for small numbers of items, it is not
feasible when the number of possible 
items is large. For example, if the
items were nine-digit Social Security numbers, this method would
require
almost one billion slots. If we only want to store data for a class of
25 students, we will be wasting
an enormous amount of memory.
Our goal is to create a hash function that minimizes the number of
collisions, is easy to compute, and
pythonds This Chapter
evenly distributes the items in the
hash table. There are a Saving
number and Logging
of common are to
ways extend the
Disabled simple   ✏ 
remainder method. We will consider a few of them here.

The folding method for constructing hash functions begins by

dividing the item into equal-size pieces (the
last piece may not be of
equal size). These pieces are then added together to give the resulting
hash
value. For example, if our item was the phone number 436-555-4601,
we would take the digits and divide
them into groups of 2
(43,65,55,46,01). After the addition, 43 + 65 + 55 + 46 + 01, we get
210. If we
assume our hash table has 11 slots, then we need to perform
the extra step of dividing by 11 and keeping
the remainder. In this case
210 % 11 is 1, so the phone number 436-555-4601 hashes to
slot 1. Some
folding methods go one step further and reverse every other
piece before the addition. For the above
example, we get
43 + 56 + 55 + 64 + 01 = 219 which gives 219 % 11 = 10.

Another numerical technique for constructing a hash function is called

the mid-square method. We first
square the item, and then extract
some portion of the resulting digits. For example, if the item were 44,
we
would first compute 442 = 1, 936. By extracting the
middle two digits, 93, and performing the remainder
step, we get 5
(93 % 11). Table 5 shows items under both the
remainder method and the mid-square
method. You should verify that you
understand how these values were computed.

Table 5: Comparison of Remainder and Mid-Square Methods

Item Remainder Mid-Square

54 10 3

26 4 7

93 5 9

17 6 8

77 0 4

31 9 6

We can also create hash functions for character-based items such as

strings. The word “cat” can be
thought of as a sequence of ordinal
values.

>>> ord('c')

>>> ord('a')

>>> ord('t')

116

We can then take these three ordinal values, add them up, and use the
remainder method to get a hash
value (see Figure 6).
Listing 1 shows a function called hash that takes a
string and a table size and
returns the hash value in the range from 0
to tablesize -1.

Figure 6: Hashing a String Using Ordinal Values

Listing 1

def hash(astring, tablesize):

sum = 0

for pos in range(len(astring)):

sum = sum + ord(astring[pos])

return sum%tablesize

It is interesting to note that when using this hash function, anagrams

will always be given the same hash
value. To remedy this, we could use
the position of the character as a weight. Figure 7 shows
one possible
way to use the positional value as a weighting factor. The
modification to the hash function is left as an
exercise.

Figure 7: Hashing a String Using Ordinal Values with Weighting

You may be able to think of a number of additional ways to compute hash

values for items in a collection.
The important thing to remember is
that the hash function has to be efficient so that it does not become
the dominant part of the storage and search process. If the hash
function is too complex, then it becomes
more work to compute the slot
name than it would be to simply do a basic sequential or binary search
as
 described earlier. This would quickly defeat the purpose of hashing. 

6.5.2. Collision Resolution

We now return to the problem of collisions. When two items hash to the
same slot, we must have a
pythonds This Chapter
systematic method for placing the second item
in the hashSaving andprocess
table. This Logging is are collision 
Disabled
called   ✏ 
resolution. As
we stated earlier, if the hash function is perfect, collisions will
never occur. However, since
this is often not possible, collision
resolution becomes a very important part of hashing.

One method for resolving collisions looks into the hash table and tries
to find another open slot to hold the
item that caused the collision. A
simple way to do this is to start at the original hash value position
and
then move in a sequential manner through the slots until we
encounter the first slot that is empty. Note that
we may need to go back
to the first slot (circularly) to cover the entire hash table. This
collision resolution
process is referred to as open addressing in
that it tries to find the next open slot or address in the hash
table.
By systematically visiting each slot one at a time, we are performing an
open addressing technique
called linear probing.

Figure 8 shows an extended set of integer items under the

simple remainder method hash function
(54,26,93,17,77,31,44,55,20).
Table 4 above shows the hash values for the original items.
Figure 5 shows
the original contents. When we attempt to
place 44 into slot 0, a collision occurs. Under linear probing, we
look
sequentially, slot by slot, until we find an open position. In this
case, we find slot 1.

Again, 55 should go in slot 0 but must be placed in slot 2 since it is

the next open position. The final value
of 20 hashes to slot 9. Since
slot 9 is full, we begin to do linear probing. We visit slots 10, 0, 1,
and 2, and
finally find an empty slot at position 3.

Figure 8: Collision Resolution with Linear Probing

Once we have built a hash table using open addressing and linear
probing, it is essential that we utilize the
same methods to search for
items. Assume we want to look up the item 93. When we compute the hash
value, we get 5. Looking in slot 5 reveals 93, and we can return
True . What if we are looking for 20? Now
the hash value is 9, and
slot 9 is currently holding 31. We cannot simply return False since
we know that
there could have been collisions. We are now forced to do a
sequential search, starting at position 10,
looking until either we find
the item 20 or we find an empty slot.

A disadvantage to linear probing is the tendency for clustering;

items become clustered in the table. This
means that if many collisions
occur at the same hash value, a number of surrounding slots will be
filled by
the linear probing resolution. This will have an impact on
other items that are being inserted, as we saw
when we tried to add the
item 20 above. A cluster of values hashing to 0 had to be skipped to
finally find
an open position. This cluster is shown in
Figure 9.

Figure 9: A Cluster of Items for Slot 0

One way to deal with clustering is to extend the linear probing

technique so that instead of looking
sequentially for the next open
slot, we skip slots, thereby more evenly distributing the items that
have
caused collisions. This will potentially reduce the clustering that
occurs. Figure 10 shows the items when
collision
resolution is done with a “plus 3” probe. This means that once a
collision occurs, we will look at
every third slot until we find one
that is empty.

Figure 10: Collision Resolution Using “Plus 3”

The general name for this process of looking for another slot after a
collision is rehashing. With simple
linear probing, the rehash
function is newhashvalue = rehash(oldhashvalue) where

rehash(pos) = (pos + 1)%sizeof table. The “plus 3” rehash

can be defined as

rehash(pos) = (pos + 3)%sizeof table. In

general,

rehash(pos) = (pos + skip)%sizeof table. It is

important to note that the size of the “skip” must

be such that all the

slots in the table will eventually be visited. Otherwise, part of the
table will be unused.
To ensure this, it is often suggested that the
table size be a prime number. This is the reason we have
been using 11
in our examples.

A variation of the linear probing idea is called quadratic probing.

Instead of using a constant “skip” value,
we use a rehash function that
increments the hash value by 1, 3, 5, 7, 9, and so on. This means that
if the
first hash value is h, the successive values are h + 1,
h + 4, h + 9, h + 16, and so on. In general, the
i will be i^2 rehash(pos) = (h + i2). In other words,
quadratic probing uses a skip consisting of
successive perfect squares.
Figure 11 shows our example values after they are placed using
this
technique.

Figure 11: Collision Resolution with Quadratic Probing

An alternative method for handling the collision problem is to allow

each slot to hold a reference to a
collection (or chain) of items.
Chaining allows many items to exist at the same location in the hash
table.
When collisions happen, the item is still placed in the proper
slot of the hash table. As more and more
 items hash to the same
location, the difficulty of searching for the item in the collection
increases. Figure 
12 shows the items as they are added to a hash
table that uses chaining to resolve collisions.
pythonds This Chapter Saving and Logging are Disabled    ✏ 

Figure 12: Collision Resolution with Chaining

When we want to search for an item, we use the hash function to generate
the slot where it should reside.
Since each slot holds a collection, we
use a searching technique to decide whether the item is present.
The
advantage is that on the average there are likely to be many fewer items
in each slot, so the search is
perhaps more efficient. We will look at
the analysis for hashing at the end of this section.

Self Check

Q-1: In a hash table of size 13 which index positions would the following two keys map to? 27,
130

A. 1, 10

B. 13, 0

C. 1, 0

D. 2, 3

Check Me Compare me

Activity: 6.5.2.1 Multiple Choice (HASH_1)

Q-2: Suppose you are given the following set of keys to insert into a hash table that holds
exactly 11 values: 113 , 117 , 97 , 100 , 114 , 108 , 116 , 105 , 99 Which of the following best
demonstrates the contents of the hash table after all the keys have been inserted using linear
probing?

A. 100, __, __, 113, 114, 105, 116, 117, 97, 108, 99

B. 99, 100, __, 113, 114, __, 116, 117, 105, 97, 108

108

C. 100, 113, 117, 97, 14, 108, 116, 105, 99, __, __

D. 117, 114, 108, 116, 105, 99, __, __, 97, 100, 113

113

Check Me Compare me

Activity: 6.5.2.2 Multiple Choice (HASH_2)

6.5.3. Implementing the Map Abstract Data Type

One of the most useful Python collections is the dictionary. Recall that
a dictionary is an associative data
type where you can store key–data
pairs. The key is used to look up the associated data value. We often
refer to this idea as a map.

The map abstract data type is defined as follows. The structure is an

unordered collection of associations
between a key and a data value. The
keys in a map are all unique so that there is a one-to-one
relationship
between a key and a value. The operations are given below.

Map() Create a new, empty map. It returns an empty map

collection.
put(key,val) Add a new key-value pair to the map. If the key is
already in the map then replace
the old value with the new value.
get(key) Given a key, return the value stored in the map or
None otherwise.
del Delete the key-value pair from the map using a statement of
the form del map[key] .
len() Return the number of key-value pairs stored in the map.
in Return True for a statement of the form key in map , if
the given key is in the map, False
otherwise.

One of the great benefits of a dictionary is the fact that given a key,
we can look up the associated data
value very quickly. In order to
provide this fast look up capability, we need an implementation that
supports
an efficient search. We could use a list with sequential or
binary search but it would be even better to use
a hash table as
described above since looking up an item in a hash table can approach
O(1)
performance.

In Listing 2 we use two lists to create a

HashTable class that implements the Map abstract data type.
One
list, called slots , will hold the key items and a parallel list,
called data , will hold the data values.
When we look up a key, the
corresponding position in the data list will hold the associated data
value. We
will treat the key list as a hash table using the ideas
presented earlier. Note that the initial size for the hash
table has
been chosen to be 11. Although this is arbitrary, it is important that
the size be a prime number
 
so that the collision resolution algorithm
can be as efficient as possible.

Listing 2
class HashTable:

pythonds This Chapter Saving and Logging are Disabled    ✏ 

def __init__(self):

self.size = 11

self.slots = [None] * self.size

self.data = [None] * self.size

hashfunction implements the simple remainder method. The collision

resolution technique is linear
probing with a “plus 1” rehash function.
The put function (see Listing 3) assumes that
there will
eventually be an empty slot unless the key is already present
in the self.slots . It computes the original
hash value and if that
slot is not empty, iterates the rehash function until an empty slot
occurs. If a
nonempty slot already contains the key, the old data value
is replaced with the new data value. Dealing
with the situation where there are
no empty slots left is an exercise.

Listing 3

def put(self,key,data):

hashvalue = self.hashfunction(key,len(self.slots))

if self.slots[hashvalue] == None:

self.slots[hashvalue] = key

self.data[hashvalue] = data

else:

if self.slots[hashvalue] == key:

self.data[hashvalue] = data #replace

else:

nextslot = self.rehash(hashvalue,len(self.slots))

while self.slots[nextslot] != None and \

self.slots[nextslot] != key:

nextslot = self.rehash(nextslot,len(self.slots))

if self.slots[nextslot] == None:

self.slots[nextslot]=key

self.data[nextslot]=data

else:

self.data[nextslot] = data #replace

def hashfunction(self,key,size):

return key%size

def rehash(self,oldhash,size):

return (oldhash+1)%size

Likewise, the get function (see Listing 4)

begins by computing the initial hash value. If the value is not in
the
initial slot, rehash is used to locate the next possible position.
Notice that line 15 guarantees that the
search will terminate by
checking to make sure that we have not returned to the initial slot. If
that happens,
we have exhausted all possible slots and the item must not
be present.

The final methods of the HashTable class provide additional

dictionary functionality. We overload the
__getitem__ and
__setitem__ methods to allow access using``[]``. This means that
once a HashTable
has been created, the familiar index operator will
be available. We leave the remaining methods as
exercises.

Listing 4

1 def get(self,key):

2 startslot = self.hashfunction(key,len(self.slots))

4 data = None

5 stop = False

6 found = False

7 position = startslot

8 while self.slots[position] != None and \

9 not found and not stop:

10 if self.slots[position] == key:

11 found = True

12 data = self.data[position]

13 else:

14 position=self.rehash(position,len(self.slots))

15 if position == startslot:

16 stop = True

17 return data

19 def __getitem__(self,key):

20 return self.get(key)

22 def __setitem__(self,key,data):

23 self.put(key,data)

The following session shows the HashTable class in action. First we

will create a hash table and store
some items with integer keys and
string data values.

>>> H=HashTable()

>>> H[54]="cat"

>>> H[26]="dog"

>>> H[93]="lion"

>>> H[17]="tiger"

>>> H[77]="bird"

>>> H[31]="cow"

>>> H[44]="goat"

>>> H[55]="pig"

>>> H[20]="chicken"

>>> H.slots

 [77, 44, 55, 20, 26, 93, 17, None, None, 31, 54]

>>> H.data

['bird', 'goat', 'pig', 'chicken', 'dog', 'lion',

'tiger', None, None, 'cow', 'cat']

Next we will access and modify some items in the hash table. Note that
the value for the key 20 is being
pythonds This Chapter
replaced. Saving and Logging are Disabled    ✏ 

>>> H[20]

'chicken'

>>> H[17]

'tiger'

>>> H[20]='duck'

>>> H[20]

'duck'

>>> H.data

['bird', 'goat', 'pig', 'duck', 'dog', 'lion',

'tiger', None, None, 'cow', 'cat']

>> print(H[99])

None

The complete hash table example can be found in ActiveCode 1.

Save & Run Show Code Show CodeLens

Activity: 6.5.3.1 Complete Hash Table Example (hashtablecomplete)

6.5.4. Analysis of Hashing

We stated earlier that in the best case hashing would provide a
O(1), constant time search technique.
However, due to
collisions, the number of comparisons is typically not so simple. Even
though a complete
analysis of hashing is beyond the scope of this text,
we can state some well-known results that
approximate the number of
comparisons necessary to search for an item.

The most important piece of information we need to analyze the use of a

hash table is the load factor, λ.
Conceptually, if
λ is small, then there is a lower chance of collisions,
meaning that items are more likely to
be in the slots where they belong.
If λ is large, meaning that the table is filling up,
then there are more and
more collisions. This means that collision
resolution is more difficult, requiring more comparisons to find an
empty slot. With chaining, increased collisions means an increased
number of items on each chain.

As before, we will have a result for both a successful and an

unsuccessful search. For a successful search
using open addressing with
linear probing, the average number of comparisons is approximately

2
1 1 1 1

2
(1 +
1−λ
) and an
unsuccessful search gives
2
(1 + (
1−λ
) )
If we are using chaining, the
λ
average number of comparisons is
1 + 2
for the successful case, and simply
λ comparisons if the
search is unsuccessful.

You have attempted 1 of 4 activities on this page

user not logged in

 

BCS304-DSA Notes M-5
100% (1)
BCS304-DSA Notes M-5
22 pages
Fanuc I/O Unit-Model A: Connection and Maintenance Manual
100% (2)
Fanuc I/O Unit-Model A: Connection and Maintenance Manual
244 pages
Ruhrpumpen Vertical Turbine Pumps - Operation and Maintenance Manuals
100% (1)
Ruhrpumpen Vertical Turbine Pumps - Operation and Maintenance Manuals
44 pages
Hashing Interactivepy
No ratings yet
Hashing Interactivepy
11 pages
Hashing
No ratings yet
Hashing
23 pages
Algo Cha 8
No ratings yet
Algo Cha 8
20 pages
HASHING
No ratings yet
HASHING
8 pages
Week13 1
No ratings yet
Week13 1
16 pages
Module 6 DSA 24
No ratings yet
Module 6 DSA 24
64 pages
Lab 3
No ratings yet
Lab 3
5 pages
Hashing
No ratings yet
Hashing
7 pages
UNIT V - Hashing
No ratings yet
UNIT V - Hashing
20 pages
CH 4 Hash Table
No ratings yet
CH 4 Hash Table
20 pages
Hashing
No ratings yet
Hashing
56 pages
Hash
No ratings yet
Hash
7 pages
Unit-9-Hashing-BIM
No ratings yet
Unit-9-Hashing-BIM
5 pages
DS Module 5 Hashing
No ratings yet
DS Module 5 Hashing
23 pages
DS Module-X
No ratings yet
DS Module-X
74 pages
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
No ratings yet
Module 5: HASHING: Functions. The Values Are Then Stored in A Data Structure Called Hash Table
39 pages
Hashing and Indexing
No ratings yet
Hashing and Indexing
28 pages
Hashing PDF
No ratings yet
Hashing PDF
56 pages
Unit-5 2
No ratings yet
Unit-5 2
9 pages
DSA_M5
No ratings yet
DSA_M5
38 pages
Hashing
No ratings yet
Hashing
30 pages
Hashing Data Structure
No ratings yet
Hashing Data Structure
22 pages
Lect Hashing
No ratings yet
Lect Hashing
36 pages
hashtables
No ratings yet
hashtables
21 pages
Unit 5 Session 5 Hashing
No ratings yet
Unit 5 Session 5 Hashing
20 pages
MODULE-5
No ratings yet
MODULE-5
33 pages
Hash Tables: A Detailed Description
No ratings yet
Hash Tables: A Detailed Description
10 pages
Hashing
No ratings yet
Hashing
8 pages
ADI Hashing
No ratings yet
ADI Hashing
47 pages
Unit 3.4 Hashing Techniques
No ratings yet
Unit 3.4 Hashing Techniques
7 pages
05 Hashing
No ratings yet
05 Hashing
47 pages
Hashing in Data Structures
No ratings yet
Hashing in Data Structures
8 pages
HAshing (Satish sir)
No ratings yet
HAshing (Satish sir)
52 pages
Hashing and Skiplist_removed
No ratings yet
Hashing and Skiplist_removed
113 pages
Hash Function
No ratings yet
Hash Function
4 pages
ADS Unit-2
No ratings yet
ADS Unit-2
53 pages
Hashing
No ratings yet
Hashing
44 pages
Hash Tables in DS
No ratings yet
Hash Tables in DS
14 pages
Unit-5
No ratings yet
Unit-5
50 pages
Unit28 Hashing1
No ratings yet
Unit28 Hashing1
19 pages
Introduction To Hashing & Hashing Techniques: Review of Searching Techniques
No ratings yet
Introduction To Hashing & Hashing Techniques: Review of Searching Techniques
19 pages
Hashing Algorithms
No ratings yet
Hashing Algorithms
22 pages
Unit 5 Data Structure
No ratings yet
Unit 5 Data Structure
12 pages
Dsa Module 6 Ktuassist
No ratings yet
Dsa Module 6 Ktuassist
9 pages
Hashing
No ratings yet
Hashing
34 pages
Module 5 Hashing
No ratings yet
Module 5 Hashing
66 pages
Hash Functions
No ratings yet
Hash Functions
9 pages
UNIT 1- Hashing
No ratings yet
UNIT 1- Hashing
118 pages
Hashing
No ratings yet
Hashing
20 pages
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
No ratings yet
Analysis of Algorithms CS 477/677: Hashing Instructor: George Bebis
53 pages
HASHING
No ratings yet
HASHING
63 pages
DSA Lab 11 Hashing
No ratings yet
DSA Lab 11 Hashing
9 pages
Dsa 4
No ratings yet
Dsa 4
55 pages
Hashing Techniques
No ratings yet
Hashing Techniques
15 pages
Dsa Module 6 Ktustudents - in
No ratings yet
Dsa Module 6 Ktustudents - in
9 pages
22CS302_LM21
No ratings yet
22CS302_LM21
7 pages
DSAU1HASH
No ratings yet
DSAU1HASH
21 pages
Lecture 8 Hashing
No ratings yet
Lecture 8 Hashing
47 pages
On Innovation Models: Ziya G. Boyacigiller
No ratings yet
On Innovation Models: Ziya G. Boyacigiller
24 pages
111-1 CS3B BlockDiagram
No ratings yet
111-1 CS3B BlockDiagram
11 pages
Imdp12013-61e 020
No ratings yet
Imdp12013-61e 020
149 pages
College Faculty List of Seminars/Training Attended First Semester S.Y. 2021-2022
No ratings yet
College Faculty List of Seminars/Training Attended First Semester S.Y. 2021-2022
2 pages
NA DS-3811-0819-NA-Cloud-Vol-ONTAP - DS
No ratings yet
NA DS-3811-0819-NA-Cloud-Vol-ONTAP - DS
4 pages
OWASP Webinar Script
No ratings yet
OWASP Webinar Script
4 pages
ZATCA E-Invoicing Customized Solution in SAP ECC - S4HANA
100% (1)
ZATCA E-Invoicing Customized Solution in SAP ECC - S4HANA
7 pages
JKO Mobile Quickstart PDF
No ratings yet
JKO Mobile Quickstart PDF
1 page
Penerapan Prinsip Rule of Reason Pada Putusan Perkara Nomor 08/KPPU-I/2020 Tentang Dugaan Praktik Diskriminasi Antara Telkom-Telkomsel Dan Netflix
No ratings yet
Penerapan Prinsip Rule of Reason Pada Putusan Perkara Nomor 08/KPPU-I/2020 Tentang Dugaan Praktik Diskriminasi Antara Telkom-Telkomsel Dan Netflix
14 pages
The Role of Artificial Intelligence in Modern Healthcare
No ratings yet
The Role of Artificial Intelligence in Modern Healthcare
2 pages
LCS 6th Lab Manual
No ratings yet
LCS 6th Lab Manual
12 pages
ISO27k ISMS 4.4 Implementation and Certification Process 2022
No ratings yet
ISO27k ISMS 4.4 Implementation and Certification Process 2022
1 page
DJI OM5 Mobile - Phone - Compatibility - List - EN
No ratings yet
DJI OM5 Mobile - Phone - Compatibility - List - EN
1 page
2023 V14i6041
No ratings yet
2023 V14i6041
8 pages
Use of Coconut Shell As An Aggregate in Concrete: A Review
No ratings yet
Use of Coconut Shell As An Aggregate in Concrete: A Review
2 pages
写作作品集
100% (1)
写作作品集
6 pages
RF 02 LRN Reactivation
No ratings yet
RF 02 LRN Reactivation
26 pages
Propeller Slip
100% (1)
Propeller Slip
6 pages
Smart 700 LT Cabinet: Model: A70/1NE Cod: A12070100101
No ratings yet
Smart 700 LT Cabinet: Model: A70/1NE Cod: A12070100101
2 pages
BALICK Final Minus Index 9aug Chapters 1 and 2
No ratings yet
BALICK Final Minus Index 9aug Chapters 1 and 2
84 pages
Smart Safety Lock
No ratings yet
Smart Safety Lock
37 pages
Introduction To Logistics Management
No ratings yet
Introduction To Logistics Management
29 pages
Technical Manual for Air-Cooled Modular Chiller - (FCH01-2022,23) (2)
No ratings yet
Technical Manual for Air-Cooled Modular Chiller - (FCH01-2022,23) (2)
39 pages
UCJV300 Multilayer Printing Guide
No ratings yet
UCJV300 Multilayer Printing Guide
30 pages
Meeting Vii
No ratings yet
Meeting Vii
10 pages
CH 8
No ratings yet
CH 8
52 pages
1.4 Raised Floor
No ratings yet
1.4 Raised Floor
2 pages
4 DP VX-39 Dogis Pipe. Drift ID. With Int. Coated Pipe
100% (2)
4 DP VX-39 Dogis Pipe. Drift ID. With Int. Coated Pipe
3 pages

Hashing

Uploaded by

Hashing

Uploaded by

pythonds This Chapter Saving and Logging are Disabled    ✏ 

A hash table is a collection of items which are stored in such a way

Figure 4: Hash Table with 11 Empty Slots

Table 4: Simple Hash Function Using Remainders

Item Hash Value

Figure 5: Hash Table with Six Items

6.5.1. Hash Functions

The folding method for constructing hash functions begins by

Another numerical technique for constructing a hash function is called

Table 5: Comparison of Remainder and Mid-Square Methods

Item Remainder Mid-Square

We can also create hash functions for character-based items such as

Figure 6: Hashing a String Using Ordinal Values

def hash(astring, tablesize):

for pos in range(len(astring)):

sum = sum + ord(astring[pos])

It is interesting to note that when using this hash function, anagrams

Figure 7: Hashing a String Using Ordinal Values with Weighting

You may be able to think of a number of additional ways to compute hash

6.5.2. Collision Resolution

Figure 8 shows an extended set of integer items under the

Again, 55 should go in slot 0 but must be placed in slot 2 since it is

Figure 8: Collision Resolution with Linear Probing

A disadvantage to linear probing is the tendency for clustering;

Figure 9: A Cluster of Items for Slot 0

One way to deal with clustering is to extend the linear probing

Figure 10: Collision Resolution Using “Plus 3”

rehash(pos) = (pos + 1)%sizeof table. The “plus 3” rehash

rehash(pos) = (pos + 3)%sizeof table. In

rehash(pos) = (pos + skip)%sizeof table. It is

be such that all the

A variation of the linear probing idea is called quadratic probing.

Figure 11: Collision Resolution with Quadratic Probing

An alternative method for handling the collision problem is to allow

Figure 12: Collision Resolution with Chaining

Activity: 6.5.2.1 Multiple Choice (HASH_1)

Activity: 6.5.2.2 Multiple Choice (HASH_2)

6.5.3. Implementing the Map Abstract Data Type

The map abstract data type is defined as follows. The structure is an

Map() Create a new, empty map. It returns an empty map

In Listing 2 we use two lists to create a

pythonds This Chapter Saving and Logging are Disabled    ✏ 

self.slots = [None] * self.size

self.data = [None] * self.size

hashfunction implements the simple remainder method. The collision

self.data[hashvalue] = data #replace

while self.slots[nextslot] != None and \

self.data[nextslot] = data #replace

Likewise, the get function (see Listing 4)

The final methods of the HashTable class provide additional

8 while self.slots[position] != None and \

9 not found and not stop:

The following session shows the HashTable class in action. First we

['bird', 'goat', 'pig', 'chicken', 'dog', 'lion',

'tiger', None, None, 'cow', 'cat']

['bird', 'goat', 'pig', 'duck', 'dog', 'lion',

'tiger', None, None, 'cow', 'cat']

The complete hash table example can be found in ActiveCode 1.

Save & Run Show Code Show CodeLens

Activity: 6.5.3.1 Complete Hash Table Example (hashtablecomplete)

6.5.4. Analysis of Hashing

The most important piece of information we need to analyze the use of a

As before, we will have a result for both a successful and an

You have attempted 1 of 4 activities on this page

user not logged in

You might also like