0% found this document useful (0 votes)

517 views

Find - S Algorithm

The Find-S algorithm is a concept learning algorithm that finds the most specific hypothesis that fits all the positive training examples by starting with the most specific hypothesis and generalizing it for each positive example where attributes do not match. It initializes the hypothesis to the most specific representation and replaces attribute values with '?' for positive examples where the attribute value does not match the hypothesis to generalize it, ignoring negative examples. The final hypothesis after processing all examples fits all positive examples in the most general way.

Uploaded by

Nagamanju SureshKumar

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

517 views

Find - S Algorithm

Uploaded by

Nagamanju SureshKumar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Find S Algorithm

The find-S algorithm is a basic concept learning

algorithm in machine learning. The find-S algorithm
finds the most specific hypothesis that fits all the positive
examples. We have to note here that the algorithm
considers only those positive training example.

Introduction :
The find-S algorithm starts with the most specific
hypothesis and generalizes this hypothesis each time it
fails to classify an observed positive training data.
Hence, the Find-S algorithm moves from the most
specific hypothesis to the most general hypothesis.
Important Representation :

1. ? indicates that any value is acceptable for the

attribute.
2. specify a single required value ( e.g., Cold ) for
the attribute.
3. ϕindicates that no value is acceptable.
4. The most general hypothesis is represented
by: {?, ?, ?, ?, ?, ?}
5. The most specific hypothesis is represented
by: {ϕ, ϕ, ϕ, ϕ, ϕ, ϕ}
Steps Involved In Find-S :

1. Start with the most specific hypothesis.

h = {ϕ, ϕ, ϕ, ϕ, ϕ, ϕ}
2. Take the next example and if it is negative, then
no changes occur to the hypothesis.
3. If the example is positive and we find that our
initial hypothesis is too specific then we update
our current hypothesis to a general condition.
4. Keep repeating the above steps till all the training
examples are complete.
5. After we have completed all the training examples
we will have the final hypothesis when can use to
classify the new examples.
Example :
Consider the following data set having the data about
which particular seeds are poisonous.

First, we consider the hypothesis to be a more specific

hypothesis. Hence, our hypothesis would be :
h = {ϕ, ϕ, ϕ, ϕ, ϕ, ϕ}

Consider example 1 :
The data in example 1 is { GREEN, HARD, NO,
WRINKLED }. We see that our initial hypothesis is
more specific and we have to generalize it for this
example. Hence, the hypothesis becomes :
h = { GREEN, HARD, NO, WRINKLED }
Consider example 2 :
Here we see that this example has a negative outcome.
Hence we neglect this example and our hypothesis
remains the same.
h = { GREEN, HARD, NO, WRINKLED }
Consider example 3 :
Here we see that this example has a negative outcome.
Hence we neglect this example and our hypothesis
remains the same.
h = { GREEN, HARD, NO, WRINKLED }
Consider example 4 :
The data present in example 4 is { ORANGE, HARD,
NO, WRINKLED }. We compare every single attribute
with the initial data and if any mismatch is found we
replace that particular attribute with a general case ( ” ?
” ). After doing the process the hypothesis becomes :
h = { ?, HARD, NO, WRINKLED }
Consider example 5 :
The data present in example 5 is { GREEN, SOFT, YES,
SMOOTH }. We compare every single attribute with the
initial data and if any mismatch is found we replace that
particular attribute with a general case ( ” ? ” ). After
doing the process the hypothesis becomes :
h = { ?, ?, ?, ? }
Since we have reached a point where all the attributes in
our hypothesis have the general condition, example 6 and
example 7 would result in the same hypothesizes with all
general attributes.
h = { ?, ?, ?, ? }
Hence, for the given data the final hypothesis would be
:
Final Hyposthesis: h = { ?, ?, ?, ? }

Algorithm :

1. Initialize h to the most specific hypothesis in H

2. For each positive training instance x

For each attribute constraint a, in h

If the constraint a, is satisfied by x

Then do nothing

Else replace a, in h by the next more general

constraint that is satisfied by x

3. Output hypothesis h

How To Implement Find-S Algorithm In Machine

Learning?

In Machine Learning, concept learning can be termed as

“a problem of searching through a predefined space of
potential hypothesis for the hypothesis that best fits the
training examples” – Tom Mitchell. In this article, we
will go through one such concept learning algorithm
known as the Find-S algorithm. The following topics are
discussed in this article.
• What is Find-S Algorithm in Machine Learning?
• How Does it Work?
• Limitations of Find-S Algorithm
• Implementation of Find-S Algorithm
• Use Case

What is Find-S Algorithm in Machine Learning?

In order to understand Find-S algorithm, you need to

have a basic idea of the following concepts as well:

1. Concept Learning
2. General Hypothesis
3. Specific Hypothesis

1. Concept Learning

Let’s try to understand concept learning with a real-life

example. Most of human learning is based on past
instances or experiences. For example, we are able to
identify any type of vehicle based on a certain set of
features like make, model, etc., that are defined over a
large set of features.

These special features differentiate the set of cars, trucks,

etc from the larger set of vehicles. These features that
define the set of cars, trucks, etc are known as concepts.

Similar to this, machines can also learn from concepts to

identify whether an object belongs to a specific category
or not. Any algorithm that supports concept learning
requires the following:

• Training Data
• Target Concept
• Actual Data Objects

2. General Hypothesis

Hypothesis, in general, is an explanation for something.

The general hypothesis basically states the general
relationship between the major variables. For example, a
general hypothesis for ordering food would be I want a
burger.
G = { ‘?’, ‘?’, ‘?’, …..’?’}

3. Specific Hypothesis

The specific hypothesis fills in all the important details

about the variables given in the general hypothesis. The
more specific details into the example given above would
be I want a cheeseburger with a chicken pepperoni
filling with a lot of lettuce.

S = {‘Φ’,’Φ’,’Φ’, ……,’Φ’}

Python Machine Learning Certification Training

• Instructor-led Live Sessions

• Real-life Case Studies
• Assignments
• Lifetime Access

Explore Curriculum
Now ,let’s talk about the Find-S Algorithm in Machine
Learning.

The Find-S algorithm follows the steps written below:

1. Initialize ‘h’ to the most specific hypothesis.

2. The Find-S algorithm only considers the positive
examples and eliminates negative examples. For
each positive example, the algorithm checks for
each attribute in the example. If the attribute value
is the same as the hypothesis value, the algorithm
moves on without any changes. But if the attribute
value is different than the hypothesis value, the
algorithm changes it to ‘?’.

Now that we are done with the basic explanation of the

Find-S algorithm, let us take a look at how it works.

How Does It Work?

1. The process starts with initializing ‘h’ with the most
specific hypothesis, generally, it is the first positive
example in the data set.
2. We check for each positive example. If the example
is negative, we will move on to the next example but
if it is a positive example we will consider it for the
next step.
3. We will check if each attribute in the example is
equal to the hypothesis value.
4. If the value matches, then no changes are made.
5. If the value does not match, the value is changed to
‘?’.
6. We do this until we reach the last positive example
in the data set.

Limitations of Find-S Algorithm

There are a few limitations of the Find-S algorithm listed

down below:

1. There is no way to determine if the hypothesis is

consistent throughout the data.
2. Inconsistent training sets can actually mislead the
Find-S algorithm, since it ignores the negative
examples.
3. Find-S algorithm does not provide a backtracking
technique to determine the best possible changes
that could be done to improve the resulting
hypothesis.

Implementation of Find-S Algorithm

To understand the implementation, let us try to
implement it to a smaller data set with a bunch of
examples to decide if a person wants to go for a walk.

The concept of this particular problem will be on what

days does a person likes to go on walk.

Weath Temperat Compa Humid Go

Time Wind
er ure ny ity es
Morni Stron
Sunny Warm Yes Mild Yes
ng g
Eveni Norm
Rainy Cold No Mild No
ng al
Morni Norm
Sunny Moderate Yes Normal Yes
ng al
Eveni Stron
Sunny Cold Yes High Yes
ng g
Looking at the data set, we have six attributes and a final
attribute that defines the positive or negative example. In
this case, yes is a positive example, which means the
person will go for a walk.
So now, the general hypothesis is:

Next
h0 = {‘Morning’, ‘Sunny’, ‘Warm’, ‘Yes’, ‘Mild’,
‘Strong’}

This is our general hypothesis, and now we will consider

each example one by one, but only the positive examples.

h1= {‘Morning’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}

h2 = {‘?’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}

We replaced all the different values in the general

hypothesis to get a resultant hypothesis. Now that we
know how the Find-S algorithm works, let us take a look
at an implementation using Python.

Use Case
Let’s try to implement the above example using Python.
The code to implement the Find-S algorithm using the
above data is given below.

1 import pandas as pd
2 import numpy as np
3
4 #to read the data in the csv file
5 data = pd.read_csv("data.csv")
6 print(data,"n")
7
8 #making an array of all the attributes
9 d = np.array(data)[:,:-1]
10print("n The attributes are: ",d)
11
12#segragating the target that has positive and negative examples
13target = np.array(data)[:,-1]
14print("n The target is: ",target)
15
16#training function to implement find-s algorithm
17def train(c,t):
18 for i, val in enumerate(t):
19 if val == "Yes":
20 specific_hypothesis = c[i].copy()
21 break
22
23 for i, val in enumerate(c):
24 if t[i] == "Yes":
25 for x in range(len(specific_hypothesis)):
26 if val[x] != specific_hypothesis[x]:
27 specific_hypothesis[x] = '?'
28 else:
29 pass
30
31 return specific_hypothesis
32
33#obtaining the final hypothesis
34print("n The final hypothesis is:",train(d,target))
Output:
U Tube link

1. (2817) Find-S Algorithm (concept) | Machine

Learning (2018) - YouTube
2. (2817) FIND S Algorithm | Finding A Maximally
Specific Hypothesis | Solved Example - 1 by
Mahesh Huddar - YouTube
3. (2817) Machine Learning | Find-S Algorithm -
YouTube( EASY)

Unit Iii
No ratings yet
Unit Iii
20 pages
Unit-Iii: A Weather Dataset
No ratings yet
Unit-Iii: A Weather Dataset
12 pages
Assignment10 Guymason
No ratings yet
Assignment10 Guymason
1 page
Eliciting Requirements
No ratings yet
Eliciting Requirements
20 pages
IOT Mod4@AzDOCUMENTS - in
No ratings yet
IOT Mod4@AzDOCUMENTS - in
17 pages
Locating Mobile Entities in Distributed Systems
67% (3)
Locating Mobile Entities in Distributed Systems
2 pages
Algorithms Flowcharts Notes
100% (4)
Algorithms Flowcharts Notes
4 pages
Difference Between Semaphore and Monitor
100% (1)
Difference Between Semaphore and Monitor
8 pages
Lecture 6 Data Preprocessing
No ratings yet
Lecture 6 Data Preprocessing
59 pages
OS Viva Question
No ratings yet
OS Viva Question
6 pages
BCA 4th Sem Operating System Unit 1 PPT Slides
No ratings yet
BCA 4th Sem Operating System Unit 1 PPT Slides
28 pages
Unit - 1 Iot and Applications
No ratings yet
Unit - 1 Iot and Applications
14 pages
Cloud: Figure - The Symbol Used To Denote The Boundary of A Cloud Environment
No ratings yet
Cloud: Figure - The Symbol Used To Denote The Boundary of A Cloud Environment
27 pages
Cooperative Process: Prepared & Presented By: Abdul Rehman & Muddassar Ali
No ratings yet
Cooperative Process: Prepared & Presented By: Abdul Rehman & Muddassar Ali
18 pages
Chapter 4 Database Security
No ratings yet
Chapter 4 Database Security
10 pages
UNIT 2 DMW
No ratings yet
UNIT 2 DMW
26 pages
Data Mining-Unit 3-Part1
No ratings yet
Data Mining-Unit 3-Part1
41 pages
Q.1 Explain Memory Management Requirements?: - The Available Memory Is Generally Shared Among A
No ratings yet
Q.1 Explain Memory Management Requirements?: - The Available Memory Is Generally Shared Among A
10 pages
Distributed Database Design Concept
No ratings yet
Distributed Database Design Concept
5 pages
Normalization
No ratings yet
Normalization
14 pages
Unit-I OSI Security Architecture
No ratings yet
Unit-I OSI Security Architecture
14 pages
Chapter 7 Common Standard in Cloud Computing: Working Group
No ratings yet
Chapter 7 Common Standard in Cloud Computing: Working Group
6 pages
Design Patterns Lab
No ratings yet
Design Patterns Lab
29 pages
COA Chapter 6
No ratings yet
COA Chapter 6
6 pages
SIMD Computer Organizations
0% (1)
SIMD Computer Organizations
20 pages
Modeling and Detection of Camouflaging Worm
No ratings yet
Modeling and Detection of Camouflaging Worm
37 pages
Unit 2 Information Security complete notes
No ratings yet
Unit 2 Information Security complete notes
84 pages
Cellular Digital Packet Data
No ratings yet
Cellular Digital Packet Data
27 pages
Unit II Cloud Computing
100% (1)
Unit II Cloud Computing
9 pages
CO Unit 1-2
No ratings yet
CO Unit 1-2
14 pages
CN UNIT-2 Notes
No ratings yet
CN UNIT-2 Notes
70 pages
JNTUA JNTUH JNTUK - B Tech - 3 1 - Lecture Notes - MECH - Operations Research or Lecture Notes
No ratings yet
JNTUA JNTUH JNTUK - B Tech - 3 1 - Lecture Notes - MECH - Operations Research or Lecture Notes
60 pages
Horspool Algorithm
No ratings yet
Horspool Algorithm
6 pages
Unit 4 Ai
100% (2)
Unit 4 Ai
16 pages
Unit III - SPM
No ratings yet
Unit III - SPM
13 pages
R20-Atcd-Q.p - Model Paper.
100% (1)
R20-Atcd-Q.p - Model Paper.
3 pages
ML-Lab Manual - NEP - DSS
No ratings yet
ML-Lab Manual - NEP - DSS
23 pages
Differences Between TCP and UDP - GeeksforGeeks
No ratings yet
Differences Between TCP and UDP - GeeksforGeeks
8 pages
Cloud Platform Architecture Over
No ratings yet
Cloud Platform Architecture Over
71 pages
Advanced Computer Architecture: Program Flow Mechanisms
No ratings yet
Advanced Computer Architecture: Program Flow Mechanisms
14 pages
Managing State: 5.1 The Problem of State in Web Applications
No ratings yet
Managing State: 5.1 The Problem of State in Web Applications
17 pages
Railway Reservation System
0% (1)
Railway Reservation System
15 pages
Unit-3-Greedy Method PDF
No ratings yet
Unit-3-Greedy Method PDF
22 pages
Workers Behind The Scene in Dbms in Detail
No ratings yet
Workers Behind The Scene in Dbms in Detail
6 pages
File System Implementation
No ratings yet
File System Implementation
38 pages
Computer Forensics Evidence and Capture: Data Recovery
No ratings yet
Computer Forensics Evidence and Capture: Data Recovery
15 pages
Algorithms As A Technology
No ratings yet
Algorithms As A Technology
4 pages
How Does A Single Bit Error Differs From Burst Error.
No ratings yet
How Does A Single Bit Error Differs From Burst Error.
4 pages
STM Viva Que
100% (2)
STM Viva Que
54 pages
Green Cloud Computing
No ratings yet
Green Cloud Computing
23 pages
Delays in Computer Networks
No ratings yet
Delays in Computer Networks
5 pages
Algorithm For Asynchronous Check Pointing and Recovery
No ratings yet
Algorithm For Asynchronous Check Pointing and Recovery
4 pages
Module-4 Cloud Computing Architecture PDF
No ratings yet
Module-4 Cloud Computing Architecture PDF
19 pages
6 Android UI Architecture
No ratings yet
6 Android UI Architecture
24 pages
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
12 pages
Unit 3 Topic 4 Java Interfaces To HDFS
0% (1)
Unit 3 Topic 4 Java Interfaces To HDFS
15 pages
Cs3451 Ios Unit 5 Notes
No ratings yet
Cs3451 Ios Unit 5 Notes
21 pages
S Algorithm
No ratings yet
S Algorithm
19 pages
MLDM 230207120936 121018d3
No ratings yet
MLDM 230207120936 121018d3
8 pages
Ex.no.2_Find S Algorithm
No ratings yet
Ex.no.2_Find S Algorithm
3 pages
Machine Learning Notes Unit 1
No ratings yet
Machine Learning Notes Unit 1
25 pages
1. LAWS OF LEADERSHIP - LESSON 1
No ratings yet
1. LAWS OF LEADERSHIP - LESSON 1
32 pages
Congo PPT - Ss
No ratings yet
Congo PPT - Ss
10 pages
RE API Installation Guide
No ratings yet
RE API Installation Guide
4 pages
Funny Stories
No ratings yet
Funny Stories
5 pages
Homework 2
0% (1)
Homework 2
2 pages
Injection
No ratings yet
Injection
8 pages
3rd Semester Marksheet
100% (1)
3rd Semester Marksheet
2 pages
Change Control Process
No ratings yet
Change Control Process
14 pages
Shawarma Damascus - Google Search
No ratings yet
Shawarma Damascus - Google Search
1 page
Department of Education: Application For Leave
100% (1)
Department of Education: Application For Leave
1 page
Service Processes OK
No ratings yet
Service Processes OK
16 pages
10-TBT-07 Eye Injury Protection Week-7
No ratings yet
10-TBT-07 Eye Injury Protection Week-7
4 pages
09-PhysicalSecurity
No ratings yet
09-PhysicalSecurity
79 pages
Unit 4: ISO 9001:2008 Quality Management System
No ratings yet
Unit 4: ISO 9001:2008 Quality Management System
4 pages
Orbic Journey rc2200l User Guide Manual Eng v7 06092020
No ratings yet
Orbic Journey rc2200l User Guide Manual Eng v7 06092020
52 pages
Application of Linear Inequalities Linear Programming
No ratings yet
Application of Linear Inequalities Linear Programming
21 pages
Understanding Employment Standards in Tanzania
No ratings yet
Understanding Employment Standards in Tanzania
27 pages
Topic 2 - Pierce - Clinical Field Experience A - Informal Observations
No ratings yet
Topic 2 - Pierce - Clinical Field Experience A - Informal Observations
4 pages
In The University of Chakwal: Form of Application For The Use of Candidates For Appointment
No ratings yet
In The University of Chakwal: Form of Application For The Use of Candidates For Appointment
9 pages
Hernandez Vs Andal
No ratings yet
Hernandez Vs Andal
4 pages
Benchman 5000 Manual
No ratings yet
Benchman 5000 Manual
133 pages
Customer Satisfaction
No ratings yet
Customer Satisfaction
81 pages
Excel and Mathcad Tutorial On Iterative Solutions of Nonlinear Equations
No ratings yet
Excel and Mathcad Tutorial On Iterative Solutions of Nonlinear Equations
3 pages
GATE Books For Electrical Engineering
No ratings yet
GATE Books For Electrical Engineering
2 pages
Developing A GUI in C++ and DirectX
No ratings yet
Developing A GUI in C++ and DirectX
49 pages
Jsa Cathodic Protection
No ratings yet
Jsa Cathodic Protection
4 pages
MAKAUT 2020-2021 ODD Sem Theory Exam Schedule - All B.Tech BSC BCA and MTech
No ratings yet
MAKAUT 2020-2021 ODD Sem Theory Exam Schedule - All B.Tech BSC BCA and MTech
84 pages
October 09-13, 2023 (G11)
No ratings yet
October 09-13, 2023 (G11)
3 pages
It Appendices
No ratings yet
It Appendices
7 pages

Find - S Algorithm

Uploaded by

Find - S Algorithm

Uploaded by

Find S Algorithm

The find-S algorithm is a basic concept learning

1. ? indicates that any value is acceptable for the

1. Start with the most specific hypothesis.

First, we consider the hypothesis to be a more specific

1. Initialize h to the most specific hypothesis in H

2. For each positive training instance x

If the constraint a, is satisfied by x

Else replace a, in h by the next more general

How To Implement Find-S Algorithm In Machine

In Machine Learning, concept learning can be termed as

What is Find-S Algorithm in Machine Learning?

In order to understand Find-S algorithm, you need to

Let’s try to understand concept learning with a real-life

These special features differentiate the set of cars, trucks,

Similar to this, machines can also learn from concepts to

Hypothesis, in general, is an explanation for something.

The specific hypothesis fills in all the important details

Python Machine Learning Certification Training

• Instructor-led Live Sessions

The Find-S algorithm follows the steps written below:

1. Initialize ‘h’ to the most specific hypothesis.

Now that we are done with the basic explanation of the

How Does It Work?

Limitations of Find-S Algorithm

There are a few limitations of the Find-S algorithm listed

1. There is no way to determine if the hypothesis is

Implementation of Find-S Algorithm

The concept of this particular problem will be on what

Weath Temperat Compa Humid Go

This is our general hypothesis, and now we will consider

h1= {‘Morning’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}

h2 = {‘?’, ‘Sunny’, ‘?’, ‘Yes’, ‘?’, ‘?’}

We replaced all the different values in the general

1. (2817) Find-S Algorithm (concept) | Machine

You might also like