Machine Learning: Dr. Muhammad Asadullah

This document provides an overview of machine learning concepts, including:
- Machine learning allows computers to learn from data and predict outcomes.
- Datasets can include arrays or databases and contain numerical, categorical, or ordinal data.
- Important metrics for analyzing data include the mean, median, mode, standard deviation, and percentiles.
- Histograms can visualize the distribution of data in a dataset.

Uploaded by

Syed Kamran Ahmad1

MACHINE LEARNING

Dr. Muhammad Asadullah

MACHINE LEARNING
• Machine Learning is making the computer learn from studying data
and statistics.
• Machine Learning is a step in the direction of artificial intelligence
(AI).
• Machine Learning is a program that analyses data and learns to
predict the outcome.
Where To Start?
• We will go back to mathematics and study statistics, and how to
calculate important numbers based on data sets.
• We will also learn how to use various Python modules to get the
answers we need.
• And we will learn how to make functions that are able to predict the
outcome based on what we have learned.
Data Set
• In the mind of a computer, a data set is any collection of data. It can
be anything from an array to a complete database.
• Example of an array
[99,86,87,88,111,86,103,87,94,78,77,85,86]
Data Set
• Example of a database:
Carname  Color  Age  Speed  AutoPass
BMW      red    5    99     Y
Volvo    black  7    86     Y
VW       gray   8    87     N
VW       white  7    88     Y
Ford     white  2    111    Y
VW       white  17   86     Y
Tesla    red    2    103    Y
BMW      black  9    87     Y
Volvo    gray   4    94     N
Ford     white  11   78     N
Toyota   gray   12   77     N
VW       white  9    85     N
Toyota   blue   6    86     Y
Data Set
• By looking at the array, we can guess that the average value is probably around
80 or 90, and we are also able to determine the highest value and the lowest
value, but what else can we do?
• And by looking at the database we can see that the most popular color is
white, and the oldest car is 17 years, but what if we could predict if a car had
an AutoPass, just by looking at the other values?
• That is what Machine Learning is for! Analyzing data and predicting the
outcome!
• In Machine Learning it is common to work with very large data sets.
• We will try to make it as easy as possible to understand the different concepts
of machine learning, and we will work with small easy-to-understand data sets.
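The observations above (most popular color, oldest car) can also be checked in code. The sketch below is one possible way to hold the example table in plain Python; the names `cars`, `most_common_color`, and `oldest_age` are illustrative assumptions and do not come from the slides.

```python
from collections import Counter

# The example database, one tuple per car:
# (carname, color, age, speed, autopass)
cars = [
    ("BMW", "red", 5, 99, "Y"),
    ("Volvo", "black", 7, 86, "Y"),
    ("VW", "gray", 8, 87, "N"),
    ("VW", "white", 7, 88, "Y"),
    ("Ford", "white", 2, 111, "Y"),
    ("VW", "white", 17, 86, "Y"),
    ("Tesla", "red", 2, 103, "Y"),
    ("BMW", "black", 9, 87, "Y"),
    ("Volvo", "gray", 4, 94, "N"),
    ("Ford", "white", 11, 78, "N"),
    ("Toyota", "gray", 12, 77, "N"),
    ("VW", "white", 9, 85, "N"),
    ("Toyota", "blue", 6, 86, "Y"),
]

# Count how often each color occurs and pick the most frequent one.
color_counts = Counter(color for _, color, _, _, _ in cars)
most_common_color, color_count = color_counts.most_common(1)[0]

# The oldest car is the one with the largest age.
oldest_age = max(age for _, _, age, _, _ in cars)

print(most_common_color, color_count)  # white 5
print(oldest_age)                      # 17
```

Simple counts like these describe the data. Predicting the AutoPass column from the other columns is the part that needs Machine Learning.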
Data Types
• To analyze data, it is important to know what type of data we are dealing with.
• We can split the data types into three main categories:
• Numerical
• Categorical
• Ordinal
• Numerical data are numbers, and can be split into two numerical categories:
• Discrete Data
- numbers that are limited to integers. Example: The number of cars passing by.
• Continuous Data
- numbers that can take any value within a range, not just whole numbers. Example: The price of an item, or the size of an
item
Data Types
• Categorical data are values that cannot be measured up against each
other. Example: a color value, or any yes/no values.
• Ordinal data are like categorical data, but can be measured up against
each other. Example: school grades where A is better than B and so
on.
• By knowing the data type of your data source, you will be able to
know what technique to use when analyzing them.
• You will learn more about statistics and analyzing data in the next
chapters.
Mean, Median, and Mode
• What can we learn from looking at a group of numbers?
• In Machine Learning (and in mathematics) there are often three
values that interest us:
• Mean - The average value
• Median - The mid point value
• Mode - The most common value
• Example: We have registered the speed of 13 cars:
• speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
• What is the average, the middle, or the most common speed value?
Mean
• The mean value is the average value.
• To calculate the mean, find the sum of all values, and divide the sum
by the number of values:
• (99+86+87+88+111+86+103+87+94+78+77+85+86) / 13 = 89.77
• The NumPy module has a method for this:
• Example
Mean
import numpy

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]

x = numpy.mean(speed)

print(x)
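The same calculation can be written out step by step without NumPy. This is a minimal sketch of the formula above; the variable names `total`, `count`, and `mean` are illustrative.

```python
speed = [99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86]

# Mean = sum of all values divided by the number of values.
total = sum(speed)
count = len(speed)
mean = total / count

print(total, count)    # 1167 13
print(round(mean, 2))  # 89.77
```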
Median
• The median value is the value in the middle, after you have sorted all
the values:
77, 78, 85, 86, 86, 86, 87, 87, 88, 94, 99, 103, 111

• It is important that the numbers are sorted before you can find the
median.
• The NumPy module has a method for this:
Median
import numpy

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]

x = numpy.median(speed)

print(x)
Median
• If there are two numbers in the middle, divide the sum of those numbers by two:
77, 78, 85, 86, 86, 86, 87, 87, 88, 94, 99, 103

(86 + 87) / 2 = 86.5
Median
import numpy

speed = [99,86,87,88,86,103,87,94,78,77,85,86]

x = numpy.median(speed)

print(x)
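To see what happens behind the NumPy call, the median can be sketched in plain Python. The helper name `median` and the list names `speed_13` and `speed_12` are assumptions made for this illustration. The function sorts the values, then returns the single middle value for an odd count, or the average of the two middle values for an even count.

```python
def median(values):
    """Return the middle value of a list (illustrative helper)."""
    ordered = sorted(values)      # the values must be sorted first
    n = len(ordered)
    middle = n // 2
    if n % 2 == 1:
        return ordered[middle]    # odd count: one middle value
    # Even count: average the two middle values.
    return (ordered[middle - 1] + ordered[middle]) / 2

speed_13 = [99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86]
speed_12 = [99, 86, 87, 88, 86, 103, 87, 94, 78, 77, 85, 86]

print(median(speed_13))  # 87
print(median(speed_12))  # 86.5
```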
Mode
• The Mode value is the value that appears most often. In this list, 86 appears three times:
99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86

• The SciPy module has a method for this

Mode
from scipy import stats

speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]

x = stats.mode(speed)

print(x)
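The mode can also be found by counting occurrences with the standard library. This is a sketch using `collections.Counter` instead of SciPy; the names `counts`, `mode_value`, and `mode_count` are illustrative.

```python
from collections import Counter

speed = [99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86]

# Count how many times each value occurs.
counts = Counter(speed)

# most_common(1) returns a list with the single most frequent (value, count) pair.
mode_value, mode_count = counts.most_common(1)[0]

print(mode_value, mode_count)  # 86 3
```

If several values are tied for the highest count, `Counter.most_common` returns the one it encountered first, which may differ from the choice another library makes.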
What is Standard Deviation?
• Standard deviation is a number that describes how spread out the
values are.
• A low standard deviation means that most of the numbers are close
to the mean (average) value.
• A high standard deviation means that the values are spread out over a
wider range.
• Example: This time we have registered the speed of 7 cars:
• speed = [86,87,88,86,87,85,86]
• The standard deviation is: 0.9
Standard Deviation
• Meaning that most of the values are within the range of 0.9 from the mean
value, which is 86.4.
• Let us do the same with a selection of numbers with a wider range:
• speed = [32,111,138,28,59,77,97]
• The standard deviation is: 37.85
• Meaning that most of the values are within the range of 37.85 from the
mean value, which is 77.4.
• As you can see, a higher standard deviation indicates that the values are
spread out over a wider range.
• The NumPy module has a method to calculate the standard deviation:
Standard Deviation
import numpy

speed = [86,87,88,86,87,85,86]

x = numpy.std(speed)

print(x)
Standard Deviation
import numpy

speed = [32,111,138,28,59,77,97]

x = numpy.std(speed)

print(x)
Variance
• Variance is another number that indicates how spread out the values
are.
• In fact, if you take the square root of the variance, you get the
standard deviation!
• Or the other way around, if you multiply the standard deviation by
itself, you get the variance!
Variance
• To calculate the variance you have to do as follows:
• 1. Find the mean:
• (32+111+138+28+59+77+97) / 7 = 77.4
• 2. For each value: find the difference from the mean:
 32 - 77.4 = -45.4
111 - 77.4 =  33.6
138 - 77.4 =  60.6
 28 - 77.4 = -49.4
 59 - 77.4 = -18.4
 77 - 77.4 =  -0.4
 97 - 77.4 =  19.6
Variance
• 3. For each difference: find the square value:
(-45.4)² = 2061.16
 (33.6)² = 1128.96
 (60.6)² = 3672.36
(-49.4)² = 2440.36
(-18.4)² =  338.56
 (-0.4)² =    0.16
 (19.6)² =  384.16

• 4. The variance is the average of these squared differences:

(2061.16+1128.96+3672.36+2440.36+338.56+0.16+384.16) / 7 = 1432.2
Luckily, NumPy has a method to calculate the variance:
Variance
import numpy

speed = [32,111,138,28,59,77,97]

x = numpy.var(speed)

print(x)
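The four steps above can be followed line by line in plain Python. This is a sketch of the calculation with illustrative variable names. Because it keeps the unrounded mean (77.428...) instead of 77.4, the variance comes out as 1432.24 rather than the hand-calculated 1432.2.

```python
import math

speed = [32, 111, 138, 28, 59, 77, 97]

# Step 1: find the mean.
mean = sum(speed) / len(speed)

# Step 2: for each value, find the difference from the mean.
differences = [value - mean for value in speed]

# Step 3: square each difference.
squared = [d ** 2 for d in differences]

# Step 4: the variance is the average of the squared differences.
variance = sum(squared) / len(speed)

# The standard deviation is the square root of the variance.
std_dev = math.sqrt(variance)

print(round(mean, 1))      # 77.4
print(round(variance, 2))  # 1432.24
print(round(std_dev, 2))   # 37.85
```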
Standard Deviation
• As we have learned, the formula to find the standard deviation is the
square root of the variance:
√1432.25 = 37.85

• Or, as in the example from before, use NumPy to calculate the
standard deviation:
Standard Deviation
import numpy

speed = [32,111,138,28,59,77,97]

x = numpy.std(speed)

print(x)
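One detail worth knowing: by default, numpy.std() and numpy.var() divide by the number of values n (the population formulas, as in the steps shown earlier). Many statistics texts divide by n - 1 when the data is only a sample; in NumPy that is requested with the ddof=1 argument. The sketch below shows the difference using the standard library's statistics module, chosen here only so the example runs without extra packages.

```python
import statistics

speed = [32, 111, 138, 28, 59, 77, 97]

# Population standard deviation: divides by n (matches numpy.std(speed)).
population_std = statistics.pstdev(speed)

# Sample standard deviation: divides by n - 1 (matches numpy.std(speed, ddof=1)).
sample_std = statistics.stdev(speed)

print(round(population_std, 2))  # 37.85
print(round(sample_std, 2))      # 40.88
```

With only 7 values the two results differ noticeably; for large data sets the difference becomes very small.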
Symbols
• Standard Deviation is often represented by the symbol Sigma: σ
• Variance is often represented by the symbol Sigma Squared: σ²
What are Percentiles?
• Percentiles are used in statistics to give you a number that describes
the value that a given percent of the values are lower than.
• Example: Let's say we have an array of the ages of all the people who
live on a street.
• ages = [5,31,43,48,50,41,7,11,15,39,80,82,32,2,8,6,25,36,27,61,31]
• What is the 75th percentile? The answer is 43, meaning that 75% of the
people are 43 or younger.
• The NumPy module has a method for finding the specified percentile:
Percentiles
import numpy

ages = [5,31,43,48,50,41,7,11,15,39,80,82,32,2,8,6,25,36,27,61,31]

x = numpy.percentile(ages, 75)

print(x)
Percentiles
import numpy

ages = [5,31,43,48,50,41,7,11,15,39,80,82,32,2,8,6,25,36,27,61,31]

x = numpy.percentile(ages, 90)

print(x)
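To make the idea concrete, here is a sketch of how a percentile can be computed by hand. It sorts the values, finds the (possibly fractional) position for the requested percent, and interpolates linearly between the two neighbouring values. Linear interpolation is also NumPy's default behaviour, but the helper `percentile` below is an illustrative re-implementation, not NumPy's actual code.

```python
def percentile(values, percent):
    """Percentile with linear interpolation (illustrative helper)."""
    ordered = sorted(values)
    # Fractional position of the requested percentile in the sorted list.
    rank = (len(ordered) - 1) * percent / 100
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    # Interpolate between the two neighbouring values.
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction

ages = [5, 31, 43, 48, 50, 41, 7, 11, 15, 39, 80, 82, 32, 2, 8, 6, 25, 36, 27, 61, 31]

print(percentile(ages, 75))  # 43.0
print(percentile(ages, 90))  # 61.0
```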
Data Distribution
• Earlier we have worked with very small amounts of data in our
examples, just to understand the different concepts.
• In the real world, the data sets are much bigger, but it can be difficult
to gather real world data, at least at an early stage of a project.
• How Can we Get Big Data Sets?
• To create big data sets for testing, we use the Python module NumPy,
which comes with a number of methods to create random data sets,
of any size.
Create an array containing 250 random floats
between 0 and 5:
import numpy

x = numpy.random.uniform(0.0, 5.0, 250)

print(x)
Histogram
• To visualize the data set we can draw a histogram with the data we
collected.
• We will use the Python module Matplotlib to draw a histogram.

import numpy
import matplotlib.pyplot as plt

x = numpy.random.uniform(0.0, 5.0, 250)

plt.hist(x, 5)
plt.show()
Histogram
Histogram Explained
• We use the array from the example above to draw a histogram with 5 bars.
• The first bar represents how many values in the array are between 0 and 1.
• The second bar represents how many values are between 1 and 2.
• Etc.
• Which gives us this result:
• 52 values are between 0 and 1
• 48 values are between 1 and 2
• 49 values are between 2 and 3
• 51 values are between 3 and 4
• 50 values are between 4 and 5
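The counts behind each bar can be reproduced without drawing anything. The sketch below generates 250 random floats with the standard library's random module (used here in place of NumPy so the example has no dependencies) and counts how many fall into each of the five intervals. The seed value 0 is an arbitrary choice that makes the run repeatable; the exact counts depend on the random values and will differ from the numbers listed above.

```python
import random

rng = random.Random(0)  # fixed seed (arbitrary) so the run is repeatable
values = [rng.uniform(0.0, 5.0) for _ in range(250)]

# One counter per bar: 0-1, 1-2, 2-3, 3-4, 4-5.
counts = [0] * 5
for v in values:
    index = min(int(v), 4)  # guard against a value of exactly 5.0
    counts[index] += 1

for i, c in enumerate(counts):
    print(f"{c} values are between {i} and {i + 1}")
```

Note that plt.hist(x, 5) chooses its bar edges from the smallest and largest value in the data, so its bars only approximately match the intervals 0 to 1, 1 to 2, and so on.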
Big Data Distributions
• An array containing 250 values is not considered very big, but now you know
how to create a random set of values, and by changing the parameters, you can
create the data set as big as you want.
• Create an array with 100000 random numbers, and display them using a
histogram with 100 bars:
import numpy
import matplotlib.pyplot as plt

x = numpy.random.uniform(0.0, 5.0, 100000)

plt.hist(x, 100)
plt.show()
Normal Data Distribution
• In the previous chapter we learned how to create a completely
random array, of a given size, and between two given values.
• In this chapter we will learn how to create an array where the values
are concentrated around a given value.
• In probability theory this kind of data distribution is known as
the normal data distribution, or the Gaussian data distribution, after
the mathematician Carl Friedrich Gauss who came up with the
formula of this data distribution.
A typical normal data distribution:
import numpy
import matplotlib.pyplot as plt

x = numpy.random.normal(5.0, 1.0, 100000)

plt.hist(x, 100)
plt.show()
Histogram Explained
We use the array from the numpy.random.normal() method,
with 100000 values, to draw a histogram with 100 bars.

We specify that the mean value is 5.0, and the standard
deviation is 1.0.

This means that the values should be concentrated around 5.0,
with roughly two thirds of them (about 68%) no further than 1.0
from the mean.

And as you can see from the histogram, most values are
between 4.0 and 6.0, with a peak at approximately 5.0.
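How concentrated the values are can be measured directly. For a normal distribution, roughly 68% of values lie within one standard deviation of the mean and roughly 95% within two. The sketch below checks this on 100000 generated values; it uses random.gauss from the standard library as a stand-in for numpy.random.normal, and the seed 0 is an arbitrary choice for repeatability.

```python
import random

rng = random.Random(0)  # fixed seed (arbitrary) so the run is repeatable
values = [rng.gauss(5.0, 1.0) for _ in range(100000)]

# Share of values within one standard deviation of the mean (4.0 to 6.0).
within_one = sum(1 for v in values if 4.0 <= v <= 6.0) / len(values)

# Share of values within two standard deviations of the mean (3.0 to 7.0).
within_two = sum(1 for v in values if 3.0 <= v <= 7.0) / len(values)

print(round(within_one, 2))  # close to 0.68
print(round(within_two, 2))  # close to 0.95
```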
Scatter Plot
• A scatter plot is a diagram where each value in the data set is
represented by a dot.
Scatter Plot
• The Matplotlib module has a method for drawing scatter plots. It
needs two arrays of the same length, one for the values of the x-axis,
and one for the values of the y-axis:
x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

The x array represents the age of each car.

The y array represents the speed of each car.
Scatter Plot
import matplotlib.pyplot as plt

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

plt.scatter(x, y)
plt.show()
Scatter Plot Explained
• The x-axis represents ages, and the y-axis represents speeds.
• What we can read from the diagram is that the two fastest cars were
both 2 years old, and the slowest car was 12 years old.
• Note: It seems that the newer the car, the faster it drives, but that
could be a coincidence, after all we only registered 13 cars.
Random Data Distributions
• In Machine Learning the data sets can contain thousands, or even millions, of
values.
• You might not have real world data when you are testing an algorithm, so you might
have to use randomly generated values.
• As we have learned in the previous chapter, the NumPy module can help us with
that!
• Let us create two arrays that are both filled with 1000 random numbers from a
normal data distribution.
• The first array will have the mean set to 5.0 with a standard deviation of 1.0.
• The second array will have the mean set to 10.0 with a standard deviation of 2.0:
A scatter plot with 1000 dots:
import numpy
import matplotlib.pyplot as plt

x = numpy.random.normal(5.0, 1.0, 1000)
y = numpy.random.normal(10.0, 2.0, 1000)

plt.scatter(x, y)
plt.show()
Scatter Plot Explained

• We can see that the dots are concentrated around the value 5 on the
x-axis, and 10 on the y-axis.
• We can also see that the spread is wider on the y-axis than on the x-
axis.
Regression
• The term regression is used when you try to find the relationship
between variables.
• In Machine Learning, and in statistical modeling, that relationship is
used to predict the outcome of future events.
Linear Regression
• Linear regression uses the relationship between the data-points to
draw a straight line that fits all of them as closely as possible.
• This line can be used to predict future values.
• In Machine Learning, predicting the future is very important.
How Does it Work?
• Python has methods for finding a relationship between data-points
and to draw a line of linear regression.
• We will show you how to use these methods instead of going through
the mathematical formula.
• In the example below, the x-axis represents age, and the y-axis
represents speed.
• We have registered the age and speed of 13 cars as they were passing
a tollbooth.
• Let us see if the data we collected could be used in a linear regression:
Linear Regression
import matplotlib.pyplot as plt

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

plt.scatter(x, y)
plt.show()
Import scipy and draw the line of Linear Regression:

import matplotlib.pyplot as plt
from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
    return slope * x + intercept

mymodel = list(map(myfunc, x))

plt.scatter(x, y)
plt.plot(x, mymodel)
plt.show()
Example Explained
• Import the modules you need:
import matplotlib.pyplot as plt
from scipy import stats
• Create the arrays that represent the values of the x and y axis:
x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]
• Execute a method that returns some important key values of Linear
Regression:
slope, intercept, r, p, std_err = stats.linregress(x, y)
• Create a function that uses the slope and intercept values to return a
new value. This new value represents where on the y-axis the
corresponding x value will be placed:
def myfunc(x):
    return slope * x + intercept
• Run each value of the x array through the function. This will result in a
new array with new values for the y-axis:
mymodel = list(map(myfunc, x))
• Draw the original scatter plot:
plt.scatter(x, y)
• Draw the line of linear regression:
plt.plot(x, mymodel)
• Display the diagram:
plt.show()
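The slides skip the formula, but the slope and intercept of the best-fitting line can be computed in a few lines using the standard least-squares formulas. The sketch below is an illustration of those formulas, not the internal code of stats.linregress; the names `sxy`, `sxx`, and `predict` are assumptions made for this example.

```python
x = [5, 7, 8, 7, 2, 17, 2, 9, 4, 11, 12, 9, 6]
y = [99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86]

n = len(x)
mean_x = sum(x) / n
mean_y = sum(y) / n

# How x and y vary together, and how x varies on its own.
sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
sxx = sum((a - mean_x) ** 2 for a in x)

# Least-squares slope and intercept of the line y = slope * x + intercept.
slope = sxy / sxx
intercept = mean_y - slope * mean_x

def predict(age):
    """Return the speed the fitted line gives for a car of this age."""
    return slope * age + intercept

print(round(slope, 2))        # -1.75
print(round(intercept, 2))    # 103.11
print(round(predict(10), 2))  # 85.59
```

The negative slope says that, in this small data set, each extra year of age goes with a speed about 1.75 lower.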
R for Relationship
• It is important to know how strong the relationship between the values of
the x-axis and the values of the y-axis is. If there is no relationship, the
linear regression cannot be used to predict anything.
• This relationship - the coefficient of correlation - is called r.
• The r value ranges from -1 to 1, where 0 means no linear relationship, and 1
(and -1) means 100% related.
• Python and the SciPy module will compute this value for you. All you have
to do is feed it with the x and y values.
How well does my data fit in a linear
regression?
from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)

print(r)
• Note: The result -0.76 shows that there is a relationship, not perfect,
but it indicates that we could use linear regression in future predictions
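For reference, the r value can also be computed directly from its definition (the Pearson correlation coefficient). The sketch below uses only the standard library; the variable names are illustrative. The sign of r shows the direction of the relationship (negative here: older cars, lower speeds), and r squared is often read as the share of the variation in y that the line accounts for.

```python
import math

x = [5, 7, 8, 7, 2, 17, 2, 9, 4, 11, 12, 9, 6]
y = [99, 86, 87, 88, 111, 86, 103, 87, 94, 78, 77, 85, 86]

mean_x = sum(x) / len(x)
mean_y = sum(y) / len(y)

# Sums of products of the deviations from the means.
sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
sxx = sum((a - mean_x) ** 2 for a in x)
syy = sum((b - mean_y) ** 2 for b in y)

# Pearson correlation coefficient.
r = sxy / math.sqrt(sxx * syy)

print(round(r, 2))      # -0.76
print(round(r * r, 2))  # 0.58
```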
Predict Future Values
• Now we can use the information we have gathered to predict
future values.
• Example: Let us try to predict the speed of a 10-year-old car.
• To do so, we need the same myfunc() function from the example above:
def myfunc(x):
    return slope * x + intercept
Predict the speed of a 10-year-old car
from scipy import stats

x = [5,7,8,7,2,17,2,9,4,11,12,9,6]
y = [99,86,87,88,111,86,103,87,94,78,77,85,86]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
    return slope * x + intercept

speed = myfunc(10)

print(speed)
Bad Fit?
• These values for the x- and y-axis should result in a very bad fit for linear regression:
import matplotlib.pyplot as plt
from scipy import stats

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

slope, intercept, r, p, std_err = stats.linregress(x, y)

def myfunc(x):
    return slope * x + intercept

mymodel = list(map(myfunc, x))

plt.scatter(x, y)
plt.plot(x, mymodel)
plt.show()
You should get a very low r value

import numpy
from scipy import stats

x = [89,43,36,36,95,10,66,34,38,20,26,29,48,64,6,5,36,66,72,40]
y = [21,46,3,35,67,95,53,72,58,10,26,34,90,33,38,20,56,2,47,15]

slope, intercept, r, p, std_err = stats.linregress(x, y)

print(r)
• The result, 0.013, indicates a very weak relationship, and tells us that this
data set is not suitable for linear regression.
Polynomial Regression
• If your data points clearly will not fit a linear regression (a straight line
through the data points), polynomial regression might be a better choice.
• Polynomial regression, like linear regression, uses the relationship
between the variables x and y to find the best way to draw a line
through the data points; here the line is allowed to curve.
How Does it Work?
• Python has methods for finding a relationship between data-points
and to draw a line of polynomial regression. We will show you how to
use these methods instead of going through the mathematical formula.
• In the example below, we have registered 18 cars as they were
passing a certain tollbooth.
• We have registered the car's speed, and the time of day (hour) the
passing occurred.
• The x-axis represents the hours of the day and the y-axis represents
the speed:
Example
Start by drawing a scatter plot:
import matplotlib.pyplot as plt

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

plt.scatter(x, y)
plt.show()
Import numpy and matplotlib then draw the line of Polynomial Regression:

import numpy
import matplotlib.pyplot as plt

x = [1,2,3,5,6,7,8,9,10,12,13,14,15,16,18,19,21,22]
y = [100,90,80,60,60,55,60,65,70,70,75,76,78,79,90,99,99,100]

mymodel = numpy.poly1d(numpy.polyfit(x, y, 3))

myline = numpy.linspace(1, 22, 100)

plt.scatter(x, y)
plt.plot(myline, mymodel(myline))
plt.show()
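To show what a polynomial fit involves, the sketch below fits the same degree-3 polynomial in plain Python by solving the least-squares normal equations with Gaussian elimination. This is an illustration under stated assumptions, not how numpy.polyfit is implemented; the helpers `polyfit` and `evaluate` are hypothetical names written for this example. It then measures the fit with R squared and predicts the speed at hour 17.

```python
def polyfit(xs, ys, degree):
    """Least-squares polynomial coefficients, lowest power first (illustrative)."""
    n = degree + 1
    # Normal equations (A^T A) c = A^T y, where A[i][j] = xs[i] ** j.
    ata = [[sum(v ** (r + c) for v in xs) for c in range(n)] for r in range(n)]
    aty = [sum(w * v ** r for v, w in zip(xs, ys)) for r in range(n)]
    # Gaussian elimination with partial pivoting on the augmented matrix.
    m = [row + [rhs] for row, rhs in zip(ata, aty)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    coeffs = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * coeffs[c] for c in range(r + 1, n))
        coeffs[r] = (m[r][n] - known) / m[r][r]
    return coeffs

def evaluate(coeffs, value):
    """Evaluate the polynomial at one x value."""
    return sum(c * value ** power for power, c in enumerate(coeffs))

x = [1, 2, 3, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 18, 19, 21, 22]
y = [100, 90, 80, 60, 60, 55, 60, 65, 70, 70, 75, 76, 78, 79, 90, 99, 99, 100]

coeffs = polyfit(x, y, 3)

# R squared: 1 means a perfect fit, 0 means no better than using the mean.
mean_y = sum(y) / len(y)
ss_res = sum((b - evaluate(coeffs, a)) ** 2 for a, b in zip(x, y))
ss_tot = sum((b - mean_y) ** 2 for b in y)
r_squared = 1 - ss_res / ss_tot

print(round(r_squared, 2))
print(round(evaluate(coeffs, 17), 2))  # predicted speed at 17:00
```

An R squared value close to 1 suggests that the curved line describes this data well, in contrast to the straight line in the bad-fit example earlier.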
