
Elements of AI


III. Regression
Our main learning objective in this section is another nice example of supervised learning methods, almost as simple as the nearest neighbor classifier: linear regression. We'll also introduce its close cousin, logistic regression.

Note

The difference between classification and regression
There is a small but important difference in the kind of predictions that we should produce in different scenarios. While, for example, the nearest neighbor classifier chooses a class label for any item out of a given set of alternatives (like spam/ham, or 0, 1, 2, ..., 9), linear regression produces a numerical prediction that is not constrained to be an integer (a whole number as opposed to something like 3.14). So linear regression is better suited to situations where the output variable can be any number, like the price of a product, the distance to an obstacle, the box-office revenue of the next Star Wars movie, and so on.

The basic idea in linear regression is to add up the effects of each of the feature variables to produce the predicted value. The technical term for the adding-up process is linear combination. The idea is very straightforward, and it can be illustrated by your shopping bill.

Note

Thinking of linear regression as a shopping bill
Suppose you go to the grocery store and buy
2.5kg potatoes, 1.0kg carrots, and two bottles of
milk. If the price of potatoes is 2€ per kg, the price
of carrots is 4€ per kg, and a bottle of milk costs
3€, then the bill, calculated by the cashier, totals
2.5 × 2€ + 1.0 × 4€ + 2 × 3€ = 15€. In linear
regression, the amount of potatoes, carrots, and
milk are the inputs in the data. The output is the
cost of your shopping, which clearly depends on
both the price and how much of each product you
buy.
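
Here is a minimal sketch of the same calculation in Python, just to make the linear combination explicit:

```python
# The shopping bill as a linear combination: multiply each amount
# by its price (the "weight" in regression terms) and add up.
amounts = [2.5, 1.0, 2]    # kg of potatoes, kg of carrots, bottles of milk
prices = [2.0, 4.0, 3.0]   # euros per kg (or per bottle of milk)

bill = sum(amount * price for amount, price in zip(amounts, prices))
print(bill)  # 15.0
```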

The word linear means that the increase in the output when one input feature is increased by some fixed amount is always the same. In other words, whenever you add, say, two kilos of carrots into your shopping basket, the bill goes up 8€. When you add another two kilos, the bill goes up another 8€, and if you add half as much, 1kg, the bill goes up exactly half as much, 4€.

Key terminology

Coefficients or weights
In linear regression terminology, the prices of the
different products would be called coefficients or
weights (this may appear confusing since we
measured the amount of potatoes and carrots by
weight, but do not let yourself be tricked by this).
One of the main advantages of linear regression is
its easy interpretability: the learned weights may
in fact be more interesting than the predictions of
the outputs.

For example, when we use linear regression to predict life expectancy, the weight of smoking (cigarettes per day) is about minus half a year, meaning that smoking one cigarette more per day takes you, on average, half a year closer to termination. Likewise, vegetable consumption (handfuls of vegetables per day) has weight plus one year, so eating a handful of greens every day gives you, on average, one more year.


Exercise 16: Linear regression

Suppose that an extensive study is carried out, and it is found that in a particular country, the life expectancy (the average number of years that people live) among non-smoking women who don't eat any vegetables is 80 years.


In the above exercise, the life expectancy of non-smoking, veggie-hating women, 80 years, was the starting point for the calculation. The technical term for the starting point is the intercept. We will return to this below when we discuss how to learn linear regression models from data.
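
To make the intercept concrete, here is a small sketch using the illustrative numbers from this section (80 years as the starting point, minus half a year per daily cigarette, plus one year per daily handful of vegetables):

```python
# A linear regression prediction: the intercept plus the weighted
# effects of the inputs. The weights are the illustrative numbers
# from the text, not the results of a real study.
def predicted_life_expectancy(cigarettes_per_day, veggie_handfuls_per_day):
    intercept = 80.0
    return intercept - 0.5 * cigarettes_per_day + 1.0 * veggie_handfuls_per_day

print(predicted_life_expectancy(0, 0))   # 80.0 (the intercept alone)
print(predicted_life_expectancy(10, 2))  # 80 - 5 + 2 = 77.0
```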

Learning linear regression


Above, we discussed how predictions are obtained from linear regression when both the weights and the input features are known. So we are given the inputs and the weights, and we can produce the predicted output.

When we are given the inputs and the outputs for a number of items, we can find the weights such that the predicted output matches the actual output as well as possible. This is the task solved by machine learning.

Note

Example
Continuing the shopping analogy, suppose we
were given the contents of a number of shopping
baskets and the total bill for each of them, and we
were asked to figure out the price of each of the
products (potatoes, carrots, and so on). From one
basket, say 1kg of sirloin steak, 2kg of carrots, and
a bottle of Chianti, even if we knew that the total
bill is 35€, we couldn’t determine the prices
because there are many sets of prices that will
yield the same total bill. With many baskets,
however, we will usually be able to solve the
problem.
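
As a sketch of what this looks like in practice, the baskets below are made up, with hidden prices of 20€/kg for steak, 4€/kg for carrots, and 7€ per bottle of Chianti; given three sufficiently different baskets, least squares recovers them exactly:

```python
import numpy as np

# Each row is one basket: [kg of steak, kg of carrots, bottles of Chianti].
baskets = np.array([
    [1.0, 2.0, 1.0],   # total: 20 + 8 + 7 = 35
    [2.0, 1.0, 0.0],   # total: 40 + 4 + 0 = 44
    [1.0, 0.0, 2.0],   # total: 20 + 0 + 14 = 34
])
totals = np.array([35.0, 44.0, 34.0])

# Least squares finds the prices (weights) that best match the totals;
# with noisy data it returns the best approximate fit instead.
prices, *_ = np.linalg.lstsq(baskets, totals, rcond=None)
print(prices)  # approximately [20. 4. 7.]
```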

But the problem is made harder by the fact that in the real world, the actual output isn't always fully determined by the input, because of various factors that introduce uncertainty or "noise" into the process. You can think of shopping at a bazaar where the prices for any given product may vary from time to time, or a restaurant where the final damage includes a variable amount of tip. In such situations, we can estimate the prices, but only with some limited accuracy.

Finding the weights that optimize the match between the predicted and the actual outputs in the training data is a classical statistical problem dating back to the 1800s, and it can be easily solved even for massive data sets.

We will not go into the details of the actual weight-finding algorithms, such as the classical least squares technique, simple as they are. However, you can get a feel for finding trends in data in the following exercises.

Visualizing linear regression


A good way to get a feel for what linear regression can tell us is to draw a chart containing our data and our regression results. As a simple toy example, our data set has one variable, the number of cups of coffee an employee drinks per day, as the input, and the number of lines of code written per day by that employee as the output. This is not a real data set: obviously, factors other than coffee affect an employee's productivity, and they interact in complex ways. The increase in productivity from more coffee will also hold only up to a certain point, after which the jitters become too distracting.

[Scatter plot: cups of coffee per day (x-axis, 0 to 10) against lines of code written (y-axis, 0 to 60), one point per employee, with a fitted regression line.]

When we present our data in the chart above as points, where one point represents one employee, we can see that there is obviously a trend that drinking more coffee results in more lines of code being written (recall that this is completely made-up data). From this data set we can learn the coefficient, or the weight, related to coffee consumption, and by eye we can already say that it seems to be somewhere close to five, since for each cup of coffee consumed the number of lines programmed seems to go up roughly by five. For example, employees who drink around two cups of coffee per day seem to produce around 20 lines of code per day, and similarly at four cups of coffee, the amount of lines produced is around 30.

It can also be noted that employees who do not drink coffee at all also produce code: as the graph shows, about ten lines per day. This number is the intercept term that we mentioned earlier. The intercept is another parameter in the model, just like the weights, and it too can be learned from the data. Just as in the life expectancy example, it can be thought of as the starting point of our calculations, before we add in the effects of the input variable, or variables if we have more than one, be it coffee cups in this example, or cigarettes and vegetables in the previous one.

The line in the chart represents our predicted outcome, where we have estimated the intercept and the coefficient by using an actual linear regression technique called least squares. This line can be used to predict the number of lines produced when the input is the number of cups of coffee. Note that we can obtain a prediction even for partial cups (like half or 1/4 cups, and so on).
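
As a sketch of how such a line can be fitted, here is a least squares fit on made-up numbers resembling the chart:

```python
import numpy as np

# Made-up data resembling the chart: about 10 lines of code with
# no coffee, going up by roughly 5 lines per cup.
cups = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8])
lines = np.array([11, 13, 20, 26, 30, 36, 40, 47, 51])

# Fitting a degree-1 polynomial (a straight line) by least squares.
slope, intercept = np.polyfit(cups, lines, 1)
print(slope, intercept)  # roughly 5 and 10

# The fitted line gives a prediction for any input, partial cups included:
print(intercept + slope * 2.5)  # predicted lines of code at 2.5 cups
```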


Exercise 17: Life expectancy and education (part 1 of 2)

Let's study the link between the total number of years spent in school (including everything between preschool and university) and life expectancy. Here is data from three different countries:



Exercise 18: Life expectancy and education (part 2 of 2)

In the previous exercise, we only had data from three countries. The full data set consists of data from 14 different countries, presented here in a graph:


It should be pointed out that studies like those used in the above exercises cannot identify causal relationships. In other words, from this data alone, it is impossible to say whether studying actually increases life expectancy through a better-informed and healthier lifestyle or other mechanisms, or whether the apparent association between life expectancy and education is due to underlying factors that affect both. It is likely that, for example, in countries where people tend to be highly educated, nutrition, healthcare, and safety are also better, which increases life expectancy. With this kind of simple analysis, we can only identify associations, which can nevertheless be useful for prediction.

Machine learning applications of linear regression

Linear regression is truly the workhorse of many AI and data science applications. It has its limits, but they are often compensated by its simplicity, interpretability, and efficiency. To give a few examples, linear regression has been successfully used in the following problems:

prediction of click rates in online advertising

prediction of retail demand for products

prediction of box-office revenue of Hollywood movies

prediction of software cost

prediction of insurance cost

prediction of crime rates

prediction of real estate prices

Could we use regression to predict labels?

As we discussed above, linear regression and the nearest neighbor method produce different kinds of predictions. Linear regression produces numerical outputs, while the nearest neighbor method produces labels from a fixed set of alternatives ("classes").

Where linear regression excels compared to nearest neighbors is interpretability. What do we mean by this? You could say that in a way, the nearest neighbor method and any single prediction that it produces are easy to interpret: it's just the nearest training data element! This is true, but when it comes to the interpretability of the learned model, there is a clear difference. Interpreting the trained model in nearest neighbors in a similar fashion as the weights in linear regression is impossible: the learned model is basically the whole data, and it is usually way too big and complex to provide us with much insight. So what if we'd like to have a method that produces the same kind of outputs as the nearest neighbor, labels, but is interpretable like linear regression?

Logistic regression to the rescue

Well, there is good news for you: we can turn the linear regression method's outputs into predictions about labels. The technique for doing this is called logistic regression. We will not go into the technicalities; suffice it to say that in the simplest case, we take the output from linear regression, which is a number, and predict one label A if the output is greater than zero, and another label B if the output is less than or equal to zero. Actually, instead of just predicting one class or another, logistic regression can also give us a measure of uncertainty of the prediction. So if we are predicting whether a customer will buy a new smartphone this year, we can get a prediction that customer A will buy a phone with probability 90%, but for another, less predictable customer, we can get a prediction that they will not buy a phone with 55% probability (or in other words, that they will buy one with 45% probability).
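
As a minimal sketch, the simplest version of the idea feeds the linear regression output through the logistic (sigmoid) function, which squashes any number into a value between 0 and 1 that can be read as a probability; thresholding it gives the label:

```python
import math

# The logistic (sigmoid) function maps any number to the range (0, 1).
def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# z is the output of linear regression; positive z means label A.
def predict_label(z):
    probability_a = sigmoid(z)
    label = "A" if z > 0 else "B"  # same as thresholding the probability at 0.5
    return label, probability_a

print(predict_label(2.2))   # ('A', ~0.90): a confident prediction
print(predict_label(-0.2))  # ('B', ~0.45): i.e. label B with ~55% probability
```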

It is also possible to use the same trick to obtain predictions over more than two possible labels, so instead of always predicting either yes or no (buy a new phone or not, fake news or real news, and so forth), we can use logistic regression to identify, for example, handwritten digits, in which case there are ten possible labels.

An example of logistic regression

Let's suppose that we collect data on students taking an introductory course in cookery. In addition to basic information such as the student ID, name, and so on, we also ask the students to report how many hours they studied for the exam (however you study for a cookery exam, probably cooking?) – and hope that they are more or less honest in their reports. After the exam, we will know whether each student passed the course or not. Some data points are presented below:

Student ID    Hours studied    Pass/fail
24            15               Pass
41            9.5              Pass
58            2                Fail
101           5                Fail
103           6.5              Fail
215           6                Pass

Based on the table, what kind of conclusion could you draw about the relationship between the hours studied and passing the exam? We could think that if we had data from hundreds of students, maybe we could see how much studying is needed in order to pass the course. We can present this data in a chart, as you can see below.
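
As a sketch, assuming the scikit-learn library is available, fitting logistic regression to the six data points in the table could look like this:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hours studied for each student in the table, and whether they passed.
hours = np.array([[15], [9.5], [2], [5], [6.5], [6]])
passed = np.array([1, 1, 0, 0, 0, 1])  # 1 = pass, 0 = fail

model = LogisticRegression()
model.fit(hours, passed)

# Estimated probability of passing for a student who studies 8 hours.
print(model.predict_proba([[8]])[0, 1])
```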


Exercise 19: Logistic regression

[Chart: probability of passing the exam on the y-axis, visible range 40% to 100%.]


Logistic regression is also used in a great variety of real-world AI applications, such as predicting financial risks, in medical studies, and so on. However, like linear regression, it is also constrained by the linearity property, and we need many other methods in our toolbox. We will return to the linearity issue later when we discuss neural networks.

The limits of machine learning

To summarize, machine learning is a very powerful tool for building AI applications. In addition to the nearest neighbor method, linear regression, and logistic regression, there are literally hundreds, if not thousands, of different machine learning techniques, but they all boil down to the same thing: trying to extract patterns and dependencies from data and using them either to gain understanding of a phenomenon or to predict future outcomes.

Machine learning can be a very hard problem, and we can't usually achieve a perfect method that would always produce the correct label. However, in most cases, a good but not perfect prediction is still better than none. Sometimes we may be able to produce better predictions ourselves, but we may still prefer to use machine learning because the machine will make its predictions faster and will also keep churning out predictions without getting tired. Good examples are recommendation systems that need to predict what music, what videos, or what ads are more likely to be of interest to you.

The factors that affect how good a result we can achieve include:

The hardness of the task: in handwritten digit recognition, if the digits are written very sloppily, even a human can't always guess correctly what the writer intended

The machine learning method: some methods are far better for a particular task than others

The amount of training data: from only a few examples, it is impossible to obtain a good classifier

The quality of the data

Note

Data quality matters

In the beginning of this chapter, we emphasized the importance of having enough data and the risks of overfitting. Another equally important factor is the quality of the data. In order to build a model that generalizes well to data outside of the training data, the training data needs to contain enough information that is relevant to the problem at hand. For example, if you create an image classifier that tells you what the image given to the algorithm is about, and you have trained it only on pictures of dogs and cats, it will classify everything it sees as either a dog or a cat. This would make sense if the algorithm is used in an environment where it will only see cats and dogs, but not if it is expected to see boats, cars, and flowers as well.

We'll return to potential problems caused by "biased" data later.

It is also important to emphasize that different machine learning methods are suitable for different tasks. Thus, there is no single best method for all problems ("one algorithm to rule them all..."). Fortunately, one can try out a large number of different methods and see which one of them works best in the problem at hand.

This leads us to a point that is very important but often overlooked in practice: what it means to work better. In the digit recognition task, a good method would of course produce the correct label most of the time. We can measure this by the classification error: the fraction of cases where our classifier outputs the wrong class. In predicting apartment prices, the quality measure is typically something like the difference between the predicted price and the final price for which the apartment is sold. In many real-life applications, it is also worse to err in one direction than in another: setting the price too high may delay the process by months, but setting the price too low will mean less money for the seller. And to take yet another example, failing to detect a pedestrian in front of a car is a far worse error than falsely detecting one when there is none.
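
As a small sketch with made-up labels, the classification error is simply the fraction of disagreements between the predictions and the true labels:

```python
# Classification error: the fraction of cases where the classifier
# outputs the wrong class. Both label lists are made up.
true_labels = [3, 7, 1, 8, 7, 2]
predictions = [3, 1, 1, 8, 7, 7]

errors = sum(t != p for t, p in zip(true_labels, predictions))
print(errors / len(true_labels))  # 2 mistakes out of 6 cases, about 0.33
```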

As mentioned above, we can't usually achieve zero error, but perhaps we will be happy with error less than 1 in 100 (or 1%). This too depends on the application: you wouldn't be happy to have only 99% safe cars on the streets, but being able to predict whether you'll like a new song with that accuracy may be more than enough for a pleasant listening experience. Keeping the actual goal in mind at all times helps us make sure that we create actual added value.

After completing Chapter 4 you should be able to:

Explain why machine learning techniques are used

Distinguish between unsupervised and supervised machine learning scenarios

Explain the principles of three supervised classification methods: the nearest neighbor method, linear regression, and logistic regression

Please join the Elements of AI community to discuss and ask questions about this chapter.

You reached the end of Chapter 4!

Next chapter: Neural networks
