
Additional exercises

1. The table below lists a dataset that was used to create a nearest neighbor model
that predicts whether it will be a good day to go surfing.

Assuming that the model uses Euclidean distance to find the nearest neighbor,
what prediction will the model return for each of the following query instances?
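A minimal sketch of how such a model computes its prediction is given below. The rows in the dataset variable are placeholders only, not the instances from the table above; substitute the actual feature vectors and target labels when working the exercise.

import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest_neighbor(dataset, query):
    """Return the target of the training instance closest to the query.

    dataset: list of (feature_vector, target) pairs.
    """
    features, target = min(dataset, key=lambda row: euclidean(row[0], query))
    return target

# Placeholder rows for illustration only -- replace with the table's data.
dataset = [
    ((6.0, 15.0, 5.0), "yes"),
    ((1.0, 6.0, 9.0), "no"),
]
print(nearest_neighbor(dataset, (4.0, 10.0, 4.0)))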

2. Email spam filtering models often use a bag-of-words representation
for emails. In a bag-of-words representation, each descriptive feature
describing a document (in our case, an email) records how many times a
particular word occurs in the document. One descriptive feature is included
for each word in a predefined dictionary, and the dictionary is typically
defined as the complete set of words that occur in the training dataset.
The table below lists the bag-of-words representation for the following five
emails, along with a target feature, SPAM, indicating whether each email is
spam or genuine (a minimal construction sketch follows the list):

- “money, money, money”
- “free money for free gambling fun”
- “gambling for fun”
- “machine learning for fun, fun, fun”
- “free machine learning”
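Since the table itself is not reproduced here, here is a minimal sketch of how the bag-of-words vectors can be rebuilt from these five emails (punctuation dropped for simplicity; the SPAM labels must still come from the original table):

from collections import Counter

emails = [
    "money money money",
    "free money for free gambling fun",
    "gambling for fun",
    "machine learning for fun fun fun",
    "free machine learning",
]

# The dictionary is the complete set of words in the training emails.
dictionary = sorted({word for email in emails for word in email.split()})

def bag_of_words(text, dictionary):
    # Count how many times each dictionary word occurs in the text.
    counts = Counter(text.split())
    return [counts[word] for word in dictionary]

vectors = [bag_of_words(email, dictionary) for email in emails]
print(dictionary)   # ['for', 'free', 'fun', 'gambling', 'learning', 'machine', 'money']
for vector in vectors:
    print(vector)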

- What target level would a nearest neighbor model using Euclidean distance
return for the following email: “machine learning for free”?
- What target level would a k-NN model with k = 3, using Euclidean distance,
return for the same query?
- What target level would a weighted k-NN model with k = 5, using as weights
the reciprocal of the squared Euclidean distance between each neighbor and
the query, return for the query?
- What target level would a k-NN model with k = 3, using Manhattan distance,
return for the same query?
- There are a lot of zero entries in the spam bag-of-words dataset. This is
indicative of sparse data and is typical for text analytics. Cosine similarity
is often a good choice when dealing with sparse, non-binary data. What target
level would a 3-NN model using cosine similarity return for the query?

A sketch of the distance and similarity computations these questions rely on follows.
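This is a minimal sketch of that machinery, assuming the bag-of-words vectors built above paired with their targets as (feature_vector, target) tuples. Note that cosine similarity is a similarity, not a distance, so the k "nearest" neighbors are the k most similar.

import math
from collections import defaultdict

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def knn(dataset, query, k, distance=euclidean, weighted=False):
    # Sort training pairs by distance to the query and keep the k nearest.
    neighbors = sorted(dataset, key=lambda row: distance(row[0], query))[:k]
    votes = defaultdict(float)
    for features, target in neighbors:
        if weighted:
            d = distance(features, query)
            if d == 0:
                return target      # an exact match dominates the vote
            votes[target] += 1.0 / d ** 2
        else:
            votes[target] += 1.0
    return max(votes, key=votes.get)

def knn_cosine(dataset, query, k):
    # For a similarity measure, keep the k MOST similar neighbors.
    neighbors = sorted(dataset,
                       key=lambda row: cosine_similarity(row[0], query),
                       reverse=True)[:k]
    votes = defaultdict(float)
    for features, target in neighbors:
        votes[target] += 1.0
    return max(votes, key=votes.get)

With weighted=True and k=5, knn implements the reciprocal-squared-distance scheme from the third question; passing distance=manhattan covers the fourth, and knn_cosine the fifth.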

3. You have been hired by the European Space Agency to build a model that predicts
the amount of oxygen that an astronaut consumes when performing five minutes of
intense physical work. The descriptive features for the model will be the age of the
astronaut and their average heart rate throughout the work. The regression model is

OxyCon = w[0] + w[1] * Age + w[2] * HeartRate

The table below shows a historical dataset that has been collected for this task.

- Assuming that the current weights in a multivariate linear regression model
are w[0] = 59.50, w[1] = 0.15, and w[2] = 0.60, make a prediction for each
training instance using this model.
- Calculate the sum of squared errors for the set of predictions generated in
the previous question.
- Assuming a learning rate of 0.000002, calculate the weights at the next
iteration of the gradient descent algorithm.
- Calculate the sum of squared errors for the set of predictions generated
using the new weights calculated in the previous question.

A sketch of these computations is given below.
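This is a minimal sketch, assuming training rows of the form (Age, HeartRate, OxyCon); the historical dataset table is not reproduced here, and the update rule shown is the common delta-rule form in which the constant factor from differentiating the squared error is folded into the learning rate.

# Weights and learning rate taken from the questions above.
w = [59.50, 0.15, 0.60]
alpha = 0.000002

def predict(w, age, heart_rate):
    # OxyCon = w[0] + w[1] * Age + w[2] * HeartRate
    return w[0] + w[1] * age + w[2] * heart_rate

def sum_of_squared_errors(w, data):
    # data: list of (age, heart_rate, oxycon) rows from the table.
    return sum((oxy - predict(w, age, hr)) ** 2 for age, hr, oxy in data)

def gradient_descent_step(w, data, alpha):
    # Batch update: w[j] <- w[j] + alpha * sum(error * feature_j),
    # where the bias term w[0] uses a constant feature of 1. Errors are
    # computed with the OLD weights, as batch gradient descent requires.
    new_w = list(w)
    for age, hr, oxy in data:
        error = oxy - predict(w, age, hr)
        new_w[0] += alpha * error
        new_w[1] += alpha * error * age
        new_w[2] += alpha * error * hr
    return new_w

# Usage, once `data` holds the table's rows:
# sse_before = sum_of_squared_errors(w, data)
# w_next = gradient_descent_step(w, data, alpha)
# sse_after = sum_of_squared_errors(w_next, data)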
