Midterm 2023 Redacted

Uploaded by

Dillon Murphy

Midterm Exam

Data 402

Instructions

You have two and a half hours to complete this exam, although it is designed to take much
less time than that.
The exam is open-note and open-internet. That is, you may use any class materials and any
public online materials to help you during the exam. This also means you may use software
to perform calculations or derivations.
Of course, you may not contact any other humans during the course of the exam. This
includes online forums and message boards; that is, you may visit StackOverflow or reddit and
read existing posts, but you may not post a new question of any kind. You also may not use
AI/LLMs like ChatGPT.
You may answer these questions using your computer or written on paper, or any combination
thereof. As long as I can easily find all your answers, any format is fine. Don’t forget to upload
any digital answers to Canvas, and to turn in any paper answers to me.
You do not need to turn in any notes, scratch paper, etc. that you use during the course of
the exam.

Part I [100 points]

A researcher wishes to fit a linear regression model with several predictors.

She is considering three loss functions:

For each of these options:

1. Give an intuitive explanation for this choice of loss function. How does it express the
desire for accurate predictions of a quantitative variable?
2. Do you see any possible issues with this loss function? What assumptions have to be
true about the data for this loss function to be viable?
3. Find an equation for the gradient of the loss function. (A general equation for the partial
derivative with respect to 𝛽𝑗 will suffice.)
4. Give a brief code outline (pseudocode, R, or python) to show the procedure you would
use to calculate the “best” 𝛽’s according to this loss function. Your code does not need
to actually run; however, it does need to be specific about the inputs as well as the
equations. For example:

Sufficient:

def ols(x, y):

    sx = std dev of x
    sy = std dev of y

    mx = mean of x
    my = mean of y

    rxy = correlation of x and y

    beta_1 = sy/sx*rxy
    beta_0 = my - beta_1*mx

    return beta_1 and beta_0
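
For reference, the outline above can also be written as code that actually runs. The following is only a sketch, assuming plain Python lists of numbers and a single predictor; the input checks and the intermediate names sxx, syy, and sxy are illustrative additions that do not appear in the exam's outline.

```python
from math import sqrt


def ols(x, y):
    """Fit y = beta_0 + beta_1 * x by least squares from summary statistics."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")

    mx = sum(x) / n
    my = sum(y) / n

    # Sums of squared deviations and cross-deviations about the means
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        # The correlation is undefined when either variable is constant
        raise ValueError("x and y must each vary")

    # Sample standard deviations and Pearson correlation
    sx = sqrt(sxx / (n - 1))
    sy = sqrt(syy / (n - 1))
    rxy = sxy / sqrt(sxx * syy)

    beta_1 = sy / sx * rxy      # slope
    beta_0 = my - beta_1 * mx   # intercept

    return beta_1, beta_0
```

Calling ols([1, 2, 3, 4], [3, 5, 7, 9]) should give a slope close to 2 and an intercept close to 1, up to floating-point rounding, since those points lie exactly on the line y = 1 + 2x.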

Insufficient:

ols:

    Find means and sds

    Calculate beta_0
    Calculate beta_1

    return beta_0 and beta_1

Part II [50 points]

Question A

A researcher is trying to fit a linear regression model using a LASSO penalty. To choose a
good value of the penalty parameter, 𝜆, she decides to take the following approach:

Discuss this strategy: Do you think it is a good idea? Why or why not? Do you have any
suggestions to make it more efficient, more justifiable, or more correct?

Question B

Propose a modeling process to address this question.

You should include:

• A (very brief) description of how you might pre-process the data.

• A model specification, and why you think that model would be a good choice for this
task. This can be a model we studied, an existing model we haven’t studied, or you can
“invent” something - but you must include some discussion of why you think that choice
is good/reasonable for this scenario.
• A loss function that you will use to fit the model, and why this loss function correctly
expresses your desires for your “best” model.
• A metric you will use to report your model’s abilities and/or to tune hyperparameters,
and why this metric is a good measure of “model success” in this case. This metric
should not be R-squared, MAE, or MSE. (Those would be reasonable in this case, but I
want you to find/invent a different one and justify it.)

Note: You do not need to concern yourself in this question with the computational feasibility
of your model, loss, or metric. I am only looking for you to translate the “real world” needs
of the scenario into mathematical decisions.
