Assignment 1: Statistical Machine Learning, Summer Term 2022

This document provides instructions for students taking the Statistical Machine Learning course over the summer term in 2022. It outlines 5 exercises to complete involving probability concepts like joint, marginal, and conditional probability. Students are asked to prove theorems like the weak law of large numbers. Other concepts covered include linear independence, eigenvectors/eigenvalues, and singular value decomposition. Students are also instructed to install Python and required packages to complete programming assignments during the course.

Uploaded by

Partha Ghosh

Assignment 1

Statistical Machine Learning, Summer term 2022

Moritz Haas / Ulrike von Luxburg

due on Monday, April 25th 2022.

On this sheet we are going to recap the basic maths that will be needed to follow this course
(note that the contents of “Maths for ML” are a prerequisite to the course) and provide the
instructions to install Python. There are basic recap documents and links on the course webpage,
recap slides at the end of the slides of this class, and YouTube videos in this playlist:
https://www.youtube.com/playlist?list=PL05umP7R6ij1a6KdEy8PVE9zoCv6SlHRS

Exercise 1 (Joint probability, marginal probability, relative frequency, expected value and variance, 1+3 points)
Consider two random variables, the sex X and the body height Y of a randomly drawn person. The
sex X is binary (male or female) and the body height Y takes three values (small, medium, large).
The relation between X and Y can be visualized in a contingency table, which shows the joint
probabilities. For example, we denote the probability of randomly sampling a medium-sized
woman by P(X = female, Y = medium), and from the table we read off that it is 0.1.

                     Y
X          Small    Medium    Large
Male       0.1      0.15      0.25
Female     0.3      0.1       0.1
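
As a small illustration, a table like this can be stored as a 2-D array so that sums over rows or columns can be checked numerically. This is only a sketch: the variable names (`joint`, `p_x`, `p_y`) and the row/column ordering are our own choices, and NumPy is assumed to be installed.

```python
import numpy as np

# Joint probabilities P(X, Y) copied from the table.
# Rows: X = Male, Female. Columns: Y = Small, Medium, Large.
joint = np.array([
    [0.10, 0.15, 0.25],
    [0.30, 0.10, 0.10],
])

# A valid joint distribution sums to 1 over all cells.
total = joint.sum()

# Summing out one variable leaves the distribution of the other.
p_x = joint.sum(axis=1)  # one entry per value of X
p_y = joint.sum(axis=0)  # one entry per value of Y

print("total:", total)
print("P(X):", dict(zip(["Male", "Female"], p_x)))
print("P(Y):", dict(zip(["Small", "Medium", "Large"], p_y)))
```

Working with the whole array rather than individual cells keeps the check short and makes it easy to swap in a different table.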

(a) The marginal probabilities refer to the probability of only one variable, P (X = x) or in
short P (x). Compute the marginal probabilities of X and Y. We have that P (X = Male) +
P (X = Female) = 1, why?
(b) Calculate the expected value of X, E(X). Let $x_i$ for $i = 1, \dots, n$ be i.i.d. samples from a
distribution X. Then the empirical mean is defined as

\[ \bar{X}_n = \frac{1}{n} \sum_{i=1}^{n} x_i \]

The empirical mean is an estimate for the expected value. In particular the weak law of
large numbers holds. It states that for all $\varepsilon > 0$

\[ \lim_{n \to \infty} P\left( \left| \bar{X}_n - E(X) \right| > \varepsilon \right) = 0 \]

Assuming that both E(X) and Var(X) are finite, prove the weak law of large numbers. You
can use Chebyshev’s inequality. It states that if X is a random variable with finite
expected value E(X) and finite variance Var(X), then for every $\varepsilon > 0$ it holds that

\[ P\left( |X - E(X)| \geq \varepsilon \right) \leq \frac{\operatorname{Var}(X)}{\varepsilon^2}. \]
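You can also use the following facts. For all $a_i \in \mathbb{R}$,

\[ E\left( \sum_i a_i X_i \right) = \sum_i a_i E(X_i) \]

and, since the $X_i$ are independent,

\[ \operatorname{Var}\left( \sum_i a_i X_i \right) = \sum_i a_i^2 \operatorname{Var}(X_i). \]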
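
The statement of the weak law can also be explored numerically. The sketch below is not part of the proof; it assumes, purely for illustration, that X is a Bernoulli variable with parameter `p` (our choice), repeats the sampling experiment `trials` times for several sample sizes, and compares the observed frequency of large deviations with the Chebyshev bound applied to the empirical mean. The bound uses Var(X)/n as the variance of the empirical mean, which follows from the variance fact above with all coefficients equal to 1/n.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumption for illustration: X ~ Bernoulli(p), so E(X) = p and Var(X) = p * (1 - p).
p = 0.5
mean_x = p
var_x = p * (1 - p)
eps = 0.1
trials = 2000  # independent repetitions per sample size

results = []
for n in [10, 100, 1000]:
    # Each row holds n i.i.d. samples; its mean is one realisation of the empirical mean.
    samples = rng.binomial(1, p, size=(trials, n))
    empirical_means = samples.mean(axis=1)

    # Fraction of repetitions whose empirical mean deviates from E(X) by more than eps.
    freq = np.mean(np.abs(empirical_means - mean_x) > eps)

    # Chebyshev bound for the empirical mean, whose variance is Var(X) / n.
    bound = var_x / (n * eps**2)

    results.append((n, freq, bound))
    print(f"n={n:5d}  observed frequency={freq:.4f}  Chebyshev bound={bound:.4f}")
```

The bound is loose (for small n it can exceed 1), but it shrinks proportionally to 1/n, which is the behaviour the proof needs.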

All course material is available on
https://www.tml.cs.uni-tuebingen.de/teaching/2022_statistical_learning/index.php.
To download the material use the login name machine and the password learning.
Exercise 2 (Conditional probability, independence and Bayes theorem, 1+1+2+2 points)

(a) Conditional probabilities refer to the probability distribution of one variable given another
one. It is denoted by

\[ P(A \mid B) = \frac{P(A \cap B)}{P(B)} = \frac{P(A, B)}{P(B)}, \]

which reads as the probability of A given B. For example, in Exercise 1 we have that
P(Y = Large | X = Male) = 0.5. Calculate the probability P(Y = Medium | X = Female).
(b) When are two random variables X and Y independent? Name two characterizations.
(c) The Bayes theorem states that

\[ P(B = b \mid A = a) = \frac{P(A = a \mid B = b)\, P(B = b)}{P(A = a)} \]

Let A be the result of a cancer screening test, which can be negative or positive, and let B
indicate whether the tested patient has cancer or not. The probability of having cancer is 1% and
the test is accurate with 95% probability, which for this exercise means that

P (A = positive | B = cancer) = 0.95 and P (A = negative | B = no cancer) = 0.95

Compute the probability of having cancer given a positive test result

P (B = cancer | A = positive).

Are you surprised by the result? Can you give an informal explanation of why we obtain such a
result?
(d) The odds of having cancer are given by

\[ O(B = \text{cancer}) = \frac{P(B = \text{cancer})}{P(B = \text{no cancer})}. \]

This quantity states how many cancer patients you have to expect per person without the
disease. The Bayes factor is given by

\[ \frac{P(A = \text{positive} \mid B = \text{cancer})}{P(A = \text{positive} \mid B = \text{no cancer})} \]

and states how much more likely it is to get a positive test result given that a person has
cancer compared to when they have no cancer. Can you state the updated odds after a positive
test result

\[ O(B = \text{cancer} \mid A = \text{positive}) = \frac{P(B = \text{cancer} \mid A = \text{positive})}{P(B = \text{no cancer} \mid A = \text{positive})} \]

in terms of O(B = cancer) and the Bayes factor? Why is this view valuable?
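
The two views of the screening example, Bayes' theorem on probabilities and the update on odds, can be compared numerically. The following is a minimal sketch using the numbers stated in this exercise; the function name `posterior` and its parameter names (`prior`, `sensitivity`, `specificity`) are our own and not part of the course material.

```python
def posterior(prior, sensitivity, specificity):
    """Return P(B = cancer | A = positive) using Bayes' theorem."""
    p_pos_given_cancer = sensitivity
    p_pos_given_no_cancer = 1.0 - specificity
    # Law of total probability gives the denominator P(A = positive).
    p_pos = p_pos_given_cancer * prior + p_pos_given_no_cancer * (1.0 - prior)
    return p_pos_given_cancer * prior / p_pos


prior = 0.01          # P(B = cancer)
sensitivity = 0.95    # P(A = positive | B = cancer)
specificity = 0.95    # P(A = negative | B = no cancer)

post = posterior(prior, sensitivity, specificity)

# Same computation in terms of odds.
prior_odds = prior / (1.0 - prior)
bayes_factor = sensitivity / (1.0 - specificity)
post_odds = prior_odds * bayes_factor
post_from_odds = post_odds / (1.0 + post_odds)  # convert odds back to a probability

print("posterior probability:", post)
print("posterior via odds:   ", post_from_odds)
```

Printing both values side by side makes it easy to see whether the two routes agree, and changing `prior` shows how strongly the result depends on how rare the disease is.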

Exercise 3 (Linear independence, basis and rank, 1+1+1+1 points)

Consider the following matrix A:

\[ A = \begin{pmatrix} 1 & 1 & 2 \\ 1 & 2 & 1 \\ 5 & 7 & 8 \end{pmatrix}. \]

We will use this particular 3x3 matrix when we refer to A through the rest of this exercise unless
otherwise specified. Also, a vector $x \in \mathbb{R}^d$ always refers to a column vector. We use $x^T$ to refer to
a row vector in $\mathbb{R}^d$.

(a) Note that the product $Ax$ for any arbitrary matrix $A \in \mathbb{R}^{3 \times 3}$ and any $x \in \mathbb{R}^3$ can always be
written as a linear combination of the column vectors of A with the elements of x as coefficients.
Let $x^T = (x_1, x_2, x_3) \in \mathbb{R}^3$. Write down the explicit form of $Ax$ as a linear combination of the
column vectors of A.
(b) Do the columns of A form a basis of $\mathbb{R}^3$? Answer the same question for the rows of A.

(c) Now consider the system of linear equations $Ax = b$, where $x \in \mathbb{R}^3$, $b \in \mathbb{R}^3$. Try to find a $\tilde{b} \in \mathbb{R}^3$
(if such a $\tilde{b}$ exists) so that there is no real-valued solution $x$ to the linear system $Ax = \tilde{b}$. If
such a $\tilde{b}$ does not exist, then explain why. Try to understand the answer to this question in
terms of the expression in part (a) of the exercise.
Now consider $b^T = (2, 3, 12)$. Find $x$ that satisfies the relation $Ax = b$.
(d) Just for the sake of completeness, what is the column rank, row rank and rank of the matrix
A? Write one line justifying/explaining your answers.
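
Pen-and-paper answers to this exercise can be cross-checked with NumPy. The sketch below is one possible way to do so, not a prescribed method: the test vector `x` is an arbitrary choice of ours, and `np.linalg.lstsq` is used instead of `np.linalg.solve` because it also handles matrices that are not invertible.

```python
import numpy as np

A = np.array([
    [1.0, 1.0, 2.0],
    [1.0, 2.0, 1.0],
    [5.0, 7.0, 8.0],
])

# Part (a): A @ x equals the linear combination of the columns of A
# with the entries of x as coefficients.
x = np.array([2.0, -1.0, 3.0])  # arbitrary test vector (our choice)
combination = x[0] * A[:, 0] + x[1] * A[:, 1] + x[2] * A[:, 2]
same = np.allclose(A @ x, combination)

# Parts (b) and (d): numerical rank of A.
rank = np.linalg.matrix_rank(A)

# Part (c): least-squares solution of A x = b; it is an exact solution
# whenever b lies in the span of the columns of A.
b = np.array([2.0, 3.0, 12.0])
x_ls, _, _, _ = np.linalg.lstsq(A, b, rcond=None)
solves = np.allclose(A @ x_ls, b)

print("A @ x matches column combination:", same)
print("numerical rank of A:", rank)
print("least-squares x:", x_ls, "| solves A x = b exactly:", solves)
```

If a system has more than one solution, `lstsq` returns the one with the smallest Euclidean norm, so the printed vector may differ from a solution found by hand while still satisfying the equation.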

Exercise 4 (Symmetric matrices, eigenvectors, eigenvalues and SVD, 1+2+2+0 points)

(a) Geometric visualization of eigenvectors: Consider the following 2x2 matrices.

\[ A = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}, \quad B = \begin{pmatrix} 1 & 2 \\ 2 & 1 \end{pmatrix}, \quad C = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}. \]

You can find a Java applet to visualize the eigenvectors of 2x2 matrices at this link:
https://www.geogebra.org/m/KuMAuEnd. Enable Java in your browser in order to access the
applet. Using the applet, scale and rotate the vector x in order to identify the independent
eigenvectors and eigenvalues of the three matrices A, B, C. Explain briefly what you observe
in the case of matrix C.
(b) $A \in \mathbb{R}^{n \times n}$ is a symmetric matrix, with a set of eigenvectors $u_1, \dots, u_n$ with corresponding
eigenvalues $\lambda_1, \dots, \lambda_n$. Derive the eigenvectors and the eigenvalues of the following matrices in
terms of eigenvectors and eigenvalues of A.
(1) $A + \alpha I$, where I is the identity matrix of size n and $\alpha \in \mathbb{R}$
(2) $A^T A$
(3) $A A^T$
(4) If in addition A is a non-singular matrix, then find the eigenvectors and eigenvalues of
$A^{-1}$.
(c) Let $S \in \mathbb{R}^{m \times n}$ with $m \neq n$. Identify the components of the singular value decomposition
(SVD) of S given that we have the eigendecomposition of the square symmetric matrices $S^T S$
and $S S^T$.
(d) To have a better understanding of eigenvalues/vectors and SVD we recommend the following
video: https://www.youtube.com/watch?v=PFDu9oVAE-g. The whole series is worth watching
to gain a better geometric understanding of linear algebra.
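
Derivations in this exercise can be sanity-checked numerically before handing them in. The sketch below prints the relevant quantities side by side for one symmetric example matrix and one random rectangular matrix; the shift `alpha`, the random seed and the shape of `S` are arbitrary choices of ours, and the printout is a check on a single example, not a derivation.

```python
import numpy as np

# Symmetric example matrix (matrix B from part (a)).
B = np.array([
    [1.0, 2.0],
    [2.0, 1.0],
])
alpha = 0.5  # arbitrary shift (our choice)

# eigh is intended for symmetric matrices; eigenvalues come back in ascending order.
eigvals_B, eigvecs_B = np.linalg.eigh(B)
eigvals_shifted, _ = np.linalg.eigh(B + alpha * np.eye(2))

# Each column u_j of eigvecs_B should satisfy B u_j = lambda_j u_j.
residual = B @ eigvecs_B - eigvecs_B * eigvals_B

# Rectangular example for part (c).
rng = np.random.default_rng(0)
S = rng.standard_normal((3, 2))
sing_vals = np.linalg.svd(S, compute_uv=False)  # descending order
eigvals_StS = np.linalg.eigvalsh(S.T @ S)       # ascending order

print("eigenvalues of B:            ", eigvals_B)
print("eigenvalues of B + alpha * I:", eigvals_shifted)
print("max |B u - lambda u|:        ", np.abs(residual).max())
print("squared singular values of S:", np.sort(sing_vals**2))
print("eigenvalues of S^T S:        ", eigvals_StS)
```

Comparing the printed rows for a few different values of `alpha` and a few random matrices is a quick way to catch a sign or ordering mistake in a hand derivation.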

Exercise 5 (Setting up Python, 0 points)

During this course you will be required to implement some of the algorithms presented in class.
We will use Python and in particular Jupyter notebooks. In order to save time you should install
Python and all the required packages on your laptop. Here you will find instructions on how to
do so. We will present two methods. The first one is the easiest and is recommended if you do not
have any previous experience with Python. The second one is more suitable for those who know pip.

Installation
1) Anaconda. All you need to do is to follow the instructions that you find at the following
links. Select the correct one for your operating system and follow the instructions. When you need
to decide what to download, please download ANACONDA and not MINICONDA. Furthermore,
download the “Python 3.X version”, NOT the “Python 2.X version”.
• Windows: https://conda.io/projects/conda/en/latest/user-guide/install/windows.html
• MacOS: https://conda.io/projects/conda/en/latest/user-guide/install/macos.html
• Linux: https://conda.io/projects/conda/en/latest/user-guide/install/linux.html

2) Pip. We will use the following packages: numpy, scikit-learn, pandas, matplotlib, jupyter.

For example, on Ubuntu, Debian and derivatives

sudo pip3 install numpy scikit-learn pandas matplotlib jupyter
or
pip3 install --user numpy scikit-learn pandas matplotlib jupyter

Test
Now it is time to see if everything we need is installed. Together with this sheet you should have
a file named Assignment 1.ipynb. We will use it to test that everything is correctly installed.

First we need to launch Jupyter. How to do this depends on your operating system:

• Windows: Start → “Jupyter Notebook”

• MacOS/Linux: Open a terminal in the folder that contains the Assignment 1.ipynb file and
run jupyter notebook

Once Jupyter is running, navigate your folder structure until you find the Assignment 1.ipynb
file and click on it. Once it is open, please click on Cell → Run All. If it says that you are ready
to go then you are ready to go. Otherwise ask for help at the tutorial.
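
If the notebook is not at hand, a quick check of the installation can also be run from a plain Python prompt. The snippet below is a hypothetical helper of ours, not the content of the course notebook. It assumes the usual import names for the packages listed above (in particular `sklearn` for scikit-learn) and only tests whether each module can be located, without importing it.

```python
import importlib.util

# Import names for the packages listed on this sheet.
# Assumption: scikit-learn is imported as "sklearn"; the others use their package name.
REQUIRED_MODULES = ["numpy", "sklearn", "pandas", "matplotlib", "jupyter"]


def find_missing(modules):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


missing = find_missing(REQUIRED_MODULES)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages were found.")
```

Run it with the same Python interpreter that Jupyter uses; a package installed for a different interpreter will still be reported as missing.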
