HW01 - Math Recap
The machine learning lecture relies on your knowledge of undergraduate mathematics, especially linear
algebra and probability theory. You should think of this homework as a test to see if you meet the
prerequisites for taking this course.
Homework
Reading
We strongly recommend that you review the following documents to refresh your knowledge. You should
already be familiar with most of their content from your previous studies.
Linear Algebra
Notation. We use the following notation in this lecture:
Problem 1: Consider the expression

f(x, y, Z) = x^T A y + B x − x^T C Z D − y^T E^T y + F.

What should be the dimensions (shapes) of the matrices A, B, C, D, E, F for the expression above to be a valid mathematical expression?
Problem 2: Let x ∈ R^N, M ∈ R^{N×N}. Express the function

f(x) = Σ_{i=1}^N Σ_{j=1}^N x_i x_j M_ij

using only matrix-vector multiplications. Show your work and briefly explain your steps.
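Once you have a candidate matrix-vector form, you can sanity-check it numerically. A minimal NumPy sketch (the sizes and the candidate expression `x @ M @ x` are illustrative assumptions for the check, not the graded derivation):

```python
import numpy as np

rng = np.random.default_rng(0)
N = 5
x = rng.standard_normal(N)
M = rng.standard_normal((N, N))

# The double sum from the problem statement, computed index by index.
f_sum = sum(x[i] * x[j] * M[i, j] for i in range(N) for j in range(N))

# A candidate matrix-vector form to test against the double sum.
f_mat = x @ M @ x

assert np.isclose(f_sum, f_mat)
```

Such a check does not replace the derivation, but it catches transposition and indexing mistakes quickly.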
Upload a single PDF file with your homework solution to Moodle by 30.01.2024, 23:59. We recommend typesetting your solution
(using LaTeX or Word), but handwritten solutions are also accepted (bring them to the next lecture or put them in my box).
Collaboration is fine, but submitting the same or extremely similar solutions is not allowed. The homework rules are in the syllabus.
CS251/CS340 Machine Learning Page 2
Let A ∈ R^{M×N} and b ∈ R^M, and consider the system of linear equations

Ax = b. (1)

a) Under what conditions does the system of linear equations have a unique solution x for any choice of b?
b) Assume that M = N = 4 and that A has the following eigenvalues: {−1, 0, 4, 4}. Does Equation 1 have a unique solution x for any choice of b? Justify your answer.
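For part b) you can also experiment numerically. The sketch below builds one (illustrative) matrix with exactly the eigenvalues {−1, 0, 4, 4} via a similarity transform and inspects its rank:

```python
import numpy as np

rng = np.random.default_rng(1)
Q = rng.standard_normal((4, 4))  # a generic Q is invertible with high probability
A = Q @ np.diag([-1.0, 0.0, 4.0, 4.0]) @ np.linalg.inv(Q)

# det(A) equals the product of the eigenvalues, and a zero eigenvalue
# makes A singular (rank-deficient).
print(np.linalg.matrix_rank(A))  # expect 3
print(np.linalg.det(A))          # expect (numerically) 0
```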
Problem 5: A symmetric matrix A ∈ R^{N×N} is positive semi-definite (PSD) if and only if for any x ∈ R^N it holds that x^T A x ≥ 0. Prove that a symmetric matrix A is PSD if and only if it has no negative eigenvalues.
Problem 6: Let A ∈ R^{M×N}. Prove that the matrix B = A^T A is positive semi-definite for any A.
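Before writing the proof, you can convince yourself numerically. A minimal sketch (the shape of A here is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((3, 5))   # any rectangular A
B = A.T @ A                       # B is 5x5 and symmetric

# eigvalsh is the eigensolver for symmetric matrices; all eigenvalues of B
# should be nonnegative (up to floating-point roundoff).
eigvals = np.linalg.eigvalsh(B)
assert np.all(eigvals >= -1e-10)
```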
Calculus
Problem 7: Consider the following function f : R → R,

f(x) = (1/2) a x^2 + b x + c,

and the optimization problem

min_{x ∈ R} f(x).
a) Under what conditions does this optimization problem have (i) a unique solution, (ii) infinitely many solutions, or (iii) no solution? Justify your answer.
b) Assume that the optimization problem has a unique solution. Write down the closed-form expression for x∗ that minimizes the objective function, i.e., find x∗ = arg min_{x ∈ R} f(x).
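You can cross-check your closed-form x∗ against a brute-force grid search (the coefficients below are illustrative; any a > 0 works):

```python
import numpy as np

a, b, c = 2.0, -3.0, 1.0                  # illustrative coefficients, a > 0
f = lambda x: 0.5 * a * x**2 + b * x + c

# Evaluate f on a fine grid and take the minimizing grid point.
xs = np.linspace(-10.0, 10.0, 200_001)
x_star_numeric = xs[np.argmin(f(xs))]
print(x_star_numeric)  # compare with your closed-form x*
```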
Now consider the analogous problem in R^N. For a symmetric matrix A ∈ R^{N×N}, a vector b ∈ R^N, and c ∈ R, let

g(x) = (1/2) x^T A x + b^T x + c,

and consider the optimization problem

min_{x ∈ R^N} g(x).
a) Compute the Hessian ∇²g(x) of the objective function. Under what conditions does this optimization problem have a unique solution?
b) Why is it necessary for a matrix A to be PSD for the optimization problem to be well-defined?
What happens if A has a negative eigenvalue?
c) Assume that the matrix A is positive definite (PD). Write down the closed-form expression for x∗ that minimizes the objective function, i.e., find x∗ = arg min_{x ∈ R^N} g(x).
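As in the scalar case, a numerical check is easy: run plain gradient descent (using ∇g(x) = Ax + b for symmetric A) on an illustrative positive definite instance and compare with your closed-form x∗:

```python
import numpy as np

rng = np.random.default_rng(3)
N = 4
M = rng.standard_normal((N, N))
A = M @ M.T + N * np.eye(N)   # symmetric positive definite by construction
b = rng.standard_normal(N)

# Gradient descent on g(x) = 0.5 x^T A x + b^T x + c; the gradient is A x + b.
x = np.zeros(N)
for _ in range(5000):
    x -= 0.01 * (A @ x + b)

print(np.linalg.norm(A @ x + b))  # gradient norm at the iterate: close to 0
```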
Probability Theory
Notation. We use the following notation in our lecture:
• For conciseness and to avoid clutter, we use p(x) to denote multiple things:
1. If X is a discrete random variable, p(x) denotes the probability mass function (PMF) of X at point x (usually denoted as p_X(x) or p(X = x) in the statistics literature).
2. If X is a continuous random variable, p(x) denotes the probability density function (PDF) of X at point x (usually denoted as f_X(x) in the statistics literature).
3. If A ⊆ Ω is an event, p(A) denotes the probability of this event (usually denoted as Pr(A) or P(A) in the statistics literature).
You will mostly encounter (1) and (2) throughout the lecture. Usually, the meaning is clear from the
context.
• Given the distribution p(x), we may be interested in computing the expected value E_{p(x)}[f(x)] or, equivalently, E_X[f(x)]. Usually, it is clear with respect to which distribution we are computing the expectation, so we omit the subscript and simply write E[f(x)].
• x ∼ p means that x is distributed (sampled) according to the distribution p. For example, x ∼ N(µ, σ²) (or equivalently p(x) = N(µ, σ²)) means that x is distributed according to the normal distribution with mean µ and variance σ².
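In code, "x ∼ N(µ, σ²)" corresponds to drawing samples from that distribution. A small NumPy sketch (the values of µ and σ are illustrative):

```python
import numpy as np

rng = np.random.default_rng(7)
mu, sigma = 0.0, 1.0
samples = rng.normal(mu, sigma, size=100_000)  # x ~ N(mu, sigma^2)

# Empirical mean and standard deviation approximate mu and sigma.
print(samples.mean(), samples.std())
```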
Problem 10: Exponential families include many of the most common distributions, such as normal,
exponential, Bernoulli, categorical, etc. You are given the general form of the PDF (PMF in the discrete
case) p_θ(x) (also written as p(x | θ)) of the distributions from the exponential family below:

p(x | θ) = h(x) c(θ) exp[ Σ_{i=1}^k w_i(θ) t_i(x) ],  θ ∈ Θ,

where Θ is the parameter space, h(x) ≥ 0 and the t_i(x) only depend on x, and similarly, c(θ) ≥ 0 and the w_i(θ) only depend on the (possibly vector-valued) parameter θ.
Your task is to express the Binomial distribution as an exponential family distribution. Also express the Beta distribution as an exponential family distribution. Show that the product of the Beta and the Binomial distribution is also a member of the exponential family.
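Once you have identified h, c, the w_i and the t_i for the Binomial distribution, you can verify your factorization numerically. The sketch below checks one possible factorization against the textbook PMF; treat it as a template for checking your own derivation, not as the write-up itself:

```python
import math

# Candidate factorization of Binomial(n, p):
#   h(x) = C(n, x), c(p) = (1 - p)^n, w(p) = log(p / (1 - p)), t(x) = x.
n, p = 10, 0.3
for x in range(n + 1):
    pmf = math.comb(n, x) * p**x * (1 - p) ** (n - x)
    fact = math.comb(n, x) * (1 - p) ** n * math.exp(x * math.log(p / (1 - p)))
    assert math.isclose(pmf, fact)
print("factorization matches the PMF for all x")
```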
Problem 12: Consider the following bivariate distribution p(x, y) of two discrete random variables X
and Y .
Compute:
Problem 13: You are given the joint PDF p(a, b, c) of three continuous random variables. Show how the following expressions can be obtained using the rules of probability:
1. p(a)
2. p(c | a, b)
3. p(b | c)
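The same rules can be exercised on a discrete joint distribution, where marginalization becomes summation over array axes. A NumPy sketch (the shape of the joint is illustrative):

```python
import numpy as np

rng = np.random.default_rng(4)
p_abc = rng.random((2, 3, 4))   # axes index the values of a, b, c
p_abc /= p_abc.sum()            # normalize to a valid joint distribution

p_a = p_abc.sum(axis=(1, 2))                           # marginalize out b and c
p_ab = p_abc.sum(axis=2)                               # p(a, b)
p_c_given_ab = p_abc / p_ab[:, :, None]                # p(c | a, b) = p(a,b,c) / p(a,b)
p_bc = p_abc.sum(axis=0)                               # p(b, c)
p_b_given_c = p_bc / p_bc.sum(axis=0, keepdims=True)   # p(b | c) = p(b,c) / p(c)

assert np.isclose(p_a.sum(), 1.0)
assert np.allclose(p_c_given_ab.sum(axis=2), 1.0)
assert np.allclose(p_b_given_c.sum(axis=0), 1.0)
```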
Problem 14: In this problem, there are two bowls. The first bowl holds three pineapples and three oranges, while the second bowl has three pineapples and five oranges. Additionally, there is a biased coin, which lands on "tails" with a 0.7 probability and "heads" with a 0.3 probability. If the coin lands on "heads", a piece of fruit is randomly selected from the first bowl; if it lands on "tails", the fruit is chosen from the second bowl. Your friend flips the coin (which you cannot see), selects a piece of fruit from the corresponding bowl, and hands you a pineapple. Determine the probability that the pineapple was selected from the second bowl.
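A Monte Carlo simulation is a useful sanity check for your Bayes' rule calculation. The sketch below simply replays the story many times; the exact answer should come from the hand derivation:

```python
import random

random.seed(0)
n_trials = 200_000
pineapples = from_bowl2 = 0
for _ in range(n_trials):
    tails = random.random() < 0.7   # biased coin: tails with probability 0.7
    if tails:                       # tails -> bowl 2: 3 pineapples, 5 oranges
        got_pineapple = random.random() < 3 / 8
    else:                           # heads -> bowl 1: 3 pineapples, 3 oranges
        got_pineapple = random.random() < 3 / 6
    if got_pineapple:
        pineapples += 1
        from_bowl2 += tails

print(from_bowl2 / pineapples)  # estimate of P(bowl 2 | pineapple)
```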
Problem 15: (Iterated Expectations) Consider two random variables X, Y with joint distribution p(x, y).
Show that
E_X[x] = E_Y[ E_X[x | y] ].

Here, E_X[x | y] denotes the expected value of x under the conditional distribution p(x | y).
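The identity can be checked numerically on a small discrete joint distribution (the table below is illustrative; rows index the values of X, columns the values of Y):

```python
import numpy as np

x_vals = np.array([0.0, 1.0, 2.0])
p_xy = np.array([[0.10, 0.20],
                 [0.25, 0.15],
                 [0.05, 0.25]])   # joint p(x, y), sums to 1

p_y = p_xy.sum(axis=0)                                    # marginal p(y)
e_x = (x_vals[:, None] * p_xy).sum()                      # E[X] directly
e_x_given_y = (x_vals[:, None] * p_xy).sum(axis=0) / p_y  # E[X | Y = y]

# Tower property: averaging the conditional expectations over p(y) gives E[X].
assert np.isclose(e_x, (e_x_given_y * p_y).sum())
```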
Problem 16: Let X ∼ N(µ, σ²), and f(x) = a x + b x² + c. What is E[f(X)]?
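A Monte Carlo estimate lets you check the closed form you derive (µ, σ, and the coefficients below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(5)
mu, sigma = 1.0, 2.0
a, b, c = 3.0, -1.0, 0.5

x = rng.normal(mu, sigma, size=1_000_000)
est = np.mean(a * x + b * x**2 + c)   # Monte Carlo estimate of E[f(X)]
print(est)                            # compare with your closed-form answer
```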
Problem 17: Let p(x) = N(x | µ, Σ), and g(x) = Ax (where A ∈ R^{N×N}). What are the values of the following expressions:
• E[g(x)],
• E[g(x) g(x)^T],
• E[g(x)^T g(x)],
• the covariance matrix Cov[g(x)].
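All of these quantities can be estimated by sampling and compared against your closed-form answers. A NumPy sketch (A, µ, and Σ below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(6)
N = 3
A = rng.standard_normal((N, N))
mu = rng.standard_normal(N)
L = rng.standard_normal((N, N))
Sigma = L @ L.T + np.eye(N)      # a valid (positive definite) covariance matrix

x = rng.multivariate_normal(mu, Sigma, size=500_000)
gx = x @ A.T                     # g(x) = A x, applied to every sample row

mean_g = gx.mean(axis=0)         # estimates E[g(x)]
cov_g = np.cov(gx, rowvar=False) # estimates Cov[g(x)]
print(mean_g)
print(cov_g)
```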