
CS 229: Machine Learning

Problem Set 0

William Ma

July 20, 2017


1 Problem 1
1a Part a
Given $f(x) = \frac{1}{2}x^T A x + b^T x$, where $A$ is a symmetric matrix and $b \in \mathbb{R}^n$ is a vector, we can calculate $\nabla_x f(x)$ by taking the partial derivative

$$
\begin{aligned}
\frac{\partial f(x)}{\partial x_k}
&= \frac{\partial}{\partial x_k}\Big[\frac{1}{2}\sum_{i=1}^n\sum_{j=1}^n x_i A_{ij} x_j + \sum_{i=1}^n b_i x_i\Big] \\
&= \frac{\partial}{\partial x_k}\frac{1}{2}\Big[\sum_{i\neq k}\sum_{j\neq k} A_{ij} x_i x_j + \sum_{i\neq k} A_{ik} x_i x_k + \sum_{j\neq k} A_{kj} x_k x_j + A_{kk} x_k^2\Big] + \frac{\partial}{\partial x_k}\sum_{i=1}^n b_i x_i \\
&= \frac{1}{2}\sum_{i\neq k} A_{ik} x_i + \frac{1}{2}\sum_{j\neq k} A_{kj} x_j + A_{kk} x_k + b_k \\
&= \frac{1}{2}\sum_{i=1}^n A_{ik} x_i + \frac{1}{2}\sum_{j=1}^n A_{kj} x_j + b_k \\
&= \sum_{i=1}^n A_{ki} x_i + b_k
\end{aligned}
$$

where the last step uses the symmetry of $A$ (i.e., $A_{ik} = A_{ki}$). Stacking these entries into a vector, we see that $\nabla_x f(x) = Ax + b$.
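As a quick numerical sanity check (not part of the original solution), the closed form $\nabla_x f(x) = Ax + b$ can be compared against central finite differences; the matrix and vectors below are arbitrary test values.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
M = rng.standard_normal((n, n))
A = M + M.T                         # arbitrary symmetric test matrix
b = rng.standard_normal(n)
x = rng.standard_normal(n)

def f(x):
    return 0.5 * x @ A @ x + b @ x

grad_closed = A @ x + b             # the derived formula

# central finite differences, one coordinate at a time
eps = 1e-6
grad_fd = np.array([
    (f(x + eps * e) - f(x - eps * e)) / (2 * eps)
    for e in np.eye(n)
])

assert np.allclose(grad_closed, grad_fd, atol=1e-5)
```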

1b Part b
Given $f(x) = g(h(x))$, where $g : \mathbb{R} \to \mathbb{R}$ is differentiable and $h : \mathbb{R}^n \to \mathbb{R}$ is differentiable, we can expand $f(x)$ to arrive at the solution

$$\frac{\partial f(x)}{\partial x_k} = \frac{\partial}{\partial x_k}\, g(h(x))$$

By invoking the chain rule,

$$\frac{\partial f(x)}{\partial x_k} = g'(h(x))\,\frac{\partial h(x)}{\partial x_k}$$

Combining these back into a vector,

$$\nabla f(x) = \begin{bmatrix} g'(h(x))\,\dfrac{\partial h(x)}{\partial x_1} \\ \vdots \\ g'(h(x))\,\dfrac{\partial h(x)}{\partial x_n} \end{bmatrix} = g'(h(x))\,\nabla h(x)$$
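To illustrate (the concrete choices of $g$ and $h$ below are my own, not from the problem set), take $h(x) = \|x\|^2$ and $g(t) = \sin t$, so the formula predicts $\nabla f(x) = \cos(\|x\|^2)\,2x$:

```python
import numpy as np

x = np.array([0.3, -0.7, 0.5])

# concrete choices: h(x) = ||x||^2, g(t) = sin(t), so f(x) = sin(||x||^2)
f = lambda x: np.sin(x @ x)
grad_closed = np.cos(x @ x) * 2 * x     # g'(h(x)) * grad h(x)

eps = 1e-6
grad_fd = np.array([
    (f(x + eps * e) - f(x - eps * e)) / (2 * eps)
    for e in np.eye(x.size)
])
assert np.allclose(grad_closed, grad_fd, atol=1e-5)
```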

1c Part c
Given $f(x) = \frac{1}{2}x^T A x + b^T x$, where $A$ is a symmetric matrix and $b \in \mathbb{R}^n$ is a vector, we can calculate the Hessian by differentiating the gradient entries found in Part a,

$$\frac{\partial f(x)}{\partial x_k} = \sum_{i=1}^n A_{ki} x_i + b_k,$$

once more:

$$\frac{\partial^2 f(x)}{\partial x_l\, \partial x_k} = \frac{\partial}{\partial x_l}\Big[\sum_{i=1}^n A_{ki} x_i + b_k\Big] = A_{kl}$$

Thus, $\nabla^2 f(x) = A$.
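As with the gradient, $\nabla^2 f(x) = A$ can be checked numerically (a sketch with arbitrary test data, not part of the original solution) by finite-differencing the gradient from Part a column by column:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 3
M = rng.standard_normal((n, n))
A = M + M.T                         # arbitrary symmetric test matrix
b = rng.standard_normal(n)
x = rng.standard_normal(n)

grad = lambda x: A @ x + b          # gradient formula from Part a

# finite-difference each column of the Hessian
eps = 1e-6
H = np.column_stack([
    (grad(x + eps * e) - grad(x - eps * e)) / (2 * eps)
    for e in np.eye(n)
])
assert np.allclose(H, A, atol=1e-5)
```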

1d Part d
Given $f(x) = g(a^T x)$, where $g : \mathbb{R} \to \mathbb{R}$ is continuously differentiable and $a \in \mathbb{R}^n$ is a vector, we can calculate $\nabla f(x)$ using the results from Parts a and b:

$$\nabla f(x) = g'(a^T x)\,\nabla(a^T x) = g'(a^T x)\,a$$

For the Hessian, we differentiate each entry of the gradient, apply the chain rule to each term, then recombine into a matrix:

$$\frac{\partial^2 f(x)}{\partial x_i\, \partial x_j} = \frac{\partial}{\partial x_i}\big[g'(a^T x)\,a_j\big] = g''(a^T x)\,a_i a_j$$

$$\nabla^2 f(x) = \begin{bmatrix} g''(a^T x)\,a_1 a_1 & \cdots & g''(a^T x)\,a_1 a_n \\ \vdots & \ddots & \vdots \\ g''(a^T x)\,a_n a_1 & \cdots & g''(a^T x)\,a_n a_n \end{bmatrix} = g''(a^T x)\,aa^T$$

Thus, $\nabla^2 f(x) = g''(a^T x)\,aa^T$.
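Both formulas can be exercised on a concrete example (my own choice, not from the problem set): with $g = \exp$, we have $g' = g'' = \exp$, so $\nabla f = e^{a^T x} a$ and $\nabla^2 f = e^{a^T x} a a^T$.

```python
import numpy as np

a = np.array([0.5, -1.0, 2.0])
x = np.array([0.1, 0.2, -0.3])

# concrete choice: g = exp, so g' = g'' = exp
f = lambda x: np.exp(a @ x)
grad_closed = np.exp(a @ x) * a
hess_closed = np.exp(a @ x) * np.outer(a, a)

eps = 1e-6
I = np.eye(3)
grad_fd = np.array([(f(x + eps * e) - f(x - eps * e)) / (2 * eps) for e in I])
assert np.allclose(grad_closed, grad_fd, atol=1e-5)

# finite-difference the gradient to approximate the Hessian
grad_f = lambda x: np.exp(a @ x) * a
hess_fd = np.column_stack([
    (grad_f(x + eps * e) - grad_f(x - eps * e)) / (2 * eps) for e in I
])
assert np.allclose(hess_closed, hess_fd, atol=1e-4)
```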

2 Problem 2
2a Part a
Proof. Given $z \in \mathbb{R}^n$ and $A = zz^T$, we have $A \in S_+^{n \times n}$ if $A = A^T$ and $x^T A x \geq 0$ for all $x$.

Symmetry:
$$A^T = (zz^T)^T = (z^T)^T z^T = zz^T = A$$

Positive semi-definiteness: for any $x \in \mathbb{R}^n$,
$$x^T A x = x^T z z^T x = (z^T x)(z^T x) = (z^T x)^2 \geq 0$$

Thus, since $A = A^T$ and $x^T A x \geq 0$, $A \in S_+^{n \times n}$. $\square$
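A small numerical check (arbitrary $z$, not part of the original proof): $zz^T$ should be symmetric with no negative eigenvalues.

```python
import numpy as np

z = np.array([1.0, -2.0, 0.5])
A = np.outer(z, z)                  # A = z z^T

assert np.allclose(A, A.T)          # symmetric
eigvals = np.linalg.eigvalsh(A)
assert np.all(eigvals >= -1e-9)     # all eigenvalues nonnegative => PSD
```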

2b Part b
Given that $z \in \mathbb{R}^n$ is a non-zero vector and $A = zz^T$, the null space of $A$ is exactly the set of vectors orthogonal to $z$. If $z^T x = 0$, then

$$Ax = zz^T x = z(z^T x) = z \cdot 0 = 0,$$

and conversely, $Ax = 0$ implies $0 = x^T A x = (z^T x)^2$, so $z^T x = 0$. The orthogonal complement of a non-zero vector has dimension $n - 1$, so the nullity of $A$ is $n - 1$. By the rank–nullity theorem, the rank of $A$ is $1$.
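The rank-1 claim is easy to verify numerically (a sketch with an arbitrary non-zero $z$, not part of the original solution):

```python
import numpy as np

z = np.array([1.0, -2.0, 0.5, 3.0])      # non-zero z in R^4
A = np.outer(z, z)
assert np.linalg.matrix_rank(A) == 1     # rank 1, so nullity n - 1 = 3

# any x orthogonal to z lies in the null space
x = np.array([2.0, 1.0, 0.0, 0.0])       # z^T x = 2 - 2 = 0
assert np.allclose(A @ x, 0)
```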

2c Part c
Proof. Given $A \in S_+^{n \times n}$ and arbitrary $B \in \mathbb{R}^{m \times n}$, we must show $BAB^T \in S_+^{m \times m}$.

Symmetry (using $A^T = A$):
$$(BAB^T)^T = (B^T)^T A^T B^T = BAB^T$$

Positive semi-definiteness: for any $x \in \mathbb{R}^m$, let $y = B^T x$. Since $A \in S_+^{n \times n}$, $y^T A y \geq 0$, so
$$x^T BAB^T x = (B^T x)^T A\, (B^T x) = y^T A y \geq 0$$

Thus, since $BAB^T = (BAB^T)^T$ and $x^T BAB^T x \geq 0$, $BAB^T \in S_+^{m \times m}$. $\square$
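Again a quick numerical illustration (arbitrary test matrices, not part of the original proof): a PSD $A$ is built as $ZZ^T$, and $BAB^T$ is checked for symmetry and nonnegative eigenvalues.

```python
import numpy as np

rng = np.random.default_rng(2)
n, m = 4, 3
Z = rng.standard_normal((n, n))
A = Z @ Z.T                              # symmetric PSD, n x n
B = rng.standard_normal((m, n))          # arbitrary m x n

C = B @ A @ B.T                          # should lie in S_+^{m x m}
assert np.allclose(C, C.T)
assert np.all(np.linalg.eigvalsh(C) >= -1e-8)
```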

3 Problem 3
3a Part a
Proof. Given that $A$ is diagonalizable as $A = T\Lambda T^{-1}$ and $t^{(i)} \in \mathbb{R}^n$ is the $i$-th column of $T$,

$$At^{(i)} = T\Lambda T^{-1} t^{(i)}$$

Since $T^{-1}T = I$, multiplying $T^{-1}$ by $t^{(i)}$, the $i$-th column of $T$, returns the $i$-th column of $I$, i.e. the standard basis vector $e_i$ with

$$(e_i)_j = \begin{cases} 1, & \text{if } j = i \\ 0, & \text{otherwise} \end{cases}$$

Thus,

$$At^{(i)} = T\Lambda T^{-1} t^{(i)} = T\Lambda e_i = T(\lambda_i e_i) = \lambda_i t^{(i)}$$

Thus, $At^{(i)} = \lambda_i t^{(i)}$, where $(t^{(i)}, \lambda_i)$ is an eigenvector/eigenvalue pair of $A$. $\square$
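A tiny concrete example (my own, not from the problem set): build $A = T\Lambda T^{-1}$ from a chosen $T$ and $\Lambda$, then confirm each column of $T$ is an eigenvector with the matching diagonal entry of $\Lambda$ as its eigenvalue.

```python
import numpy as np

T = np.array([[1.0, 1.0],
              [0.0, 1.0]])                   # invertible
Lam = np.diag([2.0, 5.0])
A = T @ Lam @ np.linalg.inv(T)

for i in range(2):
    t = T[:, i]
    assert np.allclose(A @ t, Lam[i, i] * t)  # A t^(i) = lambda_i t^(i)
```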

3b Part b
Proof. Given that $A$ is symmetric, it diagonalizes as $A = U\Lambda U^{-1}$ with $U$ orthogonal; let $u^{(i)} \in \mathbb{R}^n$ be the $i$-th column of $U$. Since $U$ is orthogonal, $U^{-1} = U^T$, so

$$Au^{(i)} = U\Lambda U^T u^{(i)} = U\Lambda U^{-1} u^{(i)}$$

Applying the result from Problem 3a, $Au^{(i)} = \lambda_i u^{(i)}$, where $(u^{(i)}, \lambda_i)$ is an eigenvector/eigenvalue pair of $A$. $\square$

3c Part c
Proof. Given $A \in S_+^{n \times n}$ with eigenvalue $\lambda_i$ and corresponding unit eigenvector $u^{(i)}$ (from Problem 3b), set $x = u^{(i)}$ in the definition of positive semi-definiteness:

$$0 \leq x^T A x = u^{(i)T} A u^{(i)} = u^{(i)T} \big(\lambda_i u^{(i)}\big) = \lambda_i \|u^{(i)}\|^2 = \lambda_i$$

Thus $\lambda_i \geq 0$ for every eigenvalue of $A$. $\square$
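The quadratic-form argument above can be mirrored numerically (a sketch with an arbitrary PSD test matrix): each eigenvalue equals $u^T A u$ for the corresponding unit eigenvector, and none is negative.

```python
import numpy as np

rng = np.random.default_rng(3)
Z = rng.standard_normal((4, 4))
A = Z @ Z.T                               # symmetric PSD test matrix

eigvals, U = np.linalg.eigh(A)            # eigh returns unit eigenvectors
assert np.all(eigvals >= -1e-8)

# the quadratic-form identity: u^T A u = lambda * ||u||^2 = lambda
for lam, u in zip(eigvals, U.T):
    assert abs(u @ A @ u - lam) < 1e-8
```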
