Lec7matrixnorm Part3

The document discusses low-rank matrix approximation and its applications. It states that the best rank-k approximation of a matrix A under the Frobenius and spectral norms is its truncated SVD Ak, which is the approximation that minimizes the error. It also provides an example of computing the best rank-1 approximation of a sample matrix. Additionally, it covers orthogonal best-fit subspace problems and proves that the optimal solution is given by the top k right singular vectors of the centered data matrix.


Matrix norm and low-rank approximation

Low-rank approximation of matrices


Problem. For any matrix $A \in \mathbb{R}^{n \times d}$ and integer $k \geq 1$, find the rank-$k$ matrix $B$ that is closest to $A$ (under a given norm such as the Frobenius or spectral norm):

$$\min_{B \in \mathbb{R}^{n \times d}:\ \mathrm{rank}(B) = k} \|A - B\|$$

Remark. This problem arises in a number of tasks, e.g.,

• Orthogonal least squares fitting

• Data compression (and noise reduction)

• Recommender systems


Theorem 0.6 (Eckart–Young–Mirsky). Given $A \in \mathbb{R}^{n \times d}$ and $1 \leq k \leq \mathrm{rank}(A)$, let $A_k$ be the truncated SVD of $A$ with the largest $k$ terms: $A_k = \sum_{i=1}^{k} \sigma_i u_i v_i^T$. Then $A_k$ is the best rank-$k$ approximation to $A$ in terms of both the Frobenius and spectral norms:²

$$\min_{B:\ \mathrm{rank}(B) = k} \|A - B\|_F = \|A - A_k\|_F = \sqrt{\sum_{i > k} \sigma_i^2},$$

$$\min_{B:\ \mathrm{rank}(B) = k} \|A - B\|_2 = \|A - A_k\|_2 = \sigma_{k+1}.$$

Remark. The theorem still holds if the equality constraint $\mathrm{rank}(B) = k$ is relaxed to $\mathrm{rank}(B) \leq k$ (which also admits all matrices of lower rank).

² Proof available at https://en.wikipedia.org/wiki/Low-rank_approximation
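
As a quick numerical check of the theorem (not part of the original slides), the following NumPy sketch compares the truncated-SVD errors against the two formulas above; the matrix A and the rank k are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 4))   # an arbitrary matrix A in R^{n x d}
k = 2                             # target rank (arbitrary, 1 <= k <= rank(A))

# Full SVD: A = U diag(s) V^T with singular values in decreasing order
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Truncated SVD A_k = sum_{i=1}^k sigma_i u_i v_i^T
A_k = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]

# Frobenius error equals sqrt of the sum of the discarded squared singular values
print(np.linalg.norm(A - A_k, 'fro'), np.sqrt(np.sum(s[k:] ** 2)))

# Spectral error equals sigma_{k+1}
print(np.linalg.norm(A - A_k, 2), s[k])
```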


Example 0.5. For the matrix

$$X = \begin{pmatrix} 1 & -1 \\ 0 & 1 \\ 1 & 0 \end{pmatrix},$$

the best rank-1 approximation is

$$X_1 = \sigma_1 u_1 v_1^T = \sqrt{3} \begin{pmatrix} \tfrac{2}{\sqrt{6}} \\ -\tfrac{1}{\sqrt{6}} \\ \tfrac{1}{\sqrt{6}} \end{pmatrix} \begin{pmatrix} \tfrac{1}{\sqrt{2}} & -\tfrac{1}{\sqrt{2}} \end{pmatrix} = \begin{pmatrix} 1 & -1 \\ -\tfrac{1}{2} & \tfrac{1}{2} \\ \tfrac{1}{2} & -\tfrac{1}{2} \end{pmatrix}.$$

In this problem, the approximation error under either norm (spectral or Frobenius) is the same: $\|X - X_1\| = \sigma_2 = 1$.
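
The computation in Example 0.5 can be reproduced numerically; a small sketch (not from the slides) using NumPy:

```python
import numpy as np

X = np.array([[1., -1.],
              [0.,  1.],
              [1.,  0.]])

U, s, Vt = np.linalg.svd(X, full_matrices=False)
print(s)   # approx [1.732, 1.0], i.e., sigma_1 = sqrt(3), sigma_2 = 1

# Best rank-1 approximation X_1 = sigma_1 u_1 v_1^T
X1 = s[0] * np.outer(U[:, 0], Vt[0, :])
print(X1)  # approx [[1, -1], [-0.5, 0.5], [0.5, -0.5]]

# Error under either norm is sigma_2 = 1
print(np.linalg.norm(X - X1, 2), np.linalg.norm(X - X1, 'fro'))
```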


Applications of low-rank approximation

• Orthogonal least-squares fitting

• Image compression


Orthogonal Best-Fit Subspace

Problem: Given data $x_1, \ldots, x_n \in \mathbb{R}^d$ and an integer $0 < k < d$, find the $k$-dimensional orthogonal "best-fit" plane $S$ by solving

$$\min_{S} \sum_{i=1}^{n} \|x_i - P_S(x_i)\|_2^2$$

Remark. This problem is different from ordinary linear regression:

• No predictor-response distinction

• Orthogonal (not vertical) fitting errors

[Figure: data points $x_i$ and their orthogonal projections $P_S(x_i)$ onto a plane $S$]


Theorem 0.7. An orthogonal best-fit $k$-dimensional plane to the data $X = [x_1, \ldots, x_n]^T \in \mathbb{R}^{n \times d}$ is given by

$$x = \bar{x} + V_k \alpha,$$

where $\bar{x}$ is the center of the data set,

$$\bar{x} = \frac{1}{n} \sum_{i} x_i,$$

and $V_k = [v_1, \ldots, v_k]$ is a $d \times k$ matrix whose columns are the top $k$ right singular vectors of the centered data matrix

$$\tilde{X} = [x_1 - \bar{x}, \ldots, x_n - \bar{x}]^T = X - \mathbf{1}\bar{x}^T.$$

[Figure: best-fit plane $S$ through $\bar{x}$, spanned by $v_1$ and $v_2$, with data points and their orthogonal projections]
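
A minimal NumPy sketch (not part of the slides) of Theorem 0.7: center the data, take the top $k$ right singular vectors, and project. The synthetic data and the choice k = 2 are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((100, 5))   # n = 100 points in R^d, d = 5 (synthetic)
k = 2                               # dimension of the fitted plane

xbar = X.mean(axis=0)               # center of the data set
Xc = X - xbar                       # centered data matrix X_tilde

# Top-k right singular vectors form an orthonormal basis of the best-fit plane
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
Vk = Vt[:k, :].T                    # d x k

# Orthogonal projection of each point onto the fitted plane x = xbar + V_k alpha
P = xbar + (Xc @ Vk) @ Vk.T

# Total squared orthogonal error equals the sum of the discarded sigma_i^2
print(np.sum((X - P) ** 2), np.sum(s[k:] ** 2))
```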

Proof. Suppose an arbitrary $k$-dimensional plane $S$ is used to fit the data, with a fixed point $m \in \mathbb{R}^d$ and an orthonormal basis

$$B = [b_1, \ldots, b_k] \in \mathbb{R}^{d \times k}.$$

That is,

$$B^T B = I_k, \qquad BB^T:\ \text{orthogonal projection onto } S.$$

The projection of each data point $x_i$ onto the candidate plane is

$$P_S(x_i) = m + BB^T (x_i - m).$$

[Figure: candidate plane $S$ through $m$, spanned by $b_1$ and $b_2$, with a data point $x_i$ and its projection $P_S(x_i)$]
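
For illustration (not from the slides), the projection formula can be coded directly; the point m and the basis B below are arbitrary placeholders.

```python
import numpy as np

rng = np.random.default_rng(2)
d, k = 4, 2
m = rng.standard_normal(d)                        # a fixed point on the plane
B, _ = np.linalg.qr(rng.standard_normal((d, k)))  # orthonormal basis: B^T B = I_k

x = rng.standard_normal(d)                        # a data point
p = m + B @ (B.T @ (x - m))                       # its projection P_S(x)

# The residual x - p is orthogonal to the plane (to every column of B)
print(B.T @ (x - p))                              # approximately the zero vector
```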


Accordingly, we may rewrite the original problem as

$$\min_{\substack{m \in \mathbb{R}^d,\ B \in \mathbb{R}^{d \times k} \\ B^T B = I_k}} \sum_{i=1}^{n} \|x_i - m - BB^T (x_i - m)\|^2.$$

Using multivariable calculus, we can show that for any fixed $B$ an optimal $m$ is

$$m^* = \frac{1}{n} \sum_{i} x_i \overset{\mathrm{def}}{=} \bar{x}.$$

Plugging in $\bar{x}$ for $m$ and letting $\tilde{x}_i = x_i - \bar{x}$ gives

$$\min_{B} \sum_{i} \|\tilde{x}_i - BB^T \tilde{x}_i\|^2.$$

In matrix notation, this becomes

$$\min_{B} \|\tilde{X} - \tilde{X} BB^T\|_F^2, \quad \text{where } \tilde{X} = [\tilde{x}_1, \ldots, \tilde{x}_n]^T \in \mathbb{R}^{n \times d}.$$


Let the full SVD of the centered data matrix $\tilde{X}$ be

$$\tilde{X} = U \Sigma V^T.$$

Denote by $\tilde{X}_k$ the best rank-$k$ approximation of $\tilde{X}$:

$$\tilde{X}_k = U_k \Sigma_k V_k^T.$$

Since $\tilde{X} BB^T$ always has rank at most $k$, Theorem 0.6 implies that the minimum is attained when

$$\tilde{X} BB^T = \tilde{X}_k,$$

and a minimizer is the matrix consisting of the top $k$ right singular vectors of $\tilde{X}$, i.e.,

$$B = V_k \equiv V(:, 1{:}k).$$


Verify: If $B = V_k$, then

$$\tilde{X} BB^T = \tilde{X} V_k V_k^T = \tilde{X}[v_1, \ldots, v_k] V_k^T = [\sigma_1 u_1, \ldots, \sigma_k u_k] V_k^T = [u_1, \ldots, u_k] \,\mathrm{diag}(\sigma_1, \ldots, \sigma_k)\, V_k^T = U_k \Sigma_k V_k^T = \tilde{X}_k.$$
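
The same chain of equalities can be confirmed numerically; a short sketch (not from the slides) with synthetic centered data:

```python
import numpy as np

rng = np.random.default_rng(3)
Xc = rng.standard_normal((8, 5))
Xc -= Xc.mean(axis=0)               # centered data matrix X_tilde
k = 2

U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
Vk = Vt[:k, :].T                    # top-k right singular vectors

Xk = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]   # best rank-k approximation X_tilde_k
print(np.allclose(Xc @ Vk @ Vk.T, Xk))       # True
```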


Proof of $m^* = \bar{x}$:

First, rewrite the above objective function as

$$g(m) = \sum_{i=1}^{n} \|x_i - m - BB^T (x_i - m)\|^2 = \sum_{i=1}^{n} \|(I - BB^T)(x_i - m)\|^2$$

and apply the formula

$$\frac{\partial}{\partial x} \|Ax\|^2 = 2 A^T A x$$

to find its gradient:

$$\nabla g(m) = -\sum_{i} 2 (I - BB^T)^T (I - BB^T)(x_i - m).$$


Note that $I - BB^T$ is also an orthogonal projection matrix (onto the orthogonal complement of $S$). Thus,

$$(I - BB^T)^T (I - BB^T) = (I - BB^T)^2 = I - BB^T.$$

It follows that

$$\nabla g(m) = -\sum_{i} 2 (I - BB^T)(x_i - m) = -2 (I - BB^T) \Big( \sum_{i} x_i - n m \Big).$$

Any minimizer $m$ must satisfy

$$2 (I - BB^T) \Big( \sum_{i} x_i - n m \Big) = 0.$$

This equation has infinitely many solutions, but the simplest one is

$$\sum_{i} x_i - n m = 0 \;\Longrightarrow\; m = \frac{1}{n} \sum_{i} x_i.$$
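
A quick numerical sanity check (not from the slides) that the center minimizes $g$; the synthetic data and the basis B are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(4)
n, d, k = 50, 4, 2
X = rng.standard_normal((n, d))
B, _ = np.linalg.qr(rng.standard_normal((d, k)))  # orthonormal basis
P_perp = np.eye(d) - B @ B.T                      # projection onto the complement

def g(m):
    R = (X - m) @ P_perp   # rows are (I - BB^T)(x_i - m), since P_perp is symmetric
    return np.sum(R ** 2)

xbar = X.mean(axis=0)
# g at the center is never larger than g at perturbed points
print(g(xbar), min(g(xbar + 0.1 * rng.standard_normal(d)) for _ in range(5)))
```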
