
Closing Remarks

compiled by Alvin Wan from Professor Benjamin Recht's lecture

1 Empirical Risk Minimization


Risk is the expected loss of our prediction function or, in colloquial terms, a measure of how
well we'll predict:

$$R[f] = \mathbb{E}[\operatorname{loss}(f(x), y)]$$

Machine learning is only as good as your data, because data is the only access we have to the
underlying distribution. Samples expose the actual distribution, but only partially. Our empirical
risk, or training error, is the following:

$$R_T[f] = \frac{1}{n} \sum_{i=1}^{n} \operatorname{loss}(f(x_i), y_i)$$
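To make this concrete, here is a minimal sketch in Python with NumPy (the squared loss and the toy linear model are illustrative choices; the notes leave the loss generic) of measuring empirical risk:

```python
import numpy as np

def squared_loss(y_pred, y_true):
    # One common choice of loss; the notes leave loss(., .) generic.
    return (y_pred - y_true) ** 2

def empirical_risk(f, X, y, loss=squared_loss):
    """R_T[f]: average loss of prediction function f over the training set."""
    predictions = np.array([f(x) for x in X])
    return np.mean(loss(predictions, y))

# Toy data from a linear model with a little noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.1 * rng.normal(size=100)

f = lambda x: x @ w_true            # pretend this is our learned model
print(empirical_risk(f, X, y))      # small, since f matches the data
```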

We can measure and optimize this empirical risk. However, we can only guess how well this
estimator serves as a proxy for our actual risk. Let us consider the Fundamental Theorem
of Machine Learning:

$$R[f] = (R[f] - R_T[f]) + R_T[f]$$

where the last term $R_T[f]$ is the training error, and $R[f] - R_T[f]$ is our generalization error.
Only our training error is guaranteed to be observable. We can measure generalization only
by assuming that we have some holdout data that is representative of all possible data. In
effect, our assumption is the following:

$$R_V[f] - R_T[f] \approx R[f] - R_T[f]$$

If your data is i.i.d., $(x_1, y_1), \ldots, (x_n, y_n)$, then the scale of our error is roughly

$$R_T[f] - R[f] \approx O\left(\sqrt{\frac{d}{n}}\right)$$

If $d > n$, then we need to regularize.
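To see why regularization matters when $d > n$, here is a small sketch (Python/NumPy; ridge regression is one standard regularizer, used here for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
n, d = 20, 50                       # d > n: underdetermined problem
X = rng.normal(size=(n, d))
y = rng.normal(size=n)

# Ordinary least squares is ill-posed here: X^T X is d x d but has
# rank <= n < d, so it is singular and the solution is not unique.
lam = 0.1                           # regularization strength (illustrative value)
w_ridge = np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)
print(w_ridge.shape)                # (50,) -- a unique, well-defined solution
```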
It's much more important that

$$R[f] - R_V[f] \leq O\left(\sqrt{\frac{\log K}{M}}\right)$$

is small, where $K$ is the number of models used for cross validation and $M$ is the number of
validation samples. The ideal validation is run on newly-generated data, produced in
exactly the same way the old data was generated and independent of the training set.
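A minimal sketch of this holdout workflow (Python/NumPy; the split sizes, candidate models, and regularization strengths are all made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(300, 5))
w_true = rng.normal(size=5)
y = X @ w_true + 0.5 * rng.normal(size=300)

# Hold out a validation set V, independent of the training set T.
X_train, y_train = X[:200], y[:200]
X_val, y_val = X[200:], y[200:]

def ridge_fit(X, y, lam):
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

def mse(w, X, y):
    return np.mean((X @ w - y) ** 2)

# K candidate models (here, one per regularization strength).
lams = [0.01, 0.1, 1.0, 10.0]
models = [ridge_fit(X_train, y_train, lam) for lam in lams]

# R_V[f] proxies R[f]; pick the model with the lowest validation risk.
val_risks = [mse(w, X_val, y_val) for w in models]
best = int(np.argmin(val_risks))
print(f"best lambda = {lams[best]}, validation risk = {val_risks[best]:.3f}")
```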

2 Pipeline

Data → Features → Train Model

We have some rules of thumb for features:

- Text: bag of words, n-grams, histograms
- Vision: pixels, gradients, histograms of gradients, wavelets
- Medicine: age, gender, family history, blood tests

Here is a survey of methods:

- Linear predictors: $w^T x$
- Lifting: $x \mapsto [x_1, x_2, x_1 x_2, \ldots]^T$ (nonlinear)
- Kernel trick: $x \mapsto \Phi(x)$, where $\langle \Phi(x), \Phi(z) \rangle = k(x, z)$
- Neural networks: $f(x, \theta)$, optimized with respect to $\theta$
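As a sketch of the lifting and kernel ideas (Python/NumPy; the quadratic lift and polynomial kernel are standard textbook choices, not necessarily the lecture's exact examples):

```python
import numpy as np

def lift(x):
    # Explicit lift: original coordinates plus a cross term, as in the notes.
    x1, x2 = x
    return np.array([x1, x2, x1 * x2])

def poly_kernel(x, z, degree=2, c=1.0):
    # Kernel trick: k(x, z) equals <Phi(x), Phi(z)> for an implicit
    # polynomial feature map Phi that we never construct explicitly.
    return (x @ z + c) ** degree

x = np.array([1.0, 2.0])
z = np.array([0.5, -1.0])

# A linear predictor in the lifted space is a nonlinear predictor in x.
w = np.ones(3)
print(w @ lift(x))

# The kernel gives a lifted inner product without forming the features.
print(poly_kernel(x, z))
```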

Another way to build features is to use binning, histograms, or trees. With binning, we split the
data into ranges of values, such as $[a > 0, a < 0]$. Always compare against nearest neighbors.
In general, you should not be hyper-sensitive to continuous-valued features, e.g., the model
should not depend on the third or fourth decimal place. A model is stable (i.e., robust) if
$f_T \approx f_{T \cup \{(x_i, y_i)\}}$, that is, if adding a single training point barely changes
the learned model. One theorem states that if this stability property holds, we know
that generalization error is fairly low.
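A minimal binning sketch (Python/NumPy; the bin edges are arbitrary choices for illustration):

```python
import numpy as np

a = np.array([-3.2, -0.5, 0.1, 2.7, 5.0])

# Split a continuous feature into ranges, e.g. negative vs. non-negative,
# then one-hot encode the bin index as the new feature.
edges = np.array([0.0, 2.0])            # hypothetical bin boundaries
bins = np.digitize(a, edges)            # 0: a < 0, 1: 0 <= a < 2, 2: a >= 2
one_hot = np.eye(len(edges) + 1)[bins]  # binned feature representation
print(bins)                             # [0 0 1 2 2]
print(one_hot)
```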

3 Unsupervised Learning

We can either reduce dimensions (PCA) or cluster (k-means, agglomerative, spectral).
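For instance, a minimal PCA sketch (Python/NumPy; PCA via the SVD is a standard construction, shown here reducing to two dimensions):

```python
import numpy as np

rng = np.random.default_rng(4)
X = rng.normal(size=(200, 10))

# PCA via SVD: center the data, then project onto the top-k right
# singular vectors (the principal components).
k = 2
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
X_reduced = Xc @ Vt[:k].T
print(X_reduced.shape)   # (200, 2)
```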

Stochastic gradient descent is one algorithm that is well-suited for all the methods presented
in this course. $n$ should be fairly large; pick a learning rate just large enough that you
do not diverge, and decrease it as the algorithm runs. 200 epochs is a good upper bound,
beyond which overfitting becomes a concern.
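A minimal SGD sketch following this advice (Python/NumPy; the least-squares objective, initial step size, and decay schedule are illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(3)
n, d = 1000, 5                          # n fairly large, as advised
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

w = np.zeros(d)
lr0 = 0.01                              # just large enough not to diverge
for epoch in range(20):                 # well under the 200-epoch upper bound
    lr = lr0 / (1 + epoch)              # decrease the step size as we run
    for i in rng.permutation(n):
        grad = 2 * (X[i] @ w - y[i]) * X[i]   # gradient of one squared loss
        w -= lr * grad

print(np.linalg.norm(w - w_true))       # should be small
```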

After this course, you can take the following to further your knowledge in this domain:

- Optimization: EE127A, EE227C
- Probability: EE126, Stat 134, Stat 210A, Stat 210B
- Applications: NLP, Vision, Robotics

Learning theory teaches active learning and experiment design. Scalable machine learning
is one field to explore; safety, reliability, and robustness are others.
