Over Fit
Over Fit
y = f(x) + noise
Can we learn f from this data?
3 7 1 3
1 3 : :
: : x1=(3).. y1=7..
Z= 1 3 y=
7
1 1 3
: :
z1=(1,3).. y1=7..
zk=(1,xk)
3 7 1 3
1 3 : :
: : x1=(3).. y1=7..
Z= 1 3 y=
7
1 1 3
: : =(ZTZ)-1(ZTy)
z1=(1,3).. y1=7..
yest = 0+ 1 x
zk=(1,xk)
y y
x x
y y
x x
y
3. Train on the remaining R-1
datapoints
y
3. Train on the remaining R-1
datapoints
4. Note your error (xk,yk)
x
y
3. Train on the remaining R-1
datapoints
4. Note your error (xk,yk)
x
When you’ve done all points,
report the mean error.
x x x
x x x
x x x
K=1
Are we guaranteed that a local Sadly, no. And in fact, the
relationship can be very bumpy.
K=2 optimum of K vs LOOCV will be
K=3 the global optimum?
These involve
choosing the value of a
real-valued parameter.
What should we do?
Described in a future
Lecture