Memory-Based Learning: ENPM808F: Robot Learning Summer 2017
Robot Learning
Summer 2017
Lecture 2:
Memory-Based Learning
Course Outline
• Motor Learning and the Evolution of Intelligence
• Memory-Based Learning
• Behavior-Based Robotics
• Reinforcement Learning
• Value versus Policy Iteration
• Q-Learning and Actor-Critic Models
• Robot Shaping and Evolving Behaviors
• Crossing the Reality Gap
• Imitation and Learning from Demonstration
• Deep Reinforcement Learning with CNNs
• On-line and Lifelong Learning
Global vs. Local Learning
Global Learning
Advantages:
• Compact Representation
• Automatic Resource Allocation
• Generally Continuous and Differentiable Mappings
Disadvantages:
• Very Slow Convergence
• Unpredictable Local Minima (may not converge to global minimum)
• Computationally Expensive
• Generalization Not Easily Controllable
• Comparatively Poor Accuracy (on some problems)
Local Learning
Advantages:
• Rapid Convergence
• Computationally Inexpensive
• No Local Minima
• Convergence Guaranteed
• Very High Accuracy
Disadvantages:
• Memory Intensive
• Resource Allocation Not Automatic
• Continuity and Differentiability of Mapping More Difficult to Guarantee
Secant Approximation to Tangent
Continuous CMAC
Curse of Dimensionality
vs.
Blessing of Non-Uniformity*
* Pedro Domingos
Lazy Learning methods store all of the training data and use it only
when called with a new input vector (query) to perform a
mapping. They make no assumptions about the overall
shape of the global mapping before the query is presented.
Also referred to as Instance-based Learning methods.
Examples include: k-Nearest Neighbors, Locally Weighted Regression,
and Case-Based Reasoning
Lazy versus Eager Learning
(figure: a query point $x_q$ among the stored training exemplars)
k-Nearest Neighbor
(figure: the query point $x_q$ and its nearest training exemplars)
k-Nearest Neighbor
k = 6
$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}$
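As a minimal sketch of this averaging rule (the NumPy helper below and its names are illustrative, not from the slides):

```python
import numpy as np

def knn_predict(X_train, y_train, x_q, k=6):
    """Plain k-NN regression: average the stored target values of the k nearest exemplars.
    (For discrete-valued targets, a majority vote over the same neighbors would be used.)"""
    dists = np.linalg.norm(X_train - x_q, axis=1)  # Euclidean distance to every stored exemplar
    nearest = np.argsort(dists)[:k]                # indices of the k closest exemplars
    return np.mean(y_train[nearest])               # f_hat(x_q) = (1/k) * sum_i f(x_i)
```

All stored points are kept; the only work happens at query time, which is what makes the method lazy.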
k-Nearest Neighbor
Requires:
• Training exemplars and queries map to points in $\Re^n$
• Small input vectors
• Sufficient density of training data to cover areas of interest
Advantages:
• No information is lost
• Fast training
• Can model highly complex surfaces
Disadvantages:
• Additional computational complexity to answer queries
• Weights all input attributes equally
• Suffers from Curse of Dimensionality
k-Nearest Neighbor Algorithm
(Real-valued)
Neighbors are weighted based upon their distance from the query point $x_q$.
Define the distance $d(x_q, x_i)$ between $x_q$ and each exemplar $x_i$.
Define weights $w_i \equiv \frac{1}{d(x_q, x_i)^2}$
Then, given query $x_q$:
$\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i f(x_i)}{\sum_{i=1}^{k} w_i}$
Shepard’s Method
• If all training examples are used (rather than only the k nearest), the method is known as Shepard’s Method; since all points contribute to every prediction, it is a global learning algorithm.
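A sketch of the distance-weighted rule above; passing k=None uses every stored point, which corresponds to Shepard’s Method. The guard for a zero-distance exact match is my addition:

```python
import numpy as np

def weighted_knn_predict(X_train, y_train, x_q, k=None):
    """Distance-weighted k-NN regression with weights w_i = 1 / d(x_q, x_i)^2.
    k=None uses all exemplars (Shepard's Method)."""
    d = np.linalg.norm(X_train - x_q, axis=1)
    if np.any(d == 0):                       # query coincides with a stored exemplar
        return y_train[int(np.argmin(d))]
    idx = np.argsort(d)[:k] if k is not None else np.arange(len(d))
    w = 1.0 / d[idx] ** 2                    # closer exemplars get larger weights
    return float(np.dot(w, y_train[idx]) / np.sum(w))
```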
Nearest Neighbor vs
Locally Weighted Regression
k-Nearest Neighbor
(Discrete)
Distance Weighted
k-Nearest Neighbor
(Continuous)
Locally Weighted
Regression
Locally Weighted Regression
LWR is a Lazy Learning method in which an approximation $\hat{f}(x)$ is
formed around each query point $x_q$.
• It is Local since only the points near $x_q$ are used.
$\hat{f}(x) = w_0 + w_1 a_1(x) + \cdots + w_n a_n(x)$
Locally Weighted Linear Regression
To weight the influence of points based upon distance, define a kernel function, e.g.
$K(d) \equiv \frac{1}{d}$
$K(d) \equiv \frac{1}{d^2}$
$K(d) \equiv e^{-d^2}$
To minimize the weighted error across the entire training set:
$E(x_q) \equiv \frac{1}{2} \sum_{x \in D} \bigl(f(x) - \hat{f}(x)\bigr)^2 \, K\bigl(d(x_q, x)\bigr)$
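Because $\hat{f}$ is linear in the weights, this criterion can also be minimized directly by weighted least squares at each query; a minimal sketch, assuming the Gaussian-style kernel $K(d) = e^{-d^2}$ from above (the function and variable names are mine):

```python
import numpy as np

def lwr_predict(X, y, x_q):
    """Locally weighted linear regression: fit w around x_q by minimizing
    E(x_q) = 1/2 * sum_x K(d(x_q, x)) * (f(x) - f_hat(x))^2, then evaluate at x_q."""
    A = np.hstack([np.ones((len(X), 1)), X])             # attributes with a leading 1 for w_0
    K = np.exp(-np.linalg.norm(X - x_q, axis=1) ** 2)    # kernel weights K(d(x_q, x))
    s = np.sqrt(K)[:, None]                              # scale rows so lstsq minimizes the weighted error
    w, *_ = np.linalg.lstsq(s * A, s[:, 0] * y, rcond=None)
    return float(w[0] + np.dot(w[1:], x_q))
```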
Locally Weighted Linear Regression
$\Delta w_j = \eta \sum_{x \in D} K\bigl(d(x_q, x)\bigr)\,\bigl(f(x) - \hat{f}(x)\bigr)\, a_j(x)$
with learning rate $\eta$, for each attribute $a_j(x)$ of the input vector $x$.
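The same criterion can instead be minimized iteratively with the update above; a hedged sketch of a single gradient step (again assuming $K(d) = e^{-d^2}$ and illustrative names):

```python
import numpy as np

def lwr_gradient_step(X, y, x_q, w, eta=0.01):
    """One gradient-descent update of the local weights w (w[0] is the bias term) for query x_q:
    delta w_j = eta * sum_x K(d(x_q, x)) * (f(x) - f_hat(x)) * a_j(x)."""
    A = np.hstack([np.ones((len(X), 1)), X])             # a_0(x)=1, a_1(x)..a_n(x)
    K = np.exp(-np.linalg.norm(X - x_q, axis=1) ** 2)    # kernel weights
    err = y - A @ w                                      # f(x) - f_hat(x) for every exemplar
    return w + eta * (K * err) @ A                       # apply the weighted update
```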
Locally Weighted Nonlinear Regression
$a(x) = \bigl(a_1(x),\, a_2(x),\, \ldots,\, a_n(x)\bigr)$, where the attributes $a_j$ may themselves be nonlinear functions of the input.
Radial Basis Func=on Networks
$\hat{f}(x) = w_0 + \sum_{u=1}^{k} w_u \, K_u\bigl(d(x_u, x)\bigr)$
The kernel function $K_u$ is defined so that
it decreases with increasing distance $d(x_u, x)$.
A common choice for $K_u$ is the
Gaussian function
$K_u\bigl(d(x_u, x)\bigr) = e^{-\frac{1}{2\sigma_u^2} d^2(x_u, x)}$
where $\sigma_u^2$ denotes the variance of the Gaussian centered at $x_u$.
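A sketch of this prediction equation, assuming (for brevity) a single shared variance for all Gaussian kernels; the helper name and the shared-variance choice are mine, not from the slides:

```python
import numpy as np

def rbf_predict(centers, w, x, sigma2=1.0):
    """f_hat(x) = w[0] + sum_u w[u] * K_u(d(x_u, x)), with Gaussian kernels
    K_u = exp(-d^2(x_u, x) / (2 * sigma_u^2)); one shared sigma2 is assumed here."""
    d2 = np.sum((centers - x) ** 2, axis=1)   # squared distances d^2(x_u, x) to every center
    phi = np.exp(-d2 / (2.0 * sigma2))        # Gaussian kernel activations
    return float(w[0] + np.dot(w[1:], phi))
```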
RBF Network Training
Minimize either the global error over the training set
$E \equiv \frac{1}{2} \sum_{x \in D} \bigl(f(x) - \hat{f}(x)\bigr)^2$
or a localized error function, e.g.
$E(x_q) \equiv \frac{1}{2} \sum_{x \in D} \bigl(f(x) - \hat{f}(x)\bigr)^2 \, K\bigl(d(x_q, x)\bigr)$
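If the kernel centers and variances are held fixed (e.g., one Gaussian per training point or per cluster), minimizing the global error above over the output weights reduces to linear least squares; a sketch under that assumption:

```python
import numpy as np

def rbf_fit_weights(centers, X, y, sigma2=1.0):
    """Fit w_0..w_k by least squares, minimizing E = 1/2 * sum_x (f(x) - f_hat(x))^2
    with the kernel centers and a shared variance held fixed."""
    d2 = np.sum((X[:, None, :] - centers[None, :, :]) ** 2, axis=2)           # (N, k) squared distances
    Phi = np.hstack([np.ones((len(X), 1)), np.exp(-d2 / (2.0 * sigma2))])     # bias column + activations
    w, *_ = np.linalg.lstsq(Phi, y, rcond=None)
    return w
```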
Locally Weighted Learning
for Inverse Models
$u = \hat{f}^{-1}(x, y)$
A learned database implementing the inverse model.
Pros:
• The database is “trained” by adding new points $(x, u, y)$ (see the sketch after this list)
• If there is a monotonic relationship between $u$ and $y$, then there
are efficient methods for rapidly converging on the correct mapping
Cons:
• May not work if
Ø Vector space of actions and outcomes is not the same
Ø Mapping is not one-to-one
Ø Data include misleading noisy observations
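A sketch of such a database as a plain list of stored experiences: training appends a triple, and a query returns the stored action whose (state, outcome) pair lies closest to the current state and desired outcome. The class, its names, and the equal weighting of state and outcome distances are assumptions made for illustration:

```python
import numpy as np

class LazyInverseModel:
    """Memory of (x, u, y) triples, queried as u = f_inv(x, y_desired)."""
    def __init__(self):
        self.data = []                                   # stored (x, u, y) experiences

    def add(self, x, u, y):
        # "Training" is simply storing the observed experience.
        self.data.append((np.asarray(x), np.asarray(u), np.asarray(y)))

    def query(self, x, y_desired):
        # Nearest stored experience in the joint (state, outcome) space, equally weighted.
        costs = [np.linalg.norm(x - xi) + np.linalg.norm(y_desired - yi)
                 for xi, _, yi in self.data]
        return self.data[int(np.argmin(costs))][1]       # return the stored action u
```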
Locally Weighted Learning
for Forward Models
$y = \hat{f}(x, u)$
A learned database implementing the forward model.
Locally Weighted Learning
for Forward Models
Pros:
• The database is “trained” by adding new points $(x, u, y)$
• Allows “mental simulation,” or prediction of the effects of different actions
Cons:
• Requires a search of the database to find the action that corresponds
to the desired outcome for the current state (a minimal search sketch follows).
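A sketch of that search, treating the lazy forward model as a black-box predictor and scanning a set of candidate actions; the function name and the candidate-set idea are illustrative assumptions:

```python
import numpy as np

def search_actions(forward_model, x, y_desired, candidate_actions):
    """Pick the candidate action whose predicted outcome f_hat(x, u) is closest
    to the desired outcome for the current state x."""
    errors = [np.linalg.norm(y_desired - forward_model(x, u)) for u in candidate_actions]
    return candidate_actions[int(np.argmin(errors))]
```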
Combining Inverse and
Forward Models
An Inverse Model may be used to generate a good starting point
for a search of a Forward Model:
$u_0 = \hat{f}^{-1}(x, y)$
The action $u_0$ may then be used with a Lazy Forward Model:
$\hat{y} = \hat{f}(x, u_0)$
If $\hat{y}$ is close to the desired outcome $y$, then Newton’s Method may be used to
further refine $u$.
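A sketch of this combination for a scalar action and outcome: the inverse model supplies $u_0$, the forward model predicts $\hat{y}$, and Newton’s Method refines $u$ using a finite-difference estimate of the forward model’s slope. The step size, iteration count, and names are assumptions:

```python
import numpy as np

def refine_action(forward_model, x, u0, y_desired, steps=5, du=1e-3):
    """Newton's Method on g(u) = forward_model(x, u) - y_desired,
    starting from the inverse model's suggestion u0 (scalar u and y assumed)."""
    u = u0
    for _ in range(steps):
        g = forward_model(x, u) - y_desired
        slope = (forward_model(x, u + du) - forward_model(x, u - du)) / (2 * du)  # finite difference
        if abs(slope) < 1e-9:        # nearly flat: stop rather than divide by ~0
            break
        u = u - g / slope            # Newton update
    return u
```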
Locally Weighted Learning
for Robotic Control
• Atkeson, C. G., Moore, A. W., and Schaal, S., “Locally Weighted Learning for Control,”
Artificial Intelligence Review, 11(1-5): 75-113, 1997.
• Domingos, P., “A Few Useful Things to Know About Machine Learning,”
Communications of the ACM, 55(10): 78-87, 2012.
1) Program a Discrete CMAC and train it on a 1-D function (ref: Albus 1975, Fig. 5).
Explore the effect of overlap area on generalization and time to convergence.
2) Program a Continuous CMAC by allowing partial cell overlap, and modifying
the weight update rule accordingly. Compare the output of the Discrete CMAC
with that of the Continuous CMAC.
3) Discuss how you might use recurrent connections to train a CMAC to output
a desired trajectory without using time as an input (e.g., state only).