Robust Regression
John Fox
January 2002
1 M-Estimation
Linear least-squares estimates can behave badly when the error distribution is not normal, particularly when the errors are heavy-tailed. One remedy is to remove influential observations from the least-squares fit (see Chapter 6, Section 6.1, in the text). Another approach, termed robust regression, is to employ a fitting criterion that is not as vulnerable as least squares to unusual data.
The most common general method of robust regression is M-estimation, introduced by Huber (1964).1
Consider the linear model

$$y_i = \mathbf{x}_i'\boldsymbol{\beta} + \varepsilon_i$$

for the $i$th of $n$ observations, with residuals $e_i = y_i - \mathbf{x}_i'\mathbf{b}$ from the fitted coefficients $\mathbf{b}$. The general M-estimator minimizes the objective function

$$\sum_{i=1}^{n} \rho(e_i) = \sum_{i=1}^{n} \rho(y_i - \mathbf{x}_i'\mathbf{b})$$
where the function $\rho$ gives the contribution of each residual to the objective function. A reasonable $\rho$ should have the following properties:

$$\rho(e) \ge 0$$
$$\rho(0) = 0$$
$$\rho(e) = \rho(-e)$$
$$\rho(e_i) \ge \rho(e_{i'}) \quad \text{for } |e_i| > |e_{i'}|$$
For example, for least-squares estimation, $\rho(e_i) = e_i^2$.

Let $\psi = \rho'$ be the derivative of $\rho$. Differentiating the objective function with respect to the coefficients, $\mathbf{b}$, and setting the partial derivatives to 0, produces a system of $k + 1$ estimating equations for the coefficients:

$$\sum_{i=1}^{n} \psi(y_i - \mathbf{x}_i'\mathbf{b})\,\mathbf{x}_i' = \mathbf{0}$$
1 This class of estimators can be regarded as a generalization of maximum-likelihood estimation, hence the term "M-estimation." Huber's 1964 paper introduced M-estimation in the context of estimating the location (center) of a distribution; the method was later generalized to regression.
Define the weight function $w(e) = \psi(e)/e$, and let $w_i = w(e_i)$. Then the estimating equations may be written as

$$\sum_{i=1}^{n} w_i (y_i - \mathbf{x}_i'\mathbf{b})\,\mathbf{x}_i' = \mathbf{0}$$
Solving the estimating equations is a weighted least-squares problem, minimizing $\sum w_i^2 e_i^2$. The weights, however, depend upon the residuals, the residuals depend upon the estimated coefficients, and the estimated coefficients depend upon the weights. An iterative solution (called iteratively reweighted least-squares, IRLS) is therefore required:

1. Select initial estimates $\mathbf{b}^{(0)}$, such as the least-squares estimates.

2. At each iteration $t$, calculate residuals $e_i^{(t-1)}$ and associated weights $w_i^{(t-1)} = w\left[e_i^{(t-1)}\right]$ from the previous iteration.

3. Solve for new weighted-least-squares estimates

$$\mathbf{b}^{(t)} = \left[\mathbf{X}'\mathbf{W}^{(t-1)}\mathbf{X}\right]^{-1} \mathbf{X}'\mathbf{W}^{(t-1)}\mathbf{y}$$

where $\mathbf{X}$ is the model matrix, with $\mathbf{x}_i'$ as its $i$th row, and $\mathbf{W}^{(t-1)} = \mathrm{diag}\left\{w_i^{(t-1)}\right\}$ is the current weight matrix.

Steps 2 and 3 are repeated until the estimated coefficients converge. The asymptotic covariance matrix of $\mathbf{b}$ is

$$\mathcal{V}(\mathbf{b}) = \frac{E(\psi^2)}{[E(\psi')]^2}\,(\mathbf{X}'\mathbf{X})^{-1}$$

Using $\sum [\psi(e_i)]^2$ to estimate $E(\psi^2)$, and $\left[\sum \psi'(e_i)/n\right]^2$ to estimate $[E(\psi')]^2$, produces the estimated asymptotic covariance matrix, $\widehat{\mathcal{V}}(\mathbf{b})$ (which is not reliable in small samples).
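To make the IRLS algorithm concrete, here is a minimal sketch in R for the Huber estimator. The function name irls.huber, the convergence tolerance, and the use of the median absolute residual for scale are my illustrative choices, not code from the MASS library; in practice one would use rlm(), as below.

    # Minimal IRLS sketch for the Huber M-estimator (illustrative only;
    # use rlm() in the MASS library for serious work)
    irls.huber <- function(X, y, k = 1.345, max.it = 50, tol = 1e-8) {
        X <- cbind(1, X)                    # prepend a column of ones for the intercept
        b <- solve(t(X) %*% X, t(X) %*% y)  # step 1: least-squares start values
        for (it in 1:max.it) {
            e <- as.vector(y - X %*% b)     # step 2: residuals from the previous fit
            s <- median(abs(e))/0.6745      # robust scale estimate (MAR/0.6745)
            w <- ifelse(abs(e) <= k*s, 1, k*s/abs(e))           # Huber weights
            b.new <- solve(t(X) %*% (w * X), t(X) %*% (w * y))  # step 3: WLS estimates
            if (max(abs(b.new - b)) < tol) break  # repeat until coefficients converge
            b <- b.new
        }
        as.vector(b.new)
    }

For Duncan's regression (fit below), irls.huber(as.matrix(Duncan[, c("income", "education")]), Duncan$prestige) should closely reproduce the coefficients reported by rlm().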
1.1 Objective Functions
Figure 1 compares the objective functions, and the corresponding $\psi$ and weight functions, for three M-estimators: the familiar least-squares estimator; the Huber estimator; and the Tukey bisquare (or biweight) estimator. The objective and weight functions for the three estimators are also given in Table 1.

Both the least-squares and Huber objective functions increase without bound as the residual $e$ departs from 0, but the least-squares objective function increases more rapidly. In contrast, the bisquare objective function eventually levels off (for $|e| > k$). Least squares assigns equal weight to each observation; the weights for the Huber estimator decline when $|e| > k$; and the weights for the bisquare decline as soon as $e$ departs from 0, and are 0 for $|e| > k$.
The value $k$ for the Huber and bisquare estimators is called a tuning constant; smaller values of $k$ produce more resistance to outliers, but at the expense of lower efficiency when the errors are normally distributed. The tuning constant is generally picked to give reasonably high efficiency in the normal case; in particular, $k = 1.345\sigma$ for the Huber and $k = 4.685\sigma$ for the bisquare (where $\sigma$ is the standard deviation of the errors) produce 95-percent efficiency when the errors are normal, and still offer protection against outliers.
In an application, we need an estimate of the standard deviation of the errors to use these results. Usually a robust measure of spread is employed in preference to the standard deviation of the residuals. For example, a common approach is to take $\hat{\sigma} = \mathrm{MAR}/0.6745$, where MAR is the median absolute residual.
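In R this estimate is a one-liner; here mod stands for any previously fitted model object (an assumption for illustration):

    # Robust scale estimate: median absolute residual divided by 0.6745
    sigma.hat <- median(abs(residuals(mod)))/0.6745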
[Figure 1 appears here: nine panels of objective, $\psi$, and weight functions; only the caption is recoverable.]

Figure 1: Objective, $\psi$, and weight functions for the least-squares (top), Huber (middle), and bisquare (bottom) estimators. The tuning constants for these graphs are $k = 1.345$ for the Huber estimator and $k = 4.685$ for the bisquare. (One way to think about this scaling is that the standard deviation of the errors, $\sigma$, is taken as 1.)
Table 1: Objective function and weight function for least-squares, Huber, and bisquare estimators.

Least-squares:
$$\rho_{LS}(e) = e^2 \qquad\qquad w_{LS}(e) = 1$$

Huber:
$$\rho_H(e) = \begin{cases} \tfrac{1}{2}e^2 & \text{for } |e| \le k \\ k|e| - \tfrac{1}{2}k^2 & \text{for } |e| > k \end{cases} \qquad w_H(e) = \begin{cases} 1 & \text{for } |e| \le k \\ k/|e| & \text{for } |e| > k \end{cases}$$

Bisquare:
$$\rho_B(e) = \begin{cases} \dfrac{k^2}{6}\left\{1 - \left[1 - \left(\dfrac{e}{k}\right)^2\right]^3\right\} & \text{for } |e| \le k \\ k^2/6 & \text{for } |e| > k \end{cases} \qquad w_B(e) = \begin{cases} \left[1 - \left(\dfrac{e}{k}\right)^2\right]^2 & \text{for } |e| \le k \\ 0 & \text{for } |e| > k \end{cases}$$
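As a check on Table 1, the Huber and bisquare weight functions are easily coded in R; this sketch (the function names are mine) can be used with curve() to reproduce the general shape of the weight panels of Figure 1:

    # Weight functions from Table 1, vectorized in e (scale sigma taken as 1)
    w.huber <- function(e, k = 1.345) ifelse(abs(e) <= k, 1, k/abs(e))
    w.bisquare <- function(e, k = 4.685) ifelse(abs(e) <= k, (1 - (e/k)^2)^2, 0)

    curve(w.huber, from = -6, to = 6, ylab = "w(e)")      # Huber weight panel
    curve(w.bisquare, from = -6, to = 6, ylab = "w(e)")   # bisquare weight panel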
2 Bounded-Influence Regression

Under certain circumstances, M-estimators can be vulnerable to high-leverage observations. A key concept in assessing influence is the breakdown point of an estimator: The breakdown point is the fraction of "bad" data that the estimator can tolerate without being affected to an arbitrarily large extent. For example, in the context of estimating the center of a distribution, the mean has a breakdown point of 0, because even one bad observation can change the mean by an arbitrary amount; in contrast, the median has a breakdown point of 50 percent.
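A quick numerical illustration of this contrast, with an arbitrary small sample:

    x <- c(2, 3, 5, 7, 11)      # an arbitrary sample
    mean(c(x[-5], 1e6))         # one wild value drags the mean to 200003.4
    median(c(x[-5], 1e6))       # the median is still 5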
There are also regression estimators that have breakdown points of nearly 50 percent. One such bounded-influence estimator is least-trimmed squares (LTS) regression.
The residuals from the fitted regression model are

$$e_i = y_i - \mathbf{x}_i'\mathbf{b}$$

Ordering the squared residuals from smallest to largest, $(e^2)_{(1)}, (e^2)_{(2)}, \ldots, (e^2)_{(n)}$, the LTS estimator chooses the coefficients $\mathbf{b}$ to minimize the sum of the $m$ smallest squared residuals,

$$\sum_{i=1}^{m} (e^2)_{(i)}$$

where, typically, $m = \lfloor n/2 \rfloor + \lfloor (k+2)/2 \rfloor$ (i.e., a little more than half of the observations), and the "floor" brackets, $\lfloor\;\rfloor$, denote rounding down to the next smallest integer.
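A direct R translation of this criterion, for a trial coefficient vector b, might look as follows; the function name lts.crit is hypothetical, and actual fitting is done with ltsreg() in the lqs library (illustrated below):

    # Evaluate the LTS objective for trial coefficients b;
    # X is the model matrix (including the column of ones), y the response
    lts.crit <- function(b, X, y) {
        e2 <- sort(as.vector(y - X %*% b)^2)    # ordered squared residuals
        n <- length(y)
        k <- ncol(X) - 1                        # number of predictors
        m <- floor(n/2) + floor((k + 2)/2)      # a little more than half the data
        sum(e2[1:m])                            # sum of the m smallest squared residuals
    }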
While the LTS criterion is easily described, the mechanics of fitting the LTS estimator are complicated (see, for example, Rousseeuw and Leroy, 1987). Moreover, bounded-influence estimators can produce unreasonable results in certain circumstances (Stefanski, 1991), and there is no simple formula for coefficient standard errors.2
3 An Illustration: Duncan's Occupational-Prestige Regression

Duncan's occupational-prestige regression was introduced in Chapter 1 and described further in Chapter 6 on regression diagnostics. The least-squares regression of prestige on income and education produces the following results:
> library(car)    # for the Duncan data set
> data(Duncan)
> mod.ls <- lm(prestige ~ income + education, data=Duncan)
> summary(mod.ls)
Call:
lm(formula = prestige ~ income + education, data = Duncan)
Residuals:
    Min      1Q  Median      3Q     Max
-29.538  -6.417   0.655   6.605  34.641

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)  -6.0647     4.2719   -1.42     0.16
income        0.5987     0.1197    5.00  1.1e-05
education     0.5458     0.0983    5.56  1.7e-06

Residual standard error: 13.4 on 42 degrees of freedom
Multiple R-Squared: 0.828,     Adjusted R-squared: 0.82
F-statistic: 101 on 2 and 42 DF,  p-value: 1.11e-016
2 Statistical inference for the LTS estimator can easily be performed by bootstrapping, however. See the Appendix on
bootstrapping for an example.
Recall from the previous discussion of Duncan's data that two observations, ministers and railroad conductors, serve to decrease the income coefficient substantially and to increase the education coefficient, as we may verify by omitting these two observations from the regression:
> mod.ls.2 <- update(mod.ls, subset=-c(6,16))
> summary(mod.ls.2)
Call:
lm(formula = prestige ~ income + education, data = Duncan, subset = -c(6,
16))
Residuals:
   Min     1Q Median     3Q    Max
-28.61  -5.90   1.94   5.62  21.55

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)  -6.4090     3.6526   -1.75   0.0870
income        0.8674     0.1220    7.11  1.3e-08
education     0.3322     0.0987    3.36   0.0017

Residual standard error: 11.4 on 40 degrees of freedom
Multiple R-Squared: 0.876,     Adjusted R-squared: 0.87
F-statistic: 141 on 2 and 40 DF,  p-value: 0
Alternatively, let us compute the Huber M-estimator for Duncan's regression model, employing the rlm (robust linear model) function in the MASS library:
> library(MASS)
> mod.huber <- rlm(prestige ~ income + education, data=Duncan)
> summary(mod.huber)
Call: rlm.formula(formula = prestige ~ income + education, data = Duncan)
Residuals:
   Min     1Q Median     3Q    Max
-30.12  -6.89   1.29   4.59  38.60

Coefficients:
            Value  Std. Error t value
(Intercept) -7.111  3.881     -1.832
income       0.701  0.109      6.452
education    0.485  0.089      5.438

Residual standard error: 9.89 on 42 degrees of freedom

Correlation of Coefficients:
          (Intercept) income
income    -0.297
education -0.359      -0.725
The summary method for rlm objects prints the correlations among the coefficients; to suppress this output, specify correlation=FALSE. The Huber regression coefficients are between those produced by the least-squares fit to the full data set and by the least-squares fit eliminating the occupations minister and conductor.
[Figure 2 appears here: index plot of the Huber weights; the labeled points are minister, reporter, conductor, insurance.agent, contractor, machinist, store.clerk, mail.carrier, factory.owner, streetcar.motorman, coal.miner, and carpenter.]

Figure 2: Weights from the robust Huber estimator for the regression of prestige on income and education. Observations with weights less than 1 were identified interactively with the mouse.
It is instructive to extract and plot (in Figure 2) the final weights employed in the robust fit, identifying observations with weights less than 1 using the mouse:
> plot(mod.huber$w, ylab="Huber Weight")
> identify(1:45, mod.huber$w, rownames(Duncan))
[1] 6 9 16 17 18 22 23 24 25 28 32 33
Ministers and conductors are among the observations that receive the smallest weight.
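The smallest weights can also be listed directly, without interactive identification; attaching the occupation names to the weight vector is my addition:

    # Occupations ordered by increasing Huber weight
    wts <- mod.huber$w
    names(wts) <- rownames(Duncan)
    sort(wts)[1:10]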
Next, I employ rlm to compute the bisquare estimator for Duncan's regression. Start values for the IRLS procedure are potentially more critical for the bisquare estimator; specifying the argument method="MM" to rlm requests bisquare estimates with start values determined by a preliminary bounded-influence regression. To use this option, it is necessary first to attach the lqs library, which contains functions for bounded-influence regression:
> library(lqs)
> mod.bisq <- rlm(prestige ~ income + education, data=Duncan, method="MM")
> summary(mod.bisq, cor=F)
Call: rlm.formula(formula = prestige ~ income + education, data = Duncan,
method = "MM")
Residuals:
   Min     1Q Median     3Q    Max
-29.87  -6.63   1.44   4.47  42.40

Coefficients:
            Value  Std. Error t value
(Intercept) -7.389  3.908     -1.891
income       0.783  0.109      7.149
education    0.423  0.090      4.710
Residual standard error: 9.79 on 42 degrees of freedom
[Figure 3 appears here: index plot of the bisquare weights; the labeled points are minister, reporter, conductor, insurance.agent, contractor, and machinist.]

Figure 3: Weights from the robust bisquare estimator for the regression of prestige on income and education. Observations accorded relatively small weight were identified interactively with the mouse.
Compared to the Huber estimates, the bisquare estimate of the income coefficient is larger, and the estimate of the education coefficient is smaller. Figure 3 shows a graph of the weights from the bisquare fit, interactively identifying the observations with the smallest weights:
> plot(mod.bisq$w, ylab="Bisquare Weight")
> identify(1:45, mod.bisq$w, rownames(Duncan))
[1] 6 9 16 17 23 28
Finally, I use the ltsreg function in the lqs library to fit Duncan's model by LTS regression:3
> mod.lts <- ltsreg(prestige ~ income + education, data=Duncan)
> mod.lts
Call:
lqs.formula(formula = prestige ~ income + education, data = Duncan,
method = "lts")
Coefficients:
(Intercept)       income    education
     -7.015        0.804        0.432
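To see all of the estimates at once, the coefficients from the several fits can be collected into a single table; this assumes the model objects created above are still in the workspace:

    # Compare coefficients across the five fits
    rbind("LS" = coef(mod.ls),
          "LS omitting 6,16" = coef(mod.ls.2),
          "Huber" = coef(mod.huber),
          "bisquare" = coef(mod.bisq),
          "LTS" = coef(mod.lts))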
3 LTS regression is also the default method for the lqs function, which additionally can fit other bounded-influence estimators.

References

Huber, P. J. 1964. Robust Estimation of a Location Parameter. Annals of Mathematical Statistics 35:73-101.

Rousseeuw, P. J. & A. M. Leroy. 1987. Robust Regression and Outlier Detection. New York: Wiley.

Stefanski, L. A. 1991. A Note on High-Breakdown Estimators. Statistics and Probability Letters 11:353-358.