
by numerical differentiation. In our function logit.BI we use the simple numerical
routine num.deriv from the sn library. Hence, the updating of the beta coefficients
is essentially performed by the commands

> g.old <- g.fun(beta.old, X, y, offset, w.x, k1)


> J.old <- num.deriv(beta.old, "g.fun", X = X, y = y,
offset = offset, w.x = w.x, k1 = k1)
> beta.new <- beta.old - qr.solve(J.old, g.old)
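
Inside logit.BI these commands are wrapped in a simple iteration controlled by the arguments maxiter and mytol described below. A minimal sketch of such a loop (for illustration only, not the actual function body) is:

## Schematic Newton-type iteration around the updating step shown above
beta.old <- beta.in
for (it in 1:maxiter) {
  g.old <- g.fun(beta.old, X, y, offset, w.x, k1)
  J.old <- num.deriv(beta.old, "g.fun", X = X, y = y,
                     offset = offset, w.x = w.x, k1 = k1)
  beta.new <- beta.old - qr.solve(J.old, g.old)
  if (max(abs(beta.new - beta.old)) < mytol) break   # stop when the step is small
  beta.old <- beta.new
}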

The logit.BI function has the following usage

logit.BI(beta.in, X, y, k1, offset, w.x, maxiter, mytol)

and the various arguments are:

• beta.in: initial value for β;

• X, y: design matrix, response vector;

• k1: tuning constant, default is 1.2;

• offset: offset (default is a vector of 0s);

• w.x: x-based prior weights for Mallows estimation (default is a vector of 1s);

• maxiter, mytol: maximum number of iterations and tolerance for the algorithm.

The function returns a list with several components, including:

• coef: parameter estimates;

• se, V: estimated standard errors and asymptotic variance matrix for the regression
coefficients;

• weights: vector of weights on the residuals.
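
As an illustration, a Mallows-type fit starting from the MLE might be obtained as follows (the data frame mydata and the covariates x1 and x2 are hypothetical placeholders):

## Hypothetical example: Mallows-type fit with hat-matrix based x-weights,
## starting the algorithm from the maximum likelihood estimate
fit.ml  <- glm(y ~ x1 + x2, family = binomial, data = mydata)
X       <- model.matrix(fit.ml)
fit.rob <- logit.BI(fit.ml$coef, X, mydata$y, k1 = 1.2, w.x = sqrt(1 - hat(X)))
fit.rob$coef     # robust parameter estimates
fit.rob$se       # estimated standard errors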

4.2 The Bianco and Yohai estimator


An alternative method is given by the Bianco and Yohai estimator (Bianco and Yohai,
1996), defined as
\hat{\beta}_{BY} = \arg\min_{\beta} \sum_{i=1}^{n} \left\{ \rho_k\big(d(x_i^T \beta; y_i)\big) + C(x_i^T \beta) \right\}        (9)

where d(u; y) = −y log F(u) − (1 − y) log{1 − F(u)}, ρ_k is a bounded function and C(x_i^T β)
is a bias correction term. Bianco and Yohai (1996) proposed the following ρ function,
 2
x − x

if x ≤ k
ρk (x) = k 2k

 otherwise
2

but stressed that other choices are possible. Croux and Haesbroeck (2003) extend the
Bianco and Yohai estimator by including weights for downweighting high-leverage points,
thus defining a bounded-influence estimator
\hat{\beta}_{WBY} = \arg\min_{\beta} \sum_{i=1}^{n} w(x_i) \left\{ \rho_k\big(d(x_i^T \beta; y_i)\big) + C(x_i^T \beta) \right\} .        (10)
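
A direct R transcription of this ρ function (an illustrative sketch only; the constant k plays the role of the tuning constant const of the functions described below) is:

## Bianco-Yohai rho function: x - x^2/(2k) for x <= k, constant k/2 beyond k
rho.BY <- function(x, k = 0.5) ifelse(x <= k, x - x^2 / (2 * k), k / 2)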

Croux and Haesbroeck (2003) suggested a decreasing function of robust Mahalanobis distances for the weights
w(xi ), where the distances are computed using the Minimum Covariance Determinant
(MCD) estimator (see Rousseeuw and Leroy, 1987). More precisely, w(xi ) are obtained as
follows. The MCD method seeks h points whose covariance has minimum determinant,
and Croux and Haesbroeck (2003) suggest h = 3/4 n, giving a 25% breakdown point
estimator. The method is implemented by the function cov.rob (or cov.mcd, which is a
convenient wrapper) in the MASS library. If X is the design matrix, the robust estimates of
multivariate location and scale are obtained by

> hp <- floor(nrow(X) * 0.75) + 1


> mcdx <- cov.rob(X, quan = hp, method = "mcd")

Once the robust estimates of location and scale have been computed, we can obtain the
robust distances RD_i. Finally, the weights are defined as w(x_i) = W(RD_i), where W is
the weight function W(t) = I{t^2 ≤ χ^2_{p,0.975}}, with I_A denoting the indicator function of
the set A.

> rdx <- sqrt(mahalanobis(X, center = mcdx$center, cov = mcdx$cov))


> vc <- sqrt(qchisq(0.975, ncol(X)))
> wx <- as.numeric(rdx <= vc)

Notice that the same weights could also be used in (8). Croux and Haesbroeck (2003) have
implemented both the Bianco and Yohai estimator and their weighted version in some
public-domain S-PLUS functions, which can also be used in R with a few changes. The
two functions are BYlogreg and WBYlogreg, respectively, which we slightly modified in
order to deal with dummy covariates in the design matrix. The arguments of the function
BYlogreg are

• x0, y: design matrix (not including the intercept), response vector;

• x0cont: design matrix used for computing the weighted MLE (if initwml=T);

• initwml: logical value for selecting one of the two possible methods for computing
the initial value: if initwml=T a weighted MLE, otherwise the classical MLE;

• const: tuning constant, default is 0.5;

• kmax, maxhalf: maximum number of iterations and maximum number of step-halving
steps of the algorithm.

The function returns a list, including the components coef and sterror for parameter
estimates and standard errors. The function WBYlogreg has exactly the same arguments,
with the exception of initwml, as the weighted MLE is always used as the starting point.
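
For illustration, calls to the two functions might look as follows (x0, y, and xcont, the matrix of continuous covariates used for the weights, are placeholder objects):

## Illustrative calls: weighted MLE used as starting value
fit.by  <- BYlogreg(x0, y, x0cont = xcont, initwml = TRUE, const = 0.5)
fit.wby <- WBYlogreg(x0, y, xcont)
fit.by$coef       # parameter estimates
fit.by$sterror    # standard errors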

Example: Food Stamp Data
This is a classical example in robust statistics; see, for example, Künsch et al. (1989). The
food stamp data set, of sample size 150, has a binary response variable indicating participation
in the US Food Stamp Program. The covariates included in the model are tenancy
(Tenancy), supplemental income (SupInc), and log(monthly income + 1) (log(Inc+1)).
The data are contained in the file foodstamp.txt. Let us start the analysis by getting
the MLE.
> food <- read.table("foodstamp.txt", T)
> food.glm <- glm(y ~ Tenancy + SupInc + log(Inc+1), binomial, food)
> summary(food.glm)
....
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 0.9264 1.6229 0.571 0.56813
Tenancy -1.8502 0.5347 -3.460 0.00054
SupInc 0.8961 0.5009 1.789 0.07365
log(Inc + 1) -0.3328 0.2729 -1.219 0.22280

The only significant coefficient seems to be that of the variable Tenancy. A look at some
diagnostic plots is quite useful (see Figure 28).
> glm.diag.plots(food.glm)
It is clear that there is an observation with totally anomalous covariate values, which has
a very strong effect on the fit. In fact, observation 5 is the only one with a zero value for
the monthly income.
> food$Inc
[1] 271 287 714 521 0 518 458 1266 350 168 235 450 683 519
...
Any kind of robust method suitable for this data set must clearly bound the influence
of the design points. We get both the weights based on the hat matrix (used in their
examples by Cantoni and Ronchetti, 2001), and those based on the robust estimation of
location and scale. Following the suggestions in the code by Croux and Haesbroeck, in
the latter case we compute the weights using only the continuous covariate.
> X.food <- model.matrix(food.glm)
> w.hat.food <- sqrt(1 - hat(X.food))
> hp.food <- floor(nrow(X.food) * 0.75) + 1
> mcdx.food <- cov.rob(as.matrix(X.food[,-(1:3)]), quan = hp.food, method = "mcd")
> rdx.food <- sqrt(mahalanobis(as.matrix(X.food[,-(1:3)]),
center = mcdx.food$center, cov = mcdx.food$cov))
> vc.food <- sqrt(qchisq(0.975, 1))
> w.rob.food <- as.numeric(rdx.food <= vc.food)
The two sets of weights present some differences, with the hat-matrix-based ones providing
a milder degree of downweighting:

Figure 28: Food stamp data: diagnostic plots for maximum likelihood estimates (residuals vs. linear predictor, ordered deviance residuals vs. normal quantiles, Cook statistics vs. h/(1−h), and Cook statistics by case)

> mean(w.rob.food)
[1] 0.94
> mean(w.hat.food)
[1] 0.9864798

However, both types of weight reach their minimum value at observation 5. We now
compute the robust estimates β̂M , including a Huber-type version without any prior x-
weights. We start the algorithm from the MLE and, following Cantoni and Ronchetti
(2001), we set k = 1.2.

> food.hub <- logit.BI(food.glm$coef, X.food, food$y, 1.2)


> food.mal <- logit.BI(food.glm$coef, X.food, food$y, 1.2, w.x = w.hat.food)
> food.mal.wrd <- logit.BI(food.glm$coef, X.food, food$y, 1.2, w.x = w.rob.food)
> tab.coef <- (cbind(food.glm$coef, food.hub$coef, food.mal$coef, food.mal.wrd$coef))
> colnames(tab.coef)<- c("MLE", "HUB", "MAL-HAT", "MAL-ROB")
> print(tab.coef, digits = 3)
MLE HUB MAL-HAT MAL-ROB
(Intercept) 0.926 0.710 6.687 8.065
Tenancy -1.850 -1.778 -1.855 -1.784
SupInc 0.896 0.802 0.606 0.586
log(Inc + 1) -0.333 -0.287 -1.298 -1.540

The standard errors are as follows.
> tab.se <- cbind(sqrt(diag(vcov(food.glm))), food.hub$se, food.mal$se,
food.mal.wrd$se)
> colnames(tab.se) <- c("MLE", "HUB", "MAL-HAT", "MAL-ROB")
> print(tab.se, digits = 3)
MLE HUB MAL-HAT MAL-ROB
(Intercept) 1.623 1.636 3.039 3.328
Tenancy 0.535 0.527 0.581 0.588
SupInc 0.501 0.516 0.553 0.561
log(Inc + 1) 0.273 0.276 0.519 0.572
We notice that the Mallows estimates are quite different from both the MLE and the
Huber estimates. The estimated weights on the residuals, ψ_c(r_i)/r_i, where r_i is the Pearson
residual, provide an explanation for this fact.
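
Here ψ_c is assumed to be the standard Huber function, so the weight attached to each residual is min(1, c/|r|); a one-line sketch with c = 1.2, as used above, is:

## Weight on a Pearson residual r for the Huber psi function with tuning constant c:
## psi_c(r)/r equals 1 if |r| <= c, and c/|r| otherwise
w.huber <- function(r, c = 1.2) pmin(1, c / abs(r))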
> wei.food <- cbind(food.hub$weights, food.mal$weights, food.mal.wrd$weights)
> cond <- apply(wei.food, 1, "<", 1)
> cond <- apply(cond, 2, sum)
> wei.food[(cond>0),]
[,1] [,2] [,3]
5 0.8412010 0.04237215 0.02127919
22 0.4953700 0.60685048 0.65761723
25 1.0000000 0.97042523 0.93395722
26 0.8020679 1.00000000 1.00000000
40 0.6464152 0.41587812 0.36433034
51 1.0000000 0.96380099 0.92400623
52 0.8144845 1.00000000 1.00000000
59 1.0000000 0.97674194 0.94117456
66 0.2543750 0.13502255 0.11816794
79 0.6857541 0.54321273 0.50018522
94 0.7980593 1.00000000 1.00000000
95 0.6679639 0.48234377 0.43440327
103 0.4653931 0.45762357 0.47048220
107 0.8014854 1.00000000 1.00000000
109 0.9518519 0.52815274 0.45262133
120 0.4756079 0.50482509 0.52859808
137 0.2884602 0.23841016 0.23198605
141 0.7969428 1.00000000 1.00000000
147 0.3144637 0.35221333 0.36859260
150 0.7920675 1.00000000 1.00000000
The weights based on the Huber-type estimates are quite different from the Mallows-
type ones. In particular, Huber-type regression does not downweight observation 5
enough. Things are different if we choose another starting point for the algorithm. A sensible
choice could be to start from a weighted MLE, obtained by selecting only the observations
for which the robust distance weights W (RDi ) are equal to 1.
> food.glm.wml <- glm(y ~ Tenancy + SupInc + log(Inc+1), binomial, food,
                      subset = (w.rob.food==1))
> summary(food.glm.wml)
....
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 5.6408 2.7665 2.039 0.041452
Tenancy -1.7749 0.5352 -3.317 0.000911
SupInc 0.6491 0.5181 1.253 0.210238
log(Inc + 1) -1.1123 0.4685 -2.374 0.017589

If we restart the algorithm from the weighted MLE, the Huber-type estimates are
close to the Mallows ones, and the weight given to observation 5 is now much smaller.

> food.hub.wml <- logit.BI(food.glm.wml$coef, X.food, food$y, 1.2)


> food.hub.wml$coef
(Intercept) Tenancy SupInc log(Inc + 1)
6.531852 -1.847610 0.607605 -1.270387
> food.hub.wml$weights[5]
5
0.04579391
A similar result is obtained with the Bianco and Yohai estimator. If we call BYlogreg
with the argument initwml = F, the MLE is used as the starting point.
> food.BY<- BYlogreg(X.food[,-1], food$y, initwml = F)
> food.BY
....
$coef
(Intercept) x0Tenancy x0SupInc x0log(Inc + 1)
[1,] 0.8814444 -1.768291 0.8456321 -0.3218968
$sterror
Tenancy SupInc log(Inc + 1)
5.7102449 0.5785001 0.6279328 0.9294853
The results are not much different from the MLE. A better option is to use the weighted
MLE as the starting point. This requires, as a further argument, the matrix used to compute
the weights.

> food.BY.wml <- BYlogreg(X.food[,-1], food$y, as.matrix(X.food[,-(1:3)]))


> food.BY.wml
....
$coef
(Intercept) x0Tenancy x0SupInc x0log(Inc + 1)
[1,] 5.369783 -1.691211 0.6173553 -1.070233
$sterror
Tenancy SupInc log(Inc + 1)
7.0927687 0.5391049 0.5240751 1.2190520
The results are similar to the weighted version of the Bianco and Yohai estimator.

> food.WBY <- WBYlogreg(X.food[,-1], food$y, as.matrix(X.food[,-(1:3)]) )


> food.WBY

....
$coef
(Intercept) subx01 subx02 subx03
[1,] 5.824949 -1.832782 0.6703187 -1.148582
$sterror
[1] 3.3923786 0.5822091 0.5220386 0.5906120

In order to compare all the various estimates, we follow the approach of Kordzakhia
et al. (2001), who proposed a goodness-of-fit discrepancy, the chi-square statistic based
on the arcsine transformation
X^2_{arc} = 4 \sum_{i=1}^{n} \left( \arcsin\sqrt{y_i} - \arcsin\sqrt{\hat{\pi}_i} \right)^2 ,

where π̂i are the fitted probabilities. The values of the statistic for the various estimates
show again the importance of using x-weights for this data set.

> X2.arc <- function(y, mu) 4 * sum((asin(sqrt(y)) - asin(sqrt(mu)))^2)


> X2.arc(food$y, plogis(X.food %*% food.glm$coef))
[1] 173.5109
> X2.arc(food$y, plogis(X.food %*% food.glm.wml$coef))
[1] 172.3801
> X2.arc(food$y, plogis(X.food %*% food.hub$coef))
[1] 175.4812
> X2.arc(food$y, plogis(X.food %*% food.hub.wml$coef))
[1] 170.2295
> X2.arc(food$y, plogis(X.food %*% food.mal$coef))
[1] 170.0075
> X2.arc(food$y, plogis(X.food %*% food.mal.wrd$coef))
[1] 169.8280
> X2.arc(food$y, plogis(X.food %*% t(food.BY$coef)))
[1] 174.8348
> X2.arc(food$y, plogis(X.food %*% t(food.BY.wml$coef)))
[1] 173.0532
> X2.arc(food$y, plogis(X.food %*% t(food.WBY$coef)))
[1] 171.0866

Finally, the S-PLUS code of Cantoni (2004) gives the following results for the Mallows
estimator using the x-weights (1 − h_{ii})^{1/2} (here we used our own port of the code):

> food.can <- glm.rob(X.food[,-1], food$y, chuber = 1.2,


weights.on.x = T, ni = rep(1,nrow(X.food)))
> food.can$coef
[1] 6.6870043 -1.8551298 0.6061823 -1.2975844
> food.can$sd
Tenancy SupInc log(Inc + 1)
3.0756946 0.5946090 0.5592972 0.5264943

The coefficients and standard errors are essentially the same as those obtained with our
function and stored in food.mal. The code by Cantoni (2004), however, is more reliable
and has a broader range of functions, including some functions for testing based on quasi-
deviances (Cantoni and Ronchetti, 2004).

Example: Vaso-constriction Data


We consider an example from Finney (1947), already analysed in Künsch et al. (1989).
These data consist of 39 observations on three variables: the occurrence of vaso-constriction
in the skin of the digits, and the rate and volume of air inspired. The model considered
by Künsch et al. (1989) regresses the occurrence of vaso-constriction on the logarithm of
air rate and volume. The data are in the file vaso.txt.
> vaso <- read.table("vaso.txt", T)
> vaso$lVol <- log(vaso$Vol)
> vaso$lRate <- log(vaso$Rate)
> vaso$Resp <- 1 - (as.numeric(vaso$Resp) - 1)
> vaso$y <- vaso$Resp
A plot of the data shows some differences in the covariates between the two groups of patients
with different response values.
> plot(vaso$Vol,vaso$Rate,type = "n", xlab = "Volume", ylab = "Rate")
> points(vaso$Vol[vaso$y==0], vaso$Rate[vaso$y==0], col = 1, pch = 16)
> points(vaso$Vol[vaso$y==1], vaso$Rate[vaso$y==1], col = 2, pch = 16)
> legend(2.5, 3.0, c("y=0 ", "y=1 "), fill = c(1, 2), text.col = c("black", "red"))
Standard diagnostic plots based on the maximum likelihood fit show that there are two quite
influential observations (4 and 18):
> vaso.glm <- glm(Resp ~ lVol + lRate,family = binomial, data = vaso)
> summary(vaso.glm)
....
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -2.924 1.288 -2.270 0.02318
lVol 5.220 1.858 2.810 0.00496
lRate 4.631 1.789 2.589 0.00964
> glm.diag.plots(vaso.glm)
If we re-estimate the model after removing these two observations, we can see that
their effect on the fitted model is huge.
> vaso.glm.w418 <- update(vaso.glm, data = vaso[-c(4,18),])
> summary(vaso.glm.w418)
....
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) -24.58 14.02 -1.753 0.0796
lVol 39.55 23.25 1.701 0.0889

lRate          31.94      17.76   1.798   0.0721

Figure 29: Vaso-constriction data: covariate scatterplot (Rate against Volume, points marked by response value; observations 4 and 18 labelled)

There is a dramatic increase in both coefficient values and standard errors. In fact,
without the two observations we are in a situation of quasi-complete separation (Albert
and Anderson, 1984), with little overlap between observations with y_i = 0 and y_i = 1.
The model is therefore nearly unidentifiable. This is readily confirmed by Mallows (or Huber)
estimation, which assigns low weights to observations 4 and 18, and provides results
similar to those obtained with the MLE after removing the influential points.
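
As an illustrative check (not part of the original analysis), one can look at the sign of the linear predictor from the fit without observations 4 and 18; the two response groups are expected to show very little overlap.

## Rough check of quasi-complete separation after dropping observations 4 and 18
eta.w418 <- predict(vaso.glm.w418)          # linear predictor for the reduced data set
table(vaso$Resp[-c(4, 18)], eta.w418 > 0)   # expected to show very little overlap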
> X.vaso <- model.matrix(vaso.glm)
> vaso.mal <- logit.BI(vaso.glm$coef, X.vaso, vaso$y, 1.2, sqrt(1 - hat(X.vaso)))
> cbind(vaso.mal$coef, vaso.mal$se)
[,1] [,2]
(Intercept) -22.01822 22.83438
lVol 36.10633 40.69736
lRate 28.72225 30.03298
> vaso.mal$weights[c(4,18)]
4 18
3.726611e-05 1.544562e-04
The same happens with both versions of the Bianco and Yohai estimator. Once again,
the near-indeterminacy is reflected in large increases of the coefficients and standard errors.
Figure 30: Vaso-constriction data: diagnostic plots for maximum likelihood estimates (residuals vs. linear predictor, ordered deviance residuals, and Cook statistics)

> vaso.WBY <- WBYlogreg(X.vaso[,-1], vaso$y)
> vaso.WBY

....
$coef
(Intercept) subx01 subx02
[1,] -6.859868 10.74855 9.3733
$sterror
[1] 10.07252 15.34863 12.80866
Notice, however, that alternative methods may give different results, as in the case of
the OBRE (see Künsch et al., 1989).

References
Agostinelli, C. (2001), Wle: A package for robust statistics using weighted likelihood. R
News, 1/3, 32–38.

Agostinelli, C., Markatou, M. (1998), A one-step robust estimator for regression based
on the weighted likelihood reweighting scheme, Statistics & Probability Letters, 37,
341–350.

Albert, A., Anderson, J. A. (1984), On the existence of maximum likelihood estimates
in logistic regression models, Biometrika, 71, 1–10.

Becker, R.A., Chambers, J.M., Wilks, A.R. (1988), The New S Language, Wadsworth
and Brooks/Cole, Pacific Grove.

Belsley, D. A., Kuh, E., Welsch, R. E. (1980), Regression Diagnostics, Wiley.

Bianco, A.M., Yohai, V.J. (1996), Robust estimation in the logistic regression model.
In: Rieder, H. (Ed.), Robust Statistics, Data Analysis, and Computer Intensive
Methods, Springer, pp. 17–34.

Cantoni, E. (2004), Analysis of robust quasi-deviance for generalized linear models, Jour-
nal of Statistical Software, 10, 1–9.

Cantoni, E., Ronchetti, E. (2001), Efficient bounded-influence regression estimation,
Journal of the American Statistical Association, 96, 1022–1030.

Cushny, A.R., Peebles, A.R. (1905), The action of optical isomers. II. Hyoscines, J.
Physiol., 32, 501–510.

Croux, C., Haesbroeck, G. (2003), Implementing the Bianco and Yohai estimator for
logistic regression, Computational Statistics and Data Analysis, 44, 273–295.

Finney, D. J. (1947), The estimation from individual records of the relationship between
dose and quantal response, Biometrika, 34, 320–334.

Hampel, F.R., Ronchetti, E.M., Rousseeuw, P.J., Stahel, W.A. (1986), Robust Statistics:
The Approach Based on Influence Functions, Wiley.

Hawkins, D.M., Bradu, D., Kass, G.V. (1984), Location of several outliers in multiple
regression data using elemental sets, Technometrics, 26, 197–208.

Huber, P. J. (1981), Robust Statistics, Wiley.

Jørgensen, B. (1984), The delta algorithm and GLIM, International Statistical Review,
52, 283–300.

Kordzakhia, N., Mishra, G.D., Reiersølmoen, L. (2001), Robust estimation in the logistic
model, Journal of Statistical Planning and Inference, 98, 211–223.

Krasker, W. S., Welsch, R. E. (1982), Efficient bounded-influence regression estimation,
Journal of the American Statistical Association, 77, 595–604.

Künsch, H.R., Stefanski, L.A., Carroll, R.J. (1989), Conditionally unbiased bounded-
influence estimation in general regression models, with application to generalized
linear models, Journal of the American Statistical Association, 84, 460–466.

Li, G. (1985), Robust regression, in Exploring Data Tables, Trends, and Shapes, eds.
Hoaglin and Tukey, pp. 281–343, Wiley.

Marazzi, A. (1993), Algorithms, Routines, and S Functions for Robust Statistics, Wadsworth
and Brooks/Cole, Pacific Grove.

Markatou, M., Basu, A., Lindsay, B.G. (1998), Weighted likelihood equations with boot-
strap root search, Journal of the American Statistical Association, 93, 740–750.

McKean, J.W., Sheather, S.J., Hettmansperger, T.P. (1993), The use and interpretation
of residuals based on robust estimation, Journal of the American Statistical Association,
88, 1254–1263.

McNeil, D. R. (1977), Interactive Data Analysis, Wiley.

Rousseeuw, P.J., Leroy, A.M. (1987), Robust Regression and Outlier Detection, Wiley.

Staudte, R.G., Sheather, S.J. (1990), Robust Estimation and Testing, Wiley.

Street, J.O., Carroll, R.J., Ruppert, D. (1988), A note on computing robust regression
estimates via iteratively reweighted least squares, American Statistician, 42, 152–154.

Venables, W. N., Ripley, B. D. (2002), Modern Applied Statistics with S. Fourth edition.
Springer.


