Financial Risk Models in R:

Factor Models for Asset Returns

and Interest Rate Models
Scottish Financial Risk Academy,
March 15, 2011
Eric Zivot
Robert Richards Chaired Professor of Economics
Adjunct Professor, Departments of Applied Mathematics,
Finance and Statistics
University of Washington
BlackRock Alternative Advisors, Seattle WA

Workshop Overview
• About Me
• Brief Introduction to R in Finance
• Factor Models for Asset Returns
• Estimation of Factor Models in R
• Factor Model Risk Analysis
• Factor Model Risk Analysis in R
• Modeling Interest Rates in R (brief discussion)

© Eric Zivot 2011

About Me
• Robert Richards Chaired Professor of
Economics at the University of Washington
– Adjunct Professor of Applied Mathematics,
Finance, and Statistics
• Co-Director of MS Program in Computational
Finance and Risk Management at UW
• BS in Economics and Statistics from UC
Berkeley
• PhD in Economics from Yale University

About Me: R and Finance

• 12 years programming in S language
• 8 years research scientist and consultant for
Mathsoft/Insightful (makers of SPLUS)
• Co-developed S+FinMetrics for Insightful
• Co-authored Modeling Financial Time Series
with SPLUS, Springer
• 2 ½ years developing FoHF factor model
based risk management system in R for
BlackRock Alternative Advisors

Brief Introduction to R in Finance
• R is a language and environment for statistical computing and
graphics
• R is based on the S language originally developed by John
Chambers and colleagues at AT&T Bell Labs in the late 1970s and
early 1980s
• R (sometimes called "GNU S") is free open source software
licensed under the GNU general public license (GPL 2)
• R development was initiated by Robert Gentleman and Ross Ihaka
at the University of Auckland, New Zealand
• R is formally known as The R Project for Statistical Computing
• www.r-project.org

What is R great at?

• Data analysis

• Data Manipulation

• Data Visualization

• Statistical Modeling and

Programming

S Language Implementations
• R is the most recent and
full-featured implementation
of the S language
• Original S - AT&T Bell
Labs
• S-PLUS (S plus a GUI)
– Statistical Sciences, Inc.,
Mathsoft, Inc., Insightful,
Inc., Tibco, Inc.
• R - The R Project for
Statistical Computing

R Timeline

Recognition for Software Excellence

The R Foundation
• The R Foundation is the non-profit organization
located in Vienna, Austria which is responsible
for developing and maintaining R
– Hold and administer the copyright of R software and
documentation
– Support continued development of R
– Organize meetings and conferences related to
statistical computing

R Homepage
• http://www.r-project.org
• List of CRAN mirror
sites
• Manuals
• FAQs
• Mailing Lists
• Links

CRAN – Comprehensive R
Archive Network
• http://cran.fhcrc.org
• CRAN Mirrors
– About 75 sites
worldwide
– About 16 sites in US
• R Binaries
• R Packages
• R Sources
• Task Views

CRAN Task Views
• Organizes 2600+ R
packages by application
• Relevant tasks for
financial applications:
– Finance
– Time Series
– Econometrics
– Optimization
– Machine Learning

R-Sig-Finance
• https://stat.ethz.ch/mailman/listinfo/r-sig-finance
• Nerve center of the R
finance community
• Daily must read
• Exclusively for
Finance-specific
questions, not general R
questions

Other Useful R Sites
• R Seek: R-specific search site
– http://www.rseek.org/
• R Bloggers: aggregation of about 100 R blogs
– http://www.r-bloggers.com
• Stack Overflow: excellent developer Q&A forum
– http://stackoverflow.com
• R Graph Gallery: examples of many possible R graphs
– http://addictedtor.free.fr/graphiques
• Blog from David Smith of Revolution
– http://blog.revolutionanalytics.com
• Inside-R: R community site by Revolution Analytics
– http://www.inside-r.org

Estimation of Factor Models in R

• Data for examples
• Estimation of macroeconomic factor model
– Sharpe’s single index model
• Estimation of fundamental factor model
– BARRA-type industry model
• Estimation of statistical factor model
– Principal components

Set Options and Load Packages
# set output options
> options(width = 70, digits=4)

# load required packages

> library(ellipse) # functions plotting
# correlation matrices
> library(fEcofin) # various economic and
# financial data sets
> library(PerformanceAnalytics) # performance and risk
# analysis functions
> library(zoo) # time series objects
# and utility functions

Berndt Data
# load Berndt investment data from fEcofin package
> data(berndtInvest)
> class(berndtInvest)
[1] "data.frame"

> colnames(berndtInvest)
[1] "X.Y..m..d" "CITCRP" "CONED" "CONTIL"
[5] "DATGEN" "DEC" "DELTA" "GENMIL"
[9] "GERBER" "IBM" "MARKET" "MOBIL"
[13] "PANAM" "PSNH" "TANDY" "TEXACO"
[17] "WEYER" "RKFREE"
# create data frame with dates as rownames
> berndt.df = berndtInvest[, -1]
> rownames(berndt.df) = as.character(berndtInvest[, 1])

Berndt Data
> head(berndt.df, n=3)
CITCRP CONED CONTIL DATGEN DEC
1978-01-01 -0.115 -0.079 -0.129 -0.084 -0.100
1978-02-01 -0.019 -0.003 0.037 -0.097 -0.063
1978-03-01 0.059 0.022 0.003 0.063 0.010

> tail(berndt.df, n=3)

CITCRP CONED CONTIL DATGEN DEC
1987-10-01 -0.282 -0.017 -0.372 -0.342 -0.281
1987-11-01 -0.136 -0.012 -0.148 -0.075 -0.127
1987-12-01 0.064 -0.006 0.050 0.181 0.134

Sharpe’s Single Index Model

> returns.mat = as.matrix(berndt.df[, c(-10, -17)])
> market.mat = as.matrix(berndt.df[,10, drop=F])
> n.obs = nrow(returns.mat)
> X.mat = cbind(rep(1,n.obs),market.mat)
> colnames(X.mat)[1] = "intercept"
> XX.mat = crossprod(X.mat)

# multivariate least squares

> G.hat = solve(XX.mat)%*%crossprod(X.mat,returns.mat)
> beta.hat = G.hat[2,]
> E.hat = returns.mat - X.mat%*%G.hat
> diagD.hat = diag(crossprod(E.hat)/(n.obs-2))
# compute R2 values from multivariate regression
> sumSquares = apply(returns.mat, 2,
+ function(x) {sum( (x - mean(x))^2 )})
> R.square = 1 - (n.obs-2)*diagD.hat/sumSquares
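A rough cross-check of the matrix formulas above: the slope row of the multivariate least squares solution should match asset-by-asset lm() fits. The sketch below uses simulated returns instead of the Berndt data, and all object names (returns.sim, X.sim, G.sim, beta.mls, beta.lm) are illustrative assumptions rather than objects from the workshop.

```r
# Simulated data only: 120 months, 3 hypothetical assets, one market factor
set.seed(1)
n.obs <- 120
market <- rnorm(n.obs, mean = 0.01, sd = 0.05)
true.beta <- c(A = 0.5, B = 1.0, C = 1.5)
returns.sim <- sapply(true.beta,
                      function(b) b * market + rnorm(n.obs, sd = 0.05))

# multivariate least squares: one solve for all assets at once
X.sim <- cbind(intercept = 1, MARKET = market)
G.sim <- solve(crossprod(X.sim), crossprod(X.sim, returns.sim))
beta.mls <- G.sim[2, ]

# asset-by-asset regressions with lm()
beta.lm <- apply(returns.sim, 2, function(y) coef(lm(y ~ market))[2])

# the two sets of slope estimates should coincide
all.equal(unname(beta.mls), unname(beta.lm))
```

Because every asset shares the same regressor matrix, a single matrix solve is equivalent to running the regressions one at a time.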

Estimation Results
> cbind(beta.hat, diagD.hat, R.square)
beta.hat diagD.hat R.square
CITCRP 0.66778 0.004511 0.31777
CONED 0.09102 0.002510 0.01532
CONTIL 0.73836 0.020334 0.11216
DATGEN 1.02816 0.011423 0.30363
DEC 0.84305 0.006564 0.33783
DELTA 0.48946 0.008152 0.12163
GENMIL 0.26776 0.003928 0.07919
GERBER 0.62481 0.005924 0.23694
IBM 0.45302 0.002546 0.27523
MOBIL 0.71352 0.004105 0.36882
PANAM 0.73014 0.015008 0.14337
PSNH 0.21263 0.011872 0.01763
TANDY 1.05549 0.011162 0.31986
TEXACO 0.61328 0.004634 0.27661
WEYER 0.81687 0.004154 0.43083

> par(mfrow=c(1,2))
> barplot(beta.hat, horiz=T, main="Beta values", col="blue",
+ cex.names = 0.75, las=1)
> barplot(R.square, horiz=T, main="R-square values", col="blue",
+ cex.names = 0.75, las=1)
> par(mfrow=c(1,1))
[Figure: horizontal bar plots of "Beta values" and "R-square values" for the 15 assets]

Compute Single Index Covariance
# compute single index model covariance/correlation
> cov.si = as.numeric(var(market.mat))*beta.hat%*%t(beta.hat) +
+ diag(diagD.hat)
> cor.si = cov2cor(cov.si)

# plot correlation matrix using plotcorr() from
# package ellipse
> ord <- order(cor.si[1,])
> ordered.cor.si <- cor.si[ord, ord]
> plotcorr(ordered.cor.si,
+ col=cm.colors(11)[5*ordered.cor.si + 6])
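To make the structure of the single index covariance concrete, here is a small sketch with made-up inputs. The betas, residual variances and market variance below are hypothetical values, not estimates from the Berndt data, and the object names are assumptions for illustration.

```r
# Hypothetical inputs for three assets
beta.sim  <- c(A = 0.5, B = 1.0, C = 1.5)        # market betas
resid.var <- c(A = 0.004, B = 0.006, C = 0.010)  # residual variances
sig2.m    <- 0.05^2                              # market variance

# covariance = market variance * beta beta' + diagonal residual variances
cov.sim <- sig2.m * tcrossprod(beta.sim) + diag(resid.var)
dimnames(cov.sim) <- list(names(beta.sim), names(beta.sim))
cor.sim <- cov2cor(cov.sim)

# off-diagonal covariances come only from the common market factor
cov.sim["A", "B"]            # same as sig2.m * 0.5 * 1.0

# with positive residual variances the matrix is positive definite
min(eigen(cov.sim, symmetric = TRUE)$values) > 0
```

Only one beta and one residual variance per asset are needed, which is why the factor-based covariance is better conditioned than a full sample covariance when the number of assets is large relative to the sample size.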

Single Index Correlation Matrix

[Figure: ellipse plot of the single index model correlation matrix for the 15 assets]

Sample Correlation Matrix

[Figure: ellipse plot of the sample correlation matrix for the 15 assets]

Minimum Variance Portfolio

# use single index covariance
> w.gmin.si = solve(cov.si)%*%rep(1,nrow(cov.si))
> w.gmin.si = w.gmin.si/sum(w.gmin.si)
> colnames(w.gmin.si) = "single.index"

# use sample covariance

> w.gmin.sample =
+ solve(var(returns.mat))%*%rep(1,nrow(cov.si))
> w.gmin.sample = w.gmin.sample/sum(w.gmin.sample)
> colnames(w.gmin.sample) = "sample"
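The weight calculation above can be wrapped in a small helper. This is a minimal sketch: gmin.weights() and the 3 x 3 covariance matrix cov.toy are hypothetical and serve only to illustrate the formula (inverse covariance times a vector of ones, rescaled to sum to one).

```r
# Hypothetical helper: global minimum variance weights for a covariance matrix
gmin.weights <- function(cov.mat) {
  ones <- rep(1, nrow(cov.mat))
  w <- solve(cov.mat, ones)   # solves cov.mat %*% w = ones without forming the inverse
  w / sum(w)                  # rescale so the weights sum to one
}

# made-up covariance matrix for three assets
cov.toy <- matrix(c(0.04, 0.01, 0.00,
                    0.01, 0.09, 0.02,
                    0.00, 0.02, 0.16), 3, 3,
                  dimnames = list(c("A", "B", "C"), c("A", "B", "C")))
w.toy <- gmin.weights(cov.toy)

sum(w.toy)                    # weights sum to one
# at the optimum every asset has the same covariance with the portfolio
marg <- as.vector(cov.toy %*% w.toy)
marg
```

The same helper could be applied to either the single index covariance or the sample covariance to compare the resulting weights.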

[Figure: bar plots of global minimum variance portfolio weights, "Single Index Weights" and "Sample Weights"]

Estimate Single Index Model in Loop

> asset.names = colnames(returns.mat)
> asset.names
[1] "CITCRP" "CONED" "CONTIL" "DATGEN" "DEC"
[6] "DELTA" "GENMIL" "GERBER" "IBM" "MOBIL"
[11] "PANAM" "PSNH" "TANDY" "TEXACO" "WEYER"

# initialize list object to hold regression objects

> reg.list = list()
# loop over all assets and estimate regression
> for (i in asset.names) {
+ reg.df = berndt.df[, c(i, "MARKET")]
+ si.formula = as.formula(paste(i,"~",
+ "MARKET", sep=" "))
+ reg.list[[i]] = lm(si.formula, data=reg.df)
+ }

List Output
> names(reg.list)
[1] "CITCRP" "CONED" "CONTIL" "DATGEN" "DEC"
[6] "DELTA" "GENMIL" "GERBER" "IBM" "MOBIL"
[11] "PANAM" "PSNH" "TANDY" "TEXACO" "WEYER"
> class(reg.list$CITCRP)
[1] "lm"
> reg.list$CITCRP

Call:
lm(formula = si.formula, data = reg.df)

Coefficients:
(Intercept) MARKET
0.00252 0.66778

Regression Summary Output

> summary(reg.list$CITCRP)

Call:
lm(formula = si.formula, data = reg.df)

Residuals:
Min 1Q Median 3Q Max
-0.16432 -0.05012 0.00226 0.04351 0.22467

Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 0.00252 0.00626 0.40 0.69
MARKET 0.66778 0.09007 7.41 2.0e-11 ***

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 0.0672 on 118 degrees of freedom

Multiple R-squared: 0.318, Adjusted R-squared: 0.312
F-statistic: 55 on 1 and 118 DF, p-value: 2.03e-11

Plot Actual and Fitted Values:
Time Series
# use chart.TimeSeries() function from
# PerformanceAnalytics package
> dataToPlot = cbind(fitted(reg.list$CITCRP),
+ berndt.df$CITCRP)
> colnames(dataToPlot) = c("Fitted","Actual")
> chart.TimeSeries(dataToPlot,
+ main="Single Index Model for CITCRP",
+ colorset=c("black","blue"),
+ legend.loc="bottomleft")

[Figure: "Single Index Model for CITCRP", time series of fitted and actual values, Jan 78 to Jan 87]

Plot Actual and Fitted Values:
Cross Section

> plot(berndt.df$MARKET, berndt.df$CITCRP,
+ main="SI model for CITCRP",
+ type="p", pch=16, col="blue",
+ xlab="MARKET", ylab="CITCRP")
> abline(h=0, v=0)
> abline(reg.list$CITCRP, lwd=2, col="red")

[Figure: "SI model for CITCRP", scatter plot of CITCRP against MARKET with the fitted regression line]

Extract Regression Information 1
## extract beta values, residual sd's and R2's from list
## of regression objects by brute force loop
> reg.vals = matrix(0, length(asset.names), 3)
> rownames(reg.vals) = asset.names
> colnames(reg.vals) = c("beta", "residual.sd",
+ "r.square")
> for (i in names(reg.list)) {
+ tmp.fit = reg.list[[i]]
+ tmp.summary = summary(tmp.fit)
+ reg.vals[i, "beta"] = coef(tmp.fit)[2]
+ reg.vals[i, "residual.sd"] = tmp.summary$sigma
+ reg.vals[i, "r.square"] = tmp.summary$r.squared
+ }

Regression Results
> reg.vals
beta residual.sd r.square
CITCRP 0.66778 0.06716 0.31777
CONED 0.09102 0.05010 0.01532
CONTIL 0.73836 0.14260 0.11216
DATGEN 1.02816 0.10688 0.30363
DEC 0.84305 0.08102 0.33783
DELTA 0.48946 0.09029 0.12163
GENMIL 0.26776 0.06268 0.07919
GERBER 0.62481 0.07697 0.23694
IBM 0.45302 0.05046 0.27523
MOBIL 0.71352 0.06407 0.36882
PANAM 0.73014 0.12251 0.14337
PSNH 0.21263 0.10896 0.01763
TANDY 1.05549 0.10565 0.31986
TEXACO 0.61328 0.06808 0.27661
WEYER 0.81687 0.06445 0.43083

Extract Regression Information 2
# alternatively use R apply function for list
# objects - lapply or sapply
extractRegVals = function(x) {
  # x is an lm object
  beta.val = coef(x)[2]
  residual.sd.val = summary(x)$sigma
  r2.val = summary(x)$r.squared
  ret.vals = c(beta.val, residual.sd.val, r2.val)
  names(ret.vals) = c("beta", "residual.sd",
                      "r.square")
  return(ret.vals)
}
> reg.vals = sapply(reg.list, FUN=extractRegVals)

Regression Results
> t(reg.vals)
beta residual.sd r.square
CITCRP 0.66778 0.06716 0.31777
CONED 0.09102 0.05010 0.01532
CONTIL 0.73836 0.14260 0.11216
DATGEN 1.02816 0.10688 0.30363
DEC 0.84305 0.08102 0.33783
DELTA 0.48946 0.09029 0.12163
GENMIL 0.26776 0.06268 0.07919
GERBER 0.62481 0.07697 0.23694
IBM 0.45302 0.05046 0.27523
MOBIL 0.71352 0.06407 0.36882
PANAM 0.73014 0.12251 0.14337
PSNH 0.21263 0.10896 0.01763
TANDY 1.05549 0.10565 0.31986
TEXACO 0.61328 0.06808 0.27661
WEYER 0.81687 0.06445 0.43083

Industry Factor Model
# create loading matrix B for industry factor model
> n.stocks = ncol(returns.mat)
> tech.dum = oil.dum = other.dum =
+ matrix(0,n.stocks,1)
> rownames(tech.dum) = rownames(oil.dum) =
+ rownames(other.dum) = asset.names
> tech.dum[c(4,5,9,13),] = 1
> oil.dum[c(3,6,10,11,14),] = 1
> other.dum = 1 - tech.dum - oil.dum
> B.mat = cbind(tech.dum,oil.dum,other.dum)
> colnames(B.mat) = c("TECH","OIL","OTHER")

Factor Sensitivity Matrix

> B.mat
TECH OIL OTHER
CITCRP 0 0 1
CONED 0 0 1
CONTIL 0 1 0
DATGEN 1 0 0
DEC 1 0 0
DELTA 0 1 0
GENMIL 0 0 1
GERBER 0 0 1
IBM 1 0 0
MOBIL 0 1 0
PANAM 0 1 0
PSNH 0 0 1
TANDY 1 0 0
TEXACO 0 1 0
WEYER 0 0 1

Multivariate Least Squares
Estimation of Factor Returns
# returns.mat is T x N matrix, and fundamental factor
# model treats R as N x T.
> returns.mat = t(returns.mat)
# multivariate OLS regression to estimate K x T matrix
# of factor returns (K=3)
> F.hat =
+ solve(crossprod(B.mat))%*%t(B.mat)%*%returns.mat

# rows of F.hat are time series of estimated industry
# factors
> F.hat
1978-01-01 1978-02-01 1978-03-01 1978-04-01
TECH -0.0720 -0.0517500 0.0335 0.13225
OIL -0.0464 -0.0192000 0.0642 0.09920
OTHER -0.0775 -0.0006667 0.0220 0.05133
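When the loading matrix consists of non-overlapping industry dummies, the OLS factor return for an industry in a given month is just the equal-weighted average return of the stocks in that industry. The sketch below checks this on simulated data; the six stocks, their industry labels and all object names are hypothetical.

```r
# Simulated data only: 6 hypothetical stocks, 4 periods
set.seed(3)
stocks   <- paste0("S", 1:6)
industry <- c("TECH", "OIL", "TECH", "OTHER", "OIL", "OTHER")
R.sim <- matrix(rnorm(6 * 4, sd = 0.05), 6, 4,
                dimnames = list(stocks, paste0("t", 1:4)))

# N x K matrix of industry dummies
B.sim <- sapply(c("TECH", "OIL", "OTHER"),
                function(k) as.numeric(industry == k))
rownames(B.sim) <- stocks

# OLS factor returns, same formula as on the slide
F.ols <- solve(crossprod(B.sim)) %*% t(B.sim) %*% R.sim

# equal-weighted industry averages, computed directly
F.avg <- t(sapply(colnames(B.sim),
                  function(k) colMeans(R.sim[industry == k, , drop = FALSE])))

all.equal(F.ols, F.avg, check.attributes = FALSE)
```

This gives a quick sanity check for an estimated F.hat: each row should track the simple average of its industry members.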

Plot Industry Factors

# plot industry factors in separate panels - convert
# to zoo time series object for plotting with dates
> F.hat.zoo = zoo(t(F.hat), as.Date(colnames(F.hat)))
> head(F.hat.zoo, n=3)
TECH OIL OTHER
1978-01-01 -0.07200 -0.0464 -0.0775000
1978-02-01 -0.05175 -0.0192 -0.0006667
1978-03-01 0.03350 0.0642 0.0220000

# panel function to put horizontal lines at zero in each
# panel
> my.panel <- function(...) {
+ lines(...)
+ abline(h=0)
+ }
> plot(F.hat.zoo, main="OLS estimates of industry factors",
+ panel=my.panel, lwd=2, col="blue")

[Figure: "OLS estimates of industry factors", separate panels for TECH, OIL and OTHER, 1978 to 1988]

GLS Estimation of Factor Returns

# compute N x T matrix of industry factor model residuals
> E.hat = returns.mat - B.mat%*%F.hat
# compute residual variances from time series of errors
> diagD.hat = apply(E.hat, 1, var)
> Dinv.hat = diag(diagD.hat^(-1))

# multivariate FGLS regression to estimate K x T matrix
# of factor returns
> H.hat = solve(t(B.mat)%*%Dinv.hat%*%B.mat) %*%
+ t(B.mat)%*%Dinv.hat
> colnames(H.hat) = asset.names
# note: rows of H sum to one so are weights in factor
# mimicking portfolios
> F.hat.gls = H.hat%*%returns.mat
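The note that the rows of H sum to one can be checked on a toy example. In the sketch below the industry labels and residual variances are made up, and the object names are assumptions; with non-overlapping industry dummies the GLS weights within an industry turn out to be proportional to the inverse residual variances.

```r
# Hypothetical setup: 6 stocks in 3 industries with made-up residual variances
industry  <- c("TECH", "OIL", "TECH", "OTHER", "OIL", "OTHER")
resid.var <- c(0.004, 0.010, 0.006, 0.003, 0.008, 0.005)
B.sim <- sapply(c("TECH", "OIL", "OTHER"),
                function(k) as.numeric(industry == k))
Dinv.sim <- diag(1 / resid.var)

# GLS weight matrix, same formula as on the slide
H.sim <- solve(t(B.sim) %*% Dinv.sim %*% B.sim) %*% t(B.sim) %*% Dinv.sim
rownames(H.sim) <- colnames(B.sim)

rowSums(H.sim)   # each row sums to one

# TECH weights versus normalized inverse residual variances
tech <- industry == "TECH"
H.sim["TECH", tech]
(1 / resid.var[tech]) / sum(1 / resid.var[tech])
```

Stocks with noisier residuals therefore receive less weight in the factor mimicking portfolio than under OLS, where every stock in an industry is weighted equally.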

GLS Factor Weights
> t(H.hat)
TECH OIL OTHER
CITCRP 0.0000 0.0000 0.19918
CONED 0.0000 0.0000 0.22024
CONTIL 0.0000 0.0961 0.00000
DATGEN 0.2197 0.0000 0.00000
DEC 0.3188 0.0000 0.00000
DELTA 0.0000 0.2233 0.00000
GENMIL 0.0000 0.0000 0.22967
GERBER 0.0000 0.0000 0.12697
IBM 0.2810 0.0000 0.00000
MOBIL 0.0000 0.2865 0.00000
PANAM 0.0000 0.1186 0.00000
PSNH 0.0000 0.0000 0.06683
TANDY 0.1806 0.0000 0.00000
TEXACO 0.0000 0.2756 0.00000
WEYER 0.0000 0.0000 0.15711

[Figure: OLS and GLS estimates of the TECH, OIL and OTHER factors, one panel per factor, 1978 to 1988]

Industry Factor Model Covariance
# compute covariance and correlation matrices
> cov.ind = B.mat%*%var(t(F.hat.gls))%*%t(B.mat) +
+ diag(diagD.hat)
> cor.ind = cov2cor(cov.ind)
# plot correlations using plotcorr() from ellipse
# package
> rownames(cor.ind) = colnames(cor.ind)
> ord <- order(cor.ind[1,])
> ordered.cor.ind <- cor.ind[ord, ord]
> plotcorr(ordered.cor.ind,
+ col=cm.colors(11)[5*ordered.cor.ind + 6])

Industry Factor Model Correlations

[Figure: ellipse plot of the industry factor model correlation matrix for the 15 assets]

Industry Factor Model Summary
> ind.fm.vals
TECH OIL OTHER fm.sd residual.sd r.square
CITCRP 0 0 1 0.07291 0.05468 0.4375
CONED 0 0 1 0.07092 0.05200 0.4624
CONTIL 0 1 0 0.13258 0.11807 0.2069
DATGEN 1 0 0 0.10646 0.07189 0.5439
DEC 1 0 0 0.09862 0.05968 0.6338
DELTA 0 1 0 0.09817 0.07747 0.3773
GENMIL 0 0 1 0.07013 0.05092 0.4728
GERBER 0 0 1 0.08376 0.06849 0.3315
IBM 1 0 0 0.10102 0.06356 0.6041
MOBIL 0 1 0 0.09118 0.06839 0.4374
PANAM 0 1 0 0.12222 0.10630 0.2435
PSNH 0 0 1 0.10601 0.09440 0.2069
TANDY 1 0 0 0.11159 0.07930 0.4950
TEXACO 0 1 0 0.09218 0.06972 0.4279
WEYER 0 0 1 0.07821 0.06157 0.3802

Global Minimum Variance Portfolios

[Figure: bar plots of global minimum variance portfolio weights, "Industry FM Weights" and "Sample Weights"]

Statistical Factor Model: Principal
Components Method
# continue to use Berndt data
> returns.mat = as.matrix(berndt.df[, c(-10, -17)])
# use R princomp() function for principal component
# analysis
> pc.fit = princomp(returns.mat)

> class(pc.fit)
[1] "princomp"
> names(pc.fit)
[1] "sdev" "loadings" "center" "scale" "n.obs"
[6] "scores" "call"

# loadings = eigenvectors, scores = principal components
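As a hedged illustration of what princomp() returns, the sketch below compares it with a direct eigen decomposition on simulated data (object names are assumptions). princomp() computes the covariance matrix with divisor n rather than n - 1, so its variances are slightly smaller than the eigenvalues of var(), while the loadings agree with the eigenvectors up to sign.

```r
# Simulated data only: 120 observations on 4 hypothetical series
set.seed(4)
n <- 120
x.sim <- matrix(rnorm(n * 4), n, 4) %*% diag(c(4, 3, 2, 1))
colnames(x.sim) <- paste0("A", 1:4)

pc.sim  <- princomp(x.sim)
eig.sim <- eigen(var(x.sim))

# squared sdev versus eigenvalues of var(): ratio is (n - 1) / n throughout
var.ratio <- pc.sim$sdev^2 / eig.sim$values
var.ratio
(n - 1) / n

# loadings equal the eigenvectors up to a sign flip per column
max.load.diff <- max(abs(abs(unclass(loadings(pc.sim))) - abs(eig.sim$vectors)))
max.load.diff
```

The divisor only rescales the reported standard deviations; the proportions of variance explained are unaffected.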

Total Variance Contributions

> summary(pc.fit)
Importance of components:
Comp.1 Comp.2 Comp.3 Comp.4 Comp.5
Standard deviation 0.2282 0.1408 0.1264 0.10444 0.09741
Proportion of Variance 0.3543 0.1349 0.1087 0.07423 0.06458
Cumulative Proportion 0.3543 0.4892 0.5979 0.67218 0.73676
Comp.6 Comp.7 Comp.8 Comp.9
Standard deviation 0.09043 0.08123 0.07731 0.06791
Proportion of Variance 0.05565 0.04491 0.04068 0.03138
Cumulative Proportion 0.79241 0.83732 0.87800 0.90938
Comp.10 Comp.11 Comp.12 Comp.13
Standard deviation 0.05634 0.05353 0.04703 0.04529
Proportion of Variance 0.02160 0.01950 0.01505 0.01396
Cumulative Proportion 0.93098 0.95048 0.96553 0.97950
Comp.14 Comp.15
Standard deviation 0.04033 0.037227
Proportion of Variance 0.01107 0.009432
Cumulative Proportion 0.99057 1.000000
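The proportion rows in this table follow directly from the standard deviations. The sketch below recomputes them from the rounded values printed above (typed in by hand, so small rounding differences are expected); the object names are illustrative.

```r
# standard deviations copied from the summary(pc.fit) output above
sdev.slide <- c(0.2282, 0.1408, 0.1264, 0.10444, 0.09741,
                0.09043, 0.08123, 0.07731, 0.06791, 0.05634,
                0.05353, 0.04703, 0.04529, 0.04033, 0.037227)

# proportion of total variance explained by each component
prop.var <- sdev.slide^2 / sum(sdev.slide^2)
cum.prop <- cumsum(prop.var)

round(prop.var[1:3], 4)   # compare with the "Proportion of Variance" row
round(cum.prop[1:3], 4)   # compare with the "Cumulative Proportion" row

# number of components needed to explain at least 90% of total variance
n.comp.90 <- which(cum.prop >= 0.90)[1]
n.comp.90
```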

Eigenvalue Scree Plot

[Figure: scree plot of pc.fit, variances of Comp.1 through Comp.9]

Loadings (eigenvectors)
> loadings(pc.fit) # pc.fit$loadings

Loadings:
Comp.1 Comp.2 Comp.3 Comp.4 Comp.5 Comp.6 Comp.7
CITCRP 0.273
CONED
CONTIL 0.377 -0.824 -0.199 0.157 0.144 -0.191
DATGEN 0.417 0.152 0.277 -0.329 0.287 -0.497
DEC 0.305 0.129 0.202 -0.141 0.368
DELTA 0.250 0.179 0.258 0.242 0.481
GENMIL 0.133 0.128 0.249 0.117
GERBER 0.167 -0.199 -0.418 0.349
IBM 0.146 0.142
142
MOBIL 0.155 0.248 -0.241 -0.459 -0.155
PANAM 0.311 0.365 -0.630 0.227 -0.343 -0.390 -0.197
PSNH -0.527 -0.692 0.249 0.360
TANDY 0.412 0.207 0.188 0.323 0.356 0.385 -0.564
TEXACO 0.132 0.245 -0.219 -0.430 -0.325
WEYER 0.265 0.131 -0.128 -0.111 0.152 0.291

Principal Component Factors
> head(pc.fit$scores[, 1:4])
Comp.1 Comp.2 Comp.3 Comp.4
1978-01-01 -0.28998 0.069162 -0.07621 0.0217151
1978-02-01 -0.14236 -0.141967 -0.01794 0.0676476
1978-03-01 0.14927 0.113295 -0.09307 0.0326150
1978-04-01 0.35056 -0.032904 0.01128 -0.0168986
1978-05-01 0.10874 0.004943 -0.04640 0.0612666
1978-06-01 -0.06948 0.041330 -0.06757 -0.0009816

Note: Scores are based on centered (demeaned) returns

[Figure: time series plot of Comp.1, Jan 78 to Jan 87]

> chart.TimeSeries(pc.fit$scores[, 1, drop=FALSE],

Direct Eigenvalue Computation
> eigen.fit = eigen(var(returns.mat))
> names(eigen.fit)
[1] "values" "vectors"
> names(eigen.fit$values) =
+ rownames(eigen.fit$vectors) = asset.names

# compare princomp output with direct eigenvalue output

> cbind(pc.fit$loadings[,1:2], eigen.fit$vectors[, 1:2])
Comp.1 Comp.2
CITCRP 0.27271 -0.085495 -0.27271 -0.085495
CONED 0.04441 0.001193 -0.04441 0.001193
CONTIL 0.37694 -0.823575 -0.37694 -0.823575
DATGEN 0.41719 0.151818 -0.41719 0.151818
DEC 0.30493 0.129067 -0.30493 0.129067
…
Notice sign change!

Compare Centered and Uncentered
Principal Component Factors
# compute uncentered pc factors from eigenvectors
# and return data
> pc.factors.uc = returns.mat %*% eigen.fit$vectors
> colnames(pc.factors.uc) =
+ paste(colnames(pc.fit$scores),".uc",sep="")
# compare centered and uncentered scores. Note sign
# change on first factor
> cbind(pc.fit$scores[,1,drop=F],
+ -pc.factors.uc[,1,drop=F])
Comp.1 Comp.1.uc
1978-01-01 -0.289978 -0.250237
1978-02-01 -0.142355 -0.102614
1978-03-01 0.149273 0.189015
1978-04-01 0.350563 0.390304
1978-05-01 0.108743 0.148484
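In the output above the two columns differ by the same amount in every row. That is expected: centering subtracts the column means before projecting, so the uncentered score exceeds the centered score by the constant (mean vector) times (eigenvector). A minimal sketch on simulated data, with hypothetical object names:

```r
# Simulated data only: 60 observations on 3 hypothetical return series
set.seed(5)
x.sim <- matrix(rnorm(60 * 3, mean = 0.01, sd = 0.05), 60, 3) %*% diag(c(3, 2, 1))
v1 <- eigen(var(x.sim))$vectors[, 1]   # first eigenvector

score.uc <- x.sim %*% v1                                       # uncentered
score.c  <- scale(x.sim, center = TRUE, scale = FALSE) %*% v1  # centered

# constant offset between the two score series
shift <- sum(colMeans(x.sim) * v1)
range(score.uc - score.c - shift)   # numerically zero
```

Since the offset is constant over time, both versions have the same variance and the same correlation with any other series.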

[Figure: "Centered and Uncentered Principal Component Factors", time series of Comp.1 and Comp.1.uc, Jan 78 to Jan 87]

Interpreting Principal Component Factor

# Compute correlation with market return

> cor(cbind(pc.factors.uc[,1,drop=F],
+ berndt.df[, "MARKET",drop=F]))
Comp.1.uc MARKET
Comp.1.uc 1.0000 -0.7657
MARKET -0.7657 1.0000

# Correlation with sign change

> cor(cbind(-pc.factors.uc[,1,drop=F],
+ berndt.df[, "MARKET",drop=F]))
Comp.1.uc MARKET
Comp.1.uc 1.0000 0.7657
MARKET 0.7657 1.0000

[Figure: time series of Comp.1.uc and MARKET, Jan 78 to Jan 87, ρ = 0.77]

Factor Mimicking Portfolio

> p1 = pc.fit$loadings[, 1]
> p1
CITCRP CONED CONTIL DATGEN DEC DELTA GENMIL
0.27271 0.04441 0.37694 0.41719 0.30493 0.25017 0.13256
GERBER IBM MOBIL PANAM PSNH TANDY TEXACO
0.16716 0.14644 0.15517 0.31067 0.08407 0.41193 0.13225
WEYER
0.26488
> sum(p1)
[1] 3.471

# create factor mimicking portfolio by normalizing
# weights to unity
> p1 = p1/sum(p1)
# normalized principal component factor
> f1 = returns.mat %*% p1

[Figure: bar plot of "Factor mimicking weights" for the 15 assets]

Estimate Factor Betas

# estimate factor betas by multivariate regression
> X.mat = cbind(rep(1,n.obs), f1)
> colnames(X.mat) = c("intercept", "Factor 1")
> XX.mat = crossprod(X.mat)
# multivariate least squares
> G.hat = solve(XX.mat)%*%crossprod(X.mat,returns.mat)
> beta.hat = G.hat[2,]
> E.hat = returns.mat - X.mat%*%G.hat
> diagD.hat = diag(crossprod(E.hat)/(n.obs-2))
# compute R2 values from multivariate regression
> sumSquares = apply(returns.mat, 2, function(x)
+ {sum( (x - mean(x))^2 )})
> R.square = 1 - (n.obs-2)*diagD.hat/sumSquares
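One way to sanity-check these betas: when the factor is the first principal component rescaled so its weights sum to one, the regression slope for each asset equals that asset's loading multiplied by the sum of the unnormalized loadings. The sketch below demonstrates this on simulated data; the object names are assumptions, and the sign of the eigenvector is fixed by hand because it is arbitrary.

```r
# Simulated data only: 120 observations on 4 positively correlated series
set.seed(6)
x.sim <- matrix(rnorm(120 * 4, sd = 0.05), 120, 4) %*% (diag(4) + 0.5)

p1 <- eigen(var(x.sim))$vectors[, 1]   # first eigenvector (loadings)
if (sum(p1) < 0) p1 <- -p1             # fix the arbitrary sign
s <- sum(p1)

# factor mimicking portfolio return with weights normalized to sum to one
f1 <- as.vector(x.sim %*% (p1 / s))

# asset-by-asset regression slopes on the normalized factor
beta.sim <- apply(x.sim, 2, function(y) coef(lm(y ~ f1))[2])

all.equal(unname(beta.sim), s * p1)
```

With the Berndt numbers shown earlier, the CITCRP loading 0.27271 times the loading sum 3.471 is about 0.947, which can be compared against the estimated beta for CITCRP.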

Regression Results
> cbind(beta.hat, diagD.hat, R.square)
beta.hat diagD.hat R.square
CITCRP 0.9467 0.002674 0.59554
CONED 0.1542 0.002444 0.04097
CONTIL 1.3085 0.015380 0.32847
DATGEN 1.4483 0.007189 0.56176
DEC 1.0586 0.004990 0.49664
DELTA 0.8685 0.005967 0.35704
GENMIL 0.4602 0.003336 0.21808
GERBER 0.5803 0.006284 0.19058
IBM 0.5084 0.002378 0.32318
MOBIL 0.5387 0.005229 0.19600
PANAM 1.0785 0.012410 0.29168
PSNH 0.2918 0.011711 0.03096
TANDY 1.4300 0.007427 0.54746
TEXACO 0.4591 0.005480 0.14455
WEYER 0.9195 0.003583 0.50904

Regression Results

[Figure: horizontal bar plots of "Beta values" and "R-square values" for the principal component factor model]

Principal Components Correlations

[Figure: ellipse plot of the principal components factor model correlation matrix for the 15 assets]

Global Minimum Variance Portfolios

[Figure: bar plots of global minimum variance portfolio weights, "Principal Component Weights" and "Sample Weights"]

WQU FundamentalsofStochasticFinance m1
No ratings yet
WQU FundamentalsofStochasticFinance m1
45 pages
Design and Analysis of Truss Using Staad Pro
67% (3)
Design and Analysis of Truss Using Staad Pro
18 pages
Oracle CRM Service Contracts Queries
No ratings yet
Oracle CRM Service Contracts Queries
55 pages
Online Book Store Project Report
100% (2)
Online Book Store Project Report
51 pages
WQU ECONOMETRICS M1 Compiled Content PDF
No ratings yet
WQU ECONOMETRICS M1 Compiled Content PDF
67 pages
Learn R Programming in 24 Hours
From Everand
Learn R Programming in 24 Hours
Alex Nordeen
No ratings yet
Essential R
No ratings yet
Essential R
183 pages
1IntrotoFinProgR [HO]
No ratings yet
1IntrotoFinProgR [HO]
28 pages
EssentialR PDF
No ratings yet
EssentialR PDF
181 pages
Financial Market Data For R
No ratings yet
Financial Market Data For R
194 pages
Tidy Portfoliomanagement in R
100% (1)
Tidy Portfoliomanagement in R
94 pages
R Socialscience
No ratings yet
R Socialscience
62 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Data Analysis and Graphics Using R 1st Edition Matthew Norman - The ebook is ready for download with just one simple click
No ratings yet
Data Analysis and Graphics Using R 1st Edition Matthew Norman - The ebook is ready for download with just one simple click
80 pages
Chronos
No ratings yet
Chronos
304 pages
Econometrics 2019 PDF
No ratings yet
Econometrics 2019 PDF
143 pages
Advanced Topics in Analysis of Economic and Financial Data Using R
No ratings yet
Advanced Topics in Analysis of Economic and Financial Data Using R
148 pages
Data Analysis and Graphics Using R 1st Edition Matthew Norman download
No ratings yet
Data Analysis and Graphics Using R 1st Edition Matthew Norman download
53 pages
Introduction To R
No ratings yet
Introduction To R
15 pages
Time Series Analysis With R - Part I
No ratings yet
Time Series Analysis With R - Part I
23 pages
R Lab File Deepak
No ratings yet
R Lab File Deepak
27 pages
Download ebooks file Tidy Finance with R First Edition Christoph Scheuch all chapters
100% (1)
Download ebooks file Tidy Finance with R First Edition Christoph Scheuch all chapters
65 pages
Introduccion A R en Mexico
No ratings yet
Introduccion A R en Mexico
29 pages
Portfolio Optimization With R Rmetrics Diethelm Wrtz Yohan Chalabi pdf download
100% (1)
Portfolio Optimization With R Rmetrics Diethelm Wrtz Yohan Chalabi pdf download
87 pages
Theory 1. R Basics
No ratings yet
Theory 1. R Basics
43 pages
R in Finance PDF
No ratings yet
R in Finance PDF
38 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Advanced Statistical Methods using R Notes
No ratings yet
Advanced Statistical Methods using R Notes
55 pages
Introduction To R
No ratings yet
Introduction To R
36 pages
Howtouser: 1 What Is R
No ratings yet
Howtouser: 1 What Is R
6 pages
An R Companion To Statistical Thinking For The 21st Century
No ratings yet
An R Companion To Statistical Thinking For The 21st Century
159 pages
Data Analysis and Graphics Using R An Example Based Approach Third Edition John Maindonald - The latest updated ebook is now available for download
100% (2)
Data Analysis and Graphics Using R An Example Based Approach Third Edition John Maindonald - The latest updated ebook is now available for download
47 pages