Lecture 12
Lecture 12
Lars A. Lochstoer
UCLA Anderson School of Management
Winter 2025
are we?
suppose some less sophisticated investors have portfolios that are biased
towards large, growth …rms (because these are more glamorous)
then, the sophisticated investors have to overweight the small, value …rms in
their portfolio of risky assets
these sophisticated investors do not and cannot hold the market portfolio
E [Ri ,t Rf ,t ] = βM ,i E [RM ,t Rf ,t ]
Then:
E Rie,t = e
βMVE ,i E RMVE ,t
= βMVE ,i E a1 RFe 1,t + a2 RFe 2,t + a3 RFe 3,t
= βMVE ,i a1 E RFe 1,t + βMVE ,i a2 E RFe 2,t + βMVE ,i a3 E RFe 3,t
= β1 E RFe 1,t + β2 E RFe 2,t + β3 E RFe 3,t
Rite = αi + βi ft + εit ,
where εit are uncorrelated across stocks and time, and ft is a traded factor (e.g.,
excess market returns)
Since ε-risk can be diversi…ed away in large portfolios, it cannot will not earn a
risk premium if markets are perfectly competitive
Two sources of variation in returns: βi ft and εit
I Risk premium on βi ft is βi E [ft ]
I Risk premium on εit is zero
I Thus, risk premium on stock should be E [Rite ] = βi E [ft ] and so ... αi = 0!
theh goal e
i is to maximize the variation in the expected excess returns E Rit vs.
E Rjte , i.e. the left hand side variables in the regression.
the CAPM predicts that αi = 0 for all assets and that variation in the cross-section
of expected returns can be explained by variation in the market times an assets βi
one-step procedure: run a time series regression of excess returns on the factor to
estimate the βi ’s
Rite = αi + βi Rmt
e
+ εit
CAPM α’s for the 25 Fama-French portfolios from a time series regression of
returns on the factors. Monthly data. 1960-2015.
CAPM β’s for the 25 Fama-French portfolios from a time series regression of
returns on the factors. Monthly data. 1960-2015.
CAPM β’s for the 25 Fama-French portfolios from a time series regression of
returns on the factors. Monthly data. 1960-2015.
the CAPM fails to capture the cross-section of expected returns built from the FF
25 portfolios sorted on size and book-to-market.
Rtm Rmt Rft , the excess return on the market, is the value-weight return on
all NYSE, AMEX, and NASDAQ stocks (from CRSP) minus the one-month
Treasury bill rate (from Ibbotson Associates).
one-step procedure: run a time series regression of excess returns on the factor to
estimate the βi ’s
De…ne the factor "risk premium" or "risk price" as λ̂ = λm , λsmb , λhml where
j
λ̂ = 1
T ∑T
t =1 Rtj for j = m, smb, hml
0
let’s plot the predicted excess returns β̂i λ̂ against the realized excess returns
R̄ie = T1 ∑T e
t =1 Rit
Fama-French 3-factors α̂’s for the 25 Fama-French portfolios from a time series
regression on the factors. Monthly data. 1960-2015.
Fama-French 3-factors β̂’s for the 25 Fama-French portfolios from a time series
regression on the factors. Monthly data. 1960-2015.
Rite = αi + βi ft + εit , t = 1, . . . , T
With i.i.d errors, the asymptotic test statistic for the pricing errors is:
" # 1
f¯ 2 1
T 1+ 2 α̂0 Σ̂ α̂ χ2N .
σ̂f
With i.i.d Normally distributed errors, the exact small-sample test statistic for the
pricing errors is:
h i 1
0 1
(T N K )/N 1 + f̄ Σ̂f f̄ α̂0 Σ̂ε 1 α̂ FN ,T N K
where Σ̂ε denotes the covariance matrix of εt , Σ̂f denotes the covariance matrix
of the factors ft , f̄ is the average factor, and α̂ are the OLS estimates of α.
The Fama-French 3-factor model is also rejected...! (though not nearly at the
same signi…cance level as the CAPM)
Lars A. Lochstoer UCLA Anderson School of Management () Winter 2025 25 / 54
Factor Models and Mean-Variance E¢ ciency:
The Data Mining Concern
wMVE ∝ Ω̂ 1
R̄ e
The in-sample mean variance e¢ cient portfolio therefore has squared Sharpe ratio:
2
SRMVE = R̄ e 0 Ω̂ 1
R̄ e
Ω̂w k R̄ e = 0
Thus:
wMVE = k Ω̂ 1
R̄ e .
k R̄ e 0 Ω̂ 1 R̄ e 2
2 e
SR (RMVE ) = = R̄ e 0 Ω̂ 1
R̄ e
k 2 R̄ e 0 Ω̂ 1 R̄ e
α̂0 Σ̂ε 1 α̂
T 0 χ2 (N )
1 + R̄Fe Σ̂F 1 R̄Fe
0
Note that R̄Fe Σ̂F 1 R̄Fe is the in-sample maximum Sharpe ratio squared obtained
using the factor portfolios only.
ΣF is the K K variance-covariance matrix of the factors
Note that α̂0 Σ̂ε 1 α̂ is the maximum Sharpe ratio squared of hedged stock returns.
Recall:
h 0 e
i
E [α̂i + ε̂it ] = E Rite ^ βi RFt = α̂i for all i and
var (α̂ + ε̂t ) = Σ̂ε (an N N matrix)
Thus, all alphas are zero if the maximum in-sample Sharpe ratio obtained using
the factors equals the maximum in-sample Sharpe ratio of the test assets!
In other words, if a linear combination of the factors is the in-sample
mean-variance e¢ cient portfolio, the factor model prices all assets perfectly
in the sense that all alphas equal zero
Lars A. Lochstoer UCLA Anderson School of Management () Winter 2025 31 / 54
The failure of the CAPM revisited
The fact that the CAPM does not work means the Sharpe ratio of the market
portfolio is far from the maximum Sharpe ratio obtainable using the test assets
Let Ω̂ denote the sample variance covariance matrix of Rt . Note that it is block
diagonal:
Σ̂F 0
Ω̂ =
0 Σ̂ε
wMVE ∝ Ω̂ 1
Rt
0
Note that Rt = R̄F0 α̂0 , where R̄F is the K 1 vector of sample factor means
and α̂ is the N 1 vector of estimated alpha for the individual assets. Thus:
From our earlier results, the max Sharpe ratio squared is:
2 0 1
SRMVE = Rt Ω̂ Rt
Σ̂F 1 0 R̄F
= R̄F0 α̂0 = R̄F0 Σ̂F 1 R̄F + α̂0 Σ̂ε 1 α̂
0 Σ̂ε 1 α̂
wα ∝ Σ ε 1 α
Adding this factor mechanically ensures, in-sample, that the factors span the
in-sample mean-variance e¢ cient portfolio
wα ∝ Σ ε 1 α
The ’…nal’factor goes long positive alpha assets and short negative alpha
assets
Seems a lot like how we do our characteristics-based factor. E.g., value and
momentum
I Go long value stocks, short growth stocks
I Go long winners, short losers, etc.
The fact that we know how to make our model work in-sample, for a given set of
test assets, makes introducing this factor vacuous in itself
We need:
Thus, given how easy it is to data-mine, we are looking for economics that explain
why we would think the phenomenon persists out-of-sample
It’s posted on BruinLearn under ’Week 9’and may serve as a useful background
reading in addition to the slides
I’ve also posted the Fama-French (1993) paper, as well as the Gibbons, Ross,
Shanken (1987) paper.
E εit (fjt f jt ) = 0 8 i, j
some textbooks/authors (implicitly) assume the factors are uncorrelated with one
another...an orthogonal factor model.
assumptions:
Cov[εit , fjt ] = 0
Then, there exist risk prices λj for each factor such that the expected return on
any security j can be stated as:
E [Rite ] = λ0 + bi 1 λ1 + bi 2 λ2 + . . . + biK λK , i = 1, . . . , N
The theory puts no restrictions on these risk prices, except when the factors are
traded assets.
(for details; see Chapter 9 of Cochrane (2005) and Chapter 6 of Campbell, Lo, and MacKinley
(1997))
NP NP NP NP
1 1 1 1
e
Rpt =
NP ∑ αi +
NP ∑ βi 1 f1t +
NP ∑ βi 2 f2t +
NP ∑ εit
i =1 i =1 i =1 i =1
NP
1
= αp + βp1 f1t + βp2 f2t +
NP ∑ εit
i =1
e
only systematic risk is left: Rpt αp + βp1 f1t + βp2 f2t
2 factor-1-mimicking portfolio
1 construct a zero-investment portfolio with β1 = 1 and β2 = 0
2
e ]
the expected (excess) return on this portfolio is: E [R1t
3 factor-2-mimicking portfolio
1 construct a zero-investment portfolio with β1 = 0 and β2 = 1
2
e ]
the expected return on this portfolio is: E [R2t
E [Rite ] λ0 + βi 1 λ1 + βi 2 λ2 ,
2 factor-1-mimicking portfolio
1 construct a zero-investment portfolio with β1 = 1 and β2 = 0
2
e ]
the expected (excess) return on this portfolio is the risk price λ1 = E [R1t
3 factor-2-mimicking portfolio
1 construct a zero-investment portfolio with β1 = 0 and β2 = 1
2
e ]
the expected return on this portfolio is the risk price λ2 = E [R2t
construct a portfolio C :
1 invest .75 in factor-1-mimicking portfolio
2 invest .75 in factor-2-mimicking portfolio
3 invest 1 .75 .75 in risk-free
hence, we can …nd the price of risk for each factor (λ1 , λ2 ) such that:
E [Rite ] = βi 1 λ1 + βi 2 λ2 , i = 1, . . . , N
Pricing in equilibrium: There exist risk prices λj for each factor such that the
expected return on any security i can be stated as:
E [Rite ] = βi 1 λ1 + βi 2 λ2 + . . . + βiK λK , i = 1, . . . , N
very general
no need to measure the return on the total wealth portfolio (or ‘the market’)
the theory does not actually tell you which factors to use!
I for a larger set of securities, you probably need more factors
White covariance matrix of residuals allows for non-normal errors terms with
time-varying volatility
Asymptotic test
Here goes...
Then, f ( β) = xt εt is a (N K) 1 vector.
where
T T
1 0 1
ST =
T ∑ ft β̂ ft β̂ =
T ∑ xt ε̂t ε̂t0 xt0
t =1 t =1
is an (N K) (N K ) matrix.
Call the N N covariance matrix of the α’s that result from extracting the odd
rows and columns from Σ ( β) for Σα,White . Then the asymptotic test is
1
α̂0 Σα,White α̂ χ2 (N )