Econometrics CRT M2: Regression Model Evaluation

The document describes regression models attempting to explain a dependent variable y using explanatory variables x1, x2, x3, and x4. Six models are presented without fully analyzing them. The best model is determined by examining each variable's coefficients and significance, the models' diagnostic statistics like R-squared and adjusted R-squared, and the explanatory power and correlation of the variables. Model 5 is preferred as it has the highest adjusted R-squared and shows little correlation among explanatory variables.

Uploaded by

Dickson phiri

ECONOMETRICS CRT M2

Regression Model Evaluation.

In the following table, the results of 6 models attempt to explain a dependent variable of interest, y. You may
assume that there is sufficient theoretical reason to consider any or all of the explanatory
variables x1, x2, x3 and x4 in a model for y, but it is unknown whether all of them are necessary to effectively
model the data generating process of y.

Provide a thorough, rigorous analysis of which of the models is preferred. Your analysis should include features of each coefficient, each model, and each of the diagnostic statistics. Do NOT analyse them one-by-one, but by theme as identified in Module 2 of Econometrics. For the preferred model, give an analysis of the likely correlation among the explanatory variables.

ANSWERS

1. A linear regression model is used to predict the value of a dependent variable Y using a set of independent variables and an intercept (constant).
When the model is estimated, it returns an estimated coefficient for each input variable, the significance of each coefficient, and a set of summary statistics for the model as a whole, such as R-squared, adjusted R-squared and the F-statistic.
2. For each coefficient we test the following null hypothesis against the alternative hypothesis.
Null hypothesis: the coefficient of the input variable is zero.
Alternative hypothesis: the coefficient is not zero.
The F-statistic tests the joint version: the null hypothesis is that the coefficients of all the input variables are zero, against the alternative that at least one of them is not.
3. Explanation of some of the key statistics in the regression output:
a. Coefficient of a variable: a positive coefficient means that the independent variable x and the dependent variable Y move in the same direction, i.e. an increase in x is associated with an increase in Y, holding the other variables constant. A negative coefficient means they move in opposite directions.
b. P-value of a variable: the p-value measures the significance of the independent variable. It is the probability of obtaining an estimate at least as extreme as the one observed if the true coefficient were zero. A smaller p-value is stronger evidence against the null hypothesis.
c. R-squared: the proportion of the variance of the dependent variable that is explained by the set of independent variables used in the model.
4. Model selection:
Step 1: Reject any model in which a variable has a p-value > 0.05. For such a variable the null hypothesis of a zero coefficient cannot be rejected at the 5% level, so the variable adds no statistically significant explanatory power and the model is not preferred.
Model 3, Model 4 and Model 6 are rejected because x4 has a p-value > 0.05 in all three models.
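The screening rule in Step 1 can be sketched in R as follows. The data here are simulated purely for illustration, and the helper `all_slopes_significant` is a hypothetical name, not something from the original answer; the p-values are read from the coefficient table that `summary()` returns for an `lm` fit.

```r
# Simulated data (hypothetical): y depends on x1 and x2 but not on x4.
set.seed(1)
n  <- 200
x1 <- rnorm(n)
x2 <- rnorm(n)
x4 <- rnorm(n)
y  <- 0.5 * x1 + 0.8 * x2 + rnorm(n)
dat <- data.frame(y, x1, x2, x4)

# TRUE when every slope coefficient has a p-value below `alpha`.
all_slopes_significant <- function(fit, alpha = 0.05) {
  p <- coef(summary(fit))[-1, "Pr(>|t|)"]  # drop the intercept row
  all(p < alpha)
}

fit_a <- lm(y ~ x1 + x2, data = dat)
fit_b <- lm(y ~ x1 + x2 + x4, data = dat)

print(coef(summary(fit_b)))            # estimates, std. errors, t values, p-values
print(all_slopes_significant(fit_a))
print(all_slopes_significant(fit_b))
```

A model for which the helper returns FALSE would be dropped at this step, which is how Models 3, 4 and 6 are treated in the text.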

Step 2: Check the proportion of the variance in the dependent variable that is explained by the independent variables by looking at R-squared.

R-squared of Model 5 (0.4638) > Model 2 (0.4148) > Model 1 (0.3775)

Judged on R-squared alone, Model 5 looks best. However, it is too early to conclude this, because R-squared never decreases when another independent variable is added to a model.

Model 5 has 3 variables, more than Model 1 and Model 2, so its higher R-squared may be due in part to the additional variable.

Between Model 1 and Model 2, however, we can conclude that Model 2 is better, as both models have the same number of variables.
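To see why R-squared alone cannot settle the comparison, the following minimal sketch adds a regressor that is unrelated to y by construction. The data are simulated and all names are hypothetical.

```r
# Simulated data (hypothetical): `noise` plays no role in generating y.
set.seed(2)
n     <- 200
x1    <- rnorm(n)
x2    <- rnorm(n)
noise <- rnorm(n)
y     <- 1 + 0.6 * x1 + 0.4 * x2 + rnorm(n)

small <- summary(lm(y ~ x1 + x2))
large <- summary(lm(y ~ x1 + x2 + noise))

# R-squared cannot fall when a regressor is added; adjusted R-squared can.
cat("R-squared:         ", small$r.squared, "->", large$r.squared, "\n")
cat("Adjusted R-squared:", small$adj.r.squared, "->", large$adj.r.squared, "\n")
```

This is the reason a penalised measure is needed when models with different numbers of variables are compared.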

Step 3: Compute adjusted R-squared to select the best model

Adjusted R-squared adds a penalty to R-squared for each additional variable in the model. It can therefore be used to compare models with different numbers of parameters.

$$R^2_{\text{adj}} = 1 - (1 - R^2)\,\frac{n - 1}{n - p - 1}$$

where
$R^2$ = R-squared value of the model
$n$ = number of observations in the model
$p$ = number of independent variables used

The question does not give the number of observations, so we recover it from the F-statistic:

$$F = \frac{R^2\,(n - p - 1)}{(1 - R^2)\,p}$$

In Model 5, $R^2 = 0.4638$, $p = 3$ and $F = 56.51$. Substituting into the formula above:

$$56.51 = \frac{0.4638\,(n - 3 - 1)}{(1 - 0.4638) \times 3}$$

so $n - 4 \approx 196$, and thus $n = 200$.
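The substitution above can be checked numerically. This short sketch only rearranges the F-statistic formula for n and plugs in the Model 5 values quoted in the text; the variable names are illustrative.

```r
# Values reported for Model 5 in the text
r2 <- 0.4638
p  <- 3
f  <- 56.51

# F = R2 (n - p - 1) / ((1 - R2) p)  =>  n = F (1 - R2) p / R2 + p + 1
n_implied <- f * (1 - r2) * p / r2 + p + 1

print(n_implied)         # not exactly an integer because the inputs are rounded
print(round(n_implied))
```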
Using this value of n in the adjusted R-squared formula for Model 2 and Model 5 we get:

Model 2 ($R^2 = 0.4148$, $p = 2$):
$$R^2_{\text{adj}} = 1 - (1 - 0.4148)\,\frac{200 - 1}{200 - 2 - 1} = 0.4089$$

Model 5 ($R^2 = 0.4638$, $p = 3$):
$$R^2_{\text{adj}} = 1 - (1 - 0.4638)\,\frac{200 - 1}{200 - 3 - 1} = 0.4556$$

Model 5 clearly has the higher adjusted R-squared. Hence, it is the best model among the set of models in the question.
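The adjusted R-squared comparison can be reproduced with a small helper. This is a sketch under the assumptions already made in the text (n = 200, two variables in Model 2, three in Model 5); the function name `adj_r2` is hypothetical.

```r
# Adjusted R-squared from R-squared, sample size n and number of regressors p
adj_r2 <- function(r2, n, p) 1 - (1 - r2) * (n - 1) / (n - p - 1)

n <- 200  # recovered from the Model 5 F-statistic in the text

models <- data.frame(
  model = c("Model 2", "Model 5"),
  r2    = c(0.4148, 0.4638),
  p     = c(2, 3)
)
models$adj_r2 <- round(adj_r2(models$r2, n, models$p), 4)

print(models)
```

Because the helper is vectorised, further candidate models could be added as extra rows of the data frame.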

For the preferred model, give an analysis of the likely correlation among the explanatory variables.

We have the following model:

Y = 0.07374 – 0.0813 X1 + 0.33752 X2 + 0.23387 X3

Y is positively related to X2 and X3 and negatively related to X1, holding the other variables constant.

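The variance inflation factor (VIF) of an explanatory variable $x_j$ is

$$\mathrm{VIF}_j = \frac{1}{1 - R_j^2}$$

where $R_j^2$ is the R-squared from regressing $x_j$ on the other explanatory variables. The calculation below substitutes the R-squared of Model 5 itself, which is only a rough stand-in for those auxiliary R-squared values:

$$\mathrm{VIF} = \frac{1}{1 - 0.4638} = 1.8650$$

On this approximation, the correlation among the explanatory variables appears to be small. A proper check would compute the VIF of each of X1, X2 and X3 from its own auxiliary regression.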
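For reference, a VIF based on auxiliary regressions can be sketched as below. The regressors are simulated (the original data are not available), and `vif_one` is a hypothetical helper; each variable is regressed on the others and its R-squared is plugged into 1 / (1 - R^2).

```r
# Simulated regressors (hypothetical); x2 is mildly correlated with x1.
set.seed(3)
n  <- 200
x1 <- rnorm(n)
x2 <- 0.5 * x1 + rnorm(n)
x3 <- rnorm(n)
X  <- data.frame(x1, x2, x3)

# VIF_j = 1 / (1 - R_j^2), with R_j^2 from regressing x_j on the other regressors
vif_one <- function(j, X) {
  aux <- lm(reformulate(setdiff(names(X), j), response = j), data = X)
  1 / (1 - summary(aux)$r.squared)
}

vifs <- sapply(names(X), vif_one, X = X)
print(round(vifs, 3))
```

Values close to 1 indicate little linear dependence among the regressors; the dependent variable plays no part in the calculation.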
(3) Write the Fama-French 3 factor model equation, specifying what each term means

E(R) = Rf + β1 (Rm − Rf ) + β2 (SMB) + β3 (HML) + α

E(R) = Expected rate of return

Rf = Risk-free rate

β1, β2, β3 = Factor coefficients

(Rm − Rf ) = Market risk premium

SMB = Historic excess returns of small-cap companies over large-cap companies

HML = Historic excess returns of value stocks over growth stocks

α = Alpha, the intercept: the part of the return that is not explained by the three factors
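For estimation, the equation is commonly rearranged so that the asset's excess return is regressed on the three factors. The form below is a sketch of that rearrangement; the subscript $i$ for the asset and the error term $\varepsilon$ are symbols introduced here and do not appear in the original.

```latex
R_i - R_f = \alpha + \beta_1 (R_m - R_f) + \beta_2\,\mathrm{SMB} + \beta_3\,\mathrm{HML} + \varepsilon
```

In this form $\alpha$ is the regression intercept and $\beta_1, \beta_2, \beta_3$ are the slope coefficients on the factors.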

(4) Explain in words how the model improves upon CAPM

According to the Fama-French three-factor model, small-cap companies tend to outperform large-cap companies and value companies tend to outperform growth companies. CAPM explains returns with the market risk premium alone; the Fama-French model extends it with the SMB and HML factors to adjust for these outperformance tendencies.

(5) Formulate the Fama-French regression using your stock’s returns, all the Fama-French factors, and the
benchmark returns.

(The opening of the call that loads ffdata, including its file path and the first column specification, is missing from the source.)

    MKTRF = col_number(), SMB = col_number(),
    HML = col_number(), RF = col_number(),
    TR = col_number(), XF = col_number()))
View(ffdata)

# Columns 2, 3, 4 and 7 hold the three factors and the dependent variable XF
MKTRF <- ffdata[,2]
SMB <- ffdata[,3]
HML <- ffdata[,4]
XF <- ffdata[,7]

# Fama-French three-factor regression
ffregression <- lm(XF ~ MKTRF + SMB + HML, data = ffdata)
ffregression

Call:
lm(formula = XF ~ MKTRF + SMB + HML, data = ffdata)

Coefficients:
(Intercept)        MKTRF          SMB          HML
    -0.8693       1.1497       0.9102      -0.3090

SUMMARY OF THE REGRESSION RESULTS

print(summary(ffregression))

Call:
lm(formula = XF ~ MKTRF + SMB + HML, data = ffdata)

Residuals:
    Min      1Q  Median      3Q     Max
-15.148  -1.244   0.174   1.506  16.096

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)  -0.8693     0.1872  -4.644 5.55e-06 ***
MKTRF         1.1497     0.2366   4.860 2.09e-06 ***
SMB           0.9102     0.4171   2.182    0.030 *
HML          -0.3090     0.3198  -0.966    0.335

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 2.934 on 247 degrees of freedom
(1 observation deleted due to missingness)
Multiple R-squared: 0.1374, Adjusted R-squared: 0.1269
F-statistic: 13.11 on 3 and 247 DF, p-value: 5.653e-08
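The summary statistics in this output are tied together by the same formulas used earlier for adjusted R-squared and the F-statistic. As a rough consistency check, the sketch below rebuilds them from the reported R-squared and degrees of freedom; the variable names are illustrative and small differences arise because the inputs are rounded.

```r
# Figures reported in the three-factor regression output
r2       <- 0.1374
k        <- 3      # number of factors
df_resid <- 247    # residual degrees of freedom

n_used <- df_resid + k + 1   # observations actually used in the fit
f_stat <- (r2 / k) / ((1 - r2) / df_resid)
adj_r2 <- 1 - (1 - r2) * (n_used - 1) / df_resid
p_val  <- pf(f_stat, k, df_resid, lower.tail = FALSE)

print(c(n = n_used, F = f_stat, adj_r2 = adj_r2, p_value = p_val))
```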

(6) What is the Greek letter used in front of each factor?

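Beta (β) is the Greek letter in front of each factor: β1, β2 and β3 are the factor coefficients in the Fama-French equation above. Alpha (α) is the intercept, i.e. the excess return over the benchmark that the factors do not explain.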

(7) Which model performed better?

CAPM

SUMMARY OF THE REGRESSION RESULTS

# CAPM: market factor only (this overwrites the three-factor fit stored in ffregression)
ffregression <- lm(XF ~ MKTRF, data = ffdata)
print(summary(ffregression))

Call:
lm(formula = XF ~ MKTRF, data = ffdata)

Residuals:
     Min       1Q   Median       3Q      Max
-15.6947  -1.2038   0.1935   1.4596  16.1200

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept)  -0.8920     0.1877  -4.751 3.42e-06 ***
MKTRF         1.3153     0.2266   5.804 1.97e-08 ***

Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 2.953 on 249 degrees of freedom
(1 observation deleted due to missingness)
Multiple R-squared: 0.1192, Adjusted R-squared: 0.1156
F-statistic: 33.68 on 1 and 249 DF, p-value: 1.965e-08
The Fama-French 3 factor model has the following summary statistics:

Adjusted R^2 = 12.69%
F-statistic = 13.11
p-value = 5.653e-08

CAPM has the following summary statistics:

Adjusted R^2 = 11.56%
F-statistic = 33.68
p-value = 1.965e-08

Both regressions are significant overall. The Fama-French 3 factor model fits the data better because, unlike CAPM, it accounts for the tendency of small-cap stocks to outperform large-cap stocks and of value stocks to outperform growth stocks.

Choosing the model based on adjusted R^2:
The adjusted R^2 of the Fama-French model (12.69%) is higher than that of CAPM (11.56%), hence the Fama-French 3 factor model performed better than CAPM. The improvement is modest, and in the three-factor regression HML is not significant at the 5% level (p = 0.335).
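Because CAPM is nested in the three-factor model and both fits use the same 251 observations, a partial F-test for the two added factors is a natural complement to the adjusted R^2 comparison. This is not the method used in the answer above; it is a sketch computed from the rounded R-squared values in the two outputs, with illustrative variable names.

```r
# R-squared values reported in the two regression outputs
r2_capm  <- 0.1192   # restricted model: MKTRF only
r2_ff    <- 0.1374   # full model: MKTRF + SMB + HML
q        <- 2        # number of added factors (SMB and HML)
df_resid <- 247      # residual degrees of freedom of the full model

# Partial F-statistic for H0: the SMB and HML coefficients are both zero
f_partial <- ((r2_ff - r2_capm) / q) / ((1 - r2_ff) / df_resid)
p_partial <- pf(f_partial, q, df_resid, lower.tail = FALSE)

print(c(F = f_partial, p_value = p_partial))
```

With the fitted models available, `anova()` applied to the CAPM fit and the three-factor fit gives the same test directly, provided the two fits are stored under different names.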
