0% found this document useful (0 votes)

21 views

Lab - Lecture 1

This document provides instructions for students to conduct a hands-on computer exercise to estimate a simple linear regression model using STATA. It uses hypothetical weekly income and consumption expenditure data to estimate a regression model and interpret the results. Key steps covered include generating a scatter plot to assess linear fit, running the regression, interpreting coefficients and goodness of fit, conducting hypothesis tests on coefficients, and overlaying the regression line on the scatter plot.

Uploaded by

Sohel Rana Sohag

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

Lab - Lecture 1

Uploaded by

Sohel Rana Sohag

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Hajee Mohammad Danesh Science & Technology University, Dinajpur

Department of Economics
ECN-311: Basic Econometrics (Lab Lecture-1)
Course Teacher: Nazrul Islam

This lecture is a sequel to your lecture sessions on the theory of CLRM, the method and
properties of OLS estimation (BLUE) and hypothesis testing. From this lecture this lecture,
you will have a hands-on computer exercise in estimating simple (bi-variate) linear regression
models, interpreting regression results, testing hypothesis and carrying out basic post
regression diagnosies. The latter include statistical tests to detect whether the assumptions of
CLRM actually hold in a given example. The above mentioned elements constitute what we
call statistical inference.

We start with a hypothetical data on weekly family income and weekly family consumption
expenditure. Theory suggests that as income increases, individual tend to increase their
consumption expenditure but not by as much as the increase in income. Although theory does
not specify the functional form of this relationship it does suggest that the marginal propensity
to consume (MPC) is less than one. If we are willing to assume a linear relationship, the
econometric model would look like equation(1):

𝑌𝑖 = 𝛽1 + 𝛽2 𝑋𝑖 + 𝑢𝑖 (1)

Where Y is weekly consumption and X is weekly income. Our objective is to estimate the
population parameters 𝛽1 and 𝛽2 based on a random sample drown from the population.

Before running any regression, however, it is important to examine the relationship between
the two variables graphically and determine if a linear model would fit the data very well. To
do so we use a simple scatter plot. A scatter plot can be drawn in STATA using the graphics
facilities.

Click <Graphics> <Twoway Plots> and select ‘Scatter’ from the plot type. And then
choose ‘income’ for the X variable and Consumption for the Y variable or you can type
scatter consm income in the command window.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Figure-1: Regression of weekly consumption on income.

250
200
consm

150
100
50

0 200 400 600 800

income

The graph suggests that there is a linear relationship between the two variables and a linear
regression model will be appropriate.

In addition to the scatter plot, you can supplement your understanding of the data by looking
at the summary statistics (The mean, median, the inter-quartile range etc.)

Estimation
Running regression in STATA is simple and straightforward. We use the regress command.

Regress consm income (reg for short) requests stata to run regression of the dependent
variable consumption on the explanatory (predictor) variable income.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

STATA reports three blocks of results each providing information on different aspects of the
regression model. It is not just the coefficients and their significance that a researcher should
look for the entire set of information is important for a complete interpretation of the results.

̂
To begin with we can write the model as: 𝐶𝑜𝑛𝑠𝑢𝑚𝑝𝑡𝑖𝑜𝑛 = 70.21 + 0.245 𝑖𝑛𝑐𝑜𝑚𝑒

This result suggests that for a one dollar increase in income, average consumption rises by
0.245 cents. Which upon initial inspection appears to support the Keynesian theory of
consumption stated above. In the model 𝛽1 = 70.21 which is the intercept. Often the
intercept does not carry much of an economic meaning -but in this case it can be viewed as
autonomous consumption, i.e., the average consumption when income is zero.

The reported 𝑅 2 suggests that income explains 94.66% of the total variation in consumption.
In other words, our model explains 91.25% of the interpersonal differences in consumption
expenditure. In the regression results reported above SS stands for sum of squares. Therefore,
the Model SS, i.e., 5408.4973 stands for ESS (the Explained Sum of Squares) and Residual
SS which equals 5185.46934 corresponds to the RSS (Residual Sum of Squares); measures
which are discussed in the lecture sessions. The Total SS is the sum of the ESS and RSS. As
you already know the 𝑅 2 = 𝐸𝑆𝑆⁄𝑇𝑆𝑆.

Hypothesis Testing

The reported t-statistic which equals 17.09 is based on the null hypothesis that the coefficient
of income in this model is equal to zero, i.e., 𝐻0 : 𝛽2 = 0 against 𝐻1 : 𝛽2 ≠ 0

̂2 −𝐸(𝛽̂
𝛽 2)
̂ −𝛽
𝛽2 2
We know 𝑡 = ̂ = 𝑠𝑒(𝛽
̂)
𝑠𝑒(𝛽2 2

For this reason, the t-statistic for the coefficient of income reported above is the ratio of the
coefficient to its own standard error, i.e., t=0.2453/0.0144=17.09 (except for rounding error).
This is calculated t value.

To test the stated hypothesis at 5% level of significance, you need to look at the t-distribution
table.
1
Since this is a two tailed test, you need to use the top row and look for 0.025 (2 𝛼 where 𝛼 =
0.05) with a degrees of freedom equal to 28. The degrees of freedom is the total number of
observations less the number of coefficients estimated. That is critical 𝑡𝛼/2,28 = 2.048.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

The critical value is 2.048 which is far below the calculated t-statistic 17.09. We say that the
test is significant at 5% and we can thus reject the null hypothesis in favour of the alternative
hypothesis.
Notice that the 95% confidence interval confirms the same test result. If the null hypothesis
were true, the 95% confidence interval would have contained the hypothesized value, i.e.,
zero. But it does not and hence we can reject the null hypothesis with 95% level of
confidence. This means that there is a 5% chance that such an interval may contain the value
zero. The p-value gives another look at the test of significance. It tells us that chances of
committing Type-I error is very low, even less than 1 in a thousand.

Although the preceding test procedures explain the practice of hypothesis testing, they do not
constitute a test of the Keynesian theory of consumption. The Keynesian theory does not
question whether the coefficient on income is zero or not. (i.e. whether there is a relationship
between consumption or not). Rather its concern is whether the marginal propensity to
consume is less than one or not. In that sense, the hypothesis should be expressed as:

𝐻0 : 𝛽2 = 1 against \𝐻1 : 𝛽2 ≠ 1
Accordingly, the t-statistic should be calculated as follows:
̂2 −𝐸(𝛽̂
𝛽 2)
̂ −𝛽
𝛽 2 2 0.245−1
|𝑡| = | ̂2 |=| 𝑠𝑒(𝛽
̂)|=| 0.01435 | = 52.61
𝑠𝑒(𝛽 2

Once again the calculated value is greater than the critical value of 2.048 confirming that our
regression result supports the Keynesian theory of consumption.
Drawing the regression line
Our model as already indicated is 𝑌𝑖 = 70.21 + 0.245𝑋𝑖 + 𝑢𝑖
Where Y is consumption and X is income. The expected value of this model is the systematic
̂1 + 𝛽
component which is often represented as 𝑌𝑖 = 𝛽 ̂2 𝑋𝑖 = 70.21 + 0.245𝑋𝑖

We call this the predicted value and we use STATA’s predict command to calculate it.

Predict yhat

This command generates a new variable yhat which contains the predicted value of
consumption for each value of income. To see the regression line, we resort back to the
graphics command.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Click <Graphics> <Overlaid twoway>
This option allows you to combine scatter plots with line graphs and many others. Here we
will combine the scatter plot we have already seen and over lay it with the regression line. All
we need to do is to choose Scatter plot for plot 1 and Line graph for plot 2. In both plots we
use income as the X-variable. In plot-1 the Y-variable will be consumption and in Plot-2 the
Y-variable will be yhat.

You can as well type the following command to get the same result.
twoway (scatter consm income) (line yhat income)
Figure-2
250
200
150
100
50

0 200 400 600 800

income

consm Fitted values

• Indicates weekly consumption and solid line indicates fitted values

Such a graph allows us to inspect how well the regression line represents the data points. We
can see that the regression curve is a good fit. Once you know how the regression line is
obtained based on the predicted value, there is even an easier way to get the same graph in
STATA.

Click <Graphics> <Regression Diagnostics> <Component-plus-Residual> and select the

independent variable which in this case is income.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Testing Normality
One of the key assumptions in regression analysis is that the error terms are normally
distributed.
There are several ways to test this assumption. In this exercise we focus on two approaches:
a) Using Histograms
b) Normal Probability plot or Quantile-Normal plot
These two are visual (informal) methods of checking normality. For these informal tests we
need to generate the residual from our regression. After running the regression, you can get
the residuals by using the predict command:

predict resid, residual here resid is the variable name we chose to give the residual
generated by this command. The option residual after the comma, tells stata to generate the
residual. This is to distinguish it from the command used to generate the fitted value.

hist resid, bin(9) draws a histogram of the residual

hist resid, bin(9) norm superimposes the normal curve on the histogram.
qnorm resid draws a normal probability plot. Here we compare the quantiles of a variable
(in this case resid) against the quantiles of a theoretical normal distribution which has the
same mean and variance as resid. If resid is normally distributed, the normal probability plot
would show an overlap between the two distributions.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Exercise A

In this exercise you are required to repeat the preceding regression on transformed variables,
i.e., on the logarithms of consumption and income. First you need to generate the logarithmic
values:
gen lnconsm=ln(consm)
gen lnincome=ln(income)

Your Task:
i) Run a regression of log-consumption on log-income and comment on your results.
ii) Draw a graph with scatter plot overlaid with the fitted value based on the log-
linear model. Does the regression line fit the data very well?
iii) Construct a 90% confidence interval for the slope coefficient.
iv) Test the null hypothesis that the income elasticity of consumption equals to one,
i.e., testing unitary elasticity. Use the confidence interval approach at 90%
confidence level.
v) Use the t-test at 1% level of significance to test the same null hypothesis that
income elasticity of consumption equals one.
vi) Test the hypothesis that the log-linear model is a regression through the origin.
Use the 5% level of significance.
vii) Test if the normality assumption holds in this regression using graphical methods.

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Measurement Procedure Comparison and Bias Estimation Using Patient Samples Approved Guideline-Third Edition
100% (10)
Measurement Procedure Comparison and Bias Estimation Using Patient Samples Approved Guideline-Third Edition
98 pages
Econometrics Cheatsheet en
100% (1)
Econometrics Cheatsheet en
3 pages
Answers Review Questions Econometrics PDF
93% (14)
Answers Review Questions Econometrics PDF
59 pages
45 Colonial Broadcasting
100% (1)
45 Colonial Broadcasting
17 pages
Time Series Forecasting Project Report
100% (3)
Time Series Forecasting Project Report
62 pages
Assignments
No ratings yet
Assignments
6 pages
Econometrics: A Simple Introduction
From Everand
Econometrics: A Simple Introduction
K.H. Erickson
3.5/5 (5)
+part 02 - AMEFA - 2024 - Introduction and Repetition
No ratings yet
+part 02 - AMEFA - 2024 - Introduction and Repetition
78 pages
Introduction To Econometrics
No ratings yet
Introduction To Econometrics
28 pages
Problem Set 05 - Solutions (Odtuclass)
No ratings yet
Problem Set 05 - Solutions (Odtuclass)
10 pages
Studenmund Top1.107
No ratings yet
Studenmund Top1.107
10 pages
Week 8 - 10
No ratings yet
Week 8 - 10
72 pages
ch08 Ans
No ratings yet
ch08 Ans
20 pages
Econometrics_Problem_Set_2
No ratings yet
Econometrics_Problem_Set_2
3 pages
Econometrics Project
100% (1)
Econometrics Project
10 pages
Prac 3
No ratings yet
Prac 3
8 pages
Answers Review Questions Econometrics
84% (25)
Answers Review Questions Econometrics
59 pages
AEA 309 - Lecture 4
No ratings yet
AEA 309 - Lecture 4
37 pages
ASSI ECOMRT1 (1)
No ratings yet
ASSI ECOMRT1 (1)
17 pages
Eco 5
No ratings yet
Eco 5
30 pages
Case
No ratings yet
Case
22 pages
Using Stata Chapter 5
No ratings yet
Using Stata Chapter 5
10 pages
Introductory Econometrics For Finance Chris Brooks Solutions To Review - Chapter 3
100% (2)
Introductory Econometrics For Finance Chris Brooks Solutions To Review - Chapter 3
7 pages
(eBook PDF) Basic Econometrics 5th Edition by Gujarati 2024 scribd download
100% (2)
(eBook PDF) Basic Econometrics 5th Edition by Gujarati 2024 scribd download
50 pages
Econometrics Methodologies
No ratings yet
Econometrics Methodologies
4 pages
Econometrics: Domodar N. Gujarati
No ratings yet
Econometrics: Domodar N. Gujarati
36 pages
Problem Set 5 With Solutions
No ratings yet
Problem Set 5 With Solutions
10 pages
Econometrics 1 Topic 1
No ratings yet
Econometrics 1 Topic 1
38 pages
Econometrics Notes of Book
No ratings yet
Econometrics Notes of Book
161 pages
ECON209__Problem_Set_2
No ratings yet
ECON209__Problem_Set_2
3 pages
Econometric S
No ratings yet
Econometric S
26 pages
Applied Econometrics Module
100% (1)
Applied Econometrics Module
142 pages
Econometric Project - Linear Regression Model
No ratings yet
Econometric Project - Linear Regression Model
17 pages
ECON410: Econometrics: Lecturers: Assist. Prof. Dr. Derviş Kırıkkaleli Department of Economics Room TO104
No ratings yet
ECON410: Econometrics: Lecturers: Assist. Prof. Dr. Derviş Kırıkkaleli Department of Economics Room TO104
41 pages
Econometrics Essay
No ratings yet
Econometrics Essay
9 pages
Regression Explaination
No ratings yet
Regression Explaination
2 pages
Jimma University: M.SC in Economics (Industrial Economics) Regular Program Individual Assignment: Econometrics
No ratings yet
Jimma University: M.SC in Economics (Industrial Economics) Regular Program Individual Assignment: Econometrics
20 pages
CHAPTER 2
No ratings yet
CHAPTER 2
17 pages
Econometrics Till Midsem
No ratings yet
Econometrics Till Midsem
236 pages
IAPRI Technical Training-Intro To Applied Econometrics 2018 06 25+-+Nicole+Mason
No ratings yet
IAPRI Technical Training-Intro To Applied Econometrics 2018 06 25+-+Nicole+Mason
29 pages
Research What Is Research?
No ratings yet
Research What Is Research?
72 pages
Instant ebooks textbook (eBook PDF) Basic Econometrics 5th Edition by Gujarati download all chapters
100% (5)
Instant ebooks textbook (eBook PDF) Basic Econometrics 5th Edition by Gujarati download all chapters
45 pages
Lecture 6
No ratings yet
Lecture 6
19 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
29 pages
Ekonometrika
No ratings yet
Ekonometrika
24 pages
Introduction First Lecture in Class
No ratings yet
Introduction First Lecture in Class
27 pages
Econometrics For Finance Chapter 3
No ratings yet
Econometrics For Finance Chapter 3
32 pages
Lecture #1
No ratings yet
Lecture #1
22 pages
Chap3 - Multiple Regression
No ratings yet
Chap3 - Multiple Regression
56 pages
03 Statistics in Regrression Analysis
No ratings yet
03 Statistics in Regrression Analysis
24 pages
Week 9 lecture slides - T
No ratings yet
Week 9 lecture slides - T
22 pages
Spring 19 Exam 1
No ratings yet
Spring 19 Exam 1
6 pages
Regression Analysis and Its Interpretation With Spss
No ratings yet
Regression Analysis and Its Interpretation With Spss
4 pages
Regression
No ratings yet
Regression
3 pages
(Ebook PDF) Basic Econometrics 5th Edition by Gujarati All Chapter Instant Download
100% (4)
(Ebook PDF) Basic Econometrics 5th Edition by Gujarati All Chapter Instant Download
41 pages
ASSIGNMENT 2 PART 1 MULTIPLE LINEAR, AUTOCORRELATION AND HETEROSCEDASTICITY
No ratings yet
ASSIGNMENT 2 PART 1 MULTIPLE LINEAR, AUTOCORRELATION AND HETEROSCEDASTICITY
10 pages
Resume Ekonometrika Bab 2
No ratings yet
Resume Ekonometrika Bab 2
6 pages
Topic 2 - Stages of Econometric Research
No ratings yet
Topic 2 - Stages of Econometric Research
16 pages
Econometrics Cheatsheet en
No ratings yet
Econometrics Cheatsheet en
3 pages
Lecture 3 Classical Linear Regression Model
No ratings yet
Lecture 3 Classical Linear Regression Model
55 pages
10 - Regression - Explained - SPSS - Important For Basic Concept
No ratings yet
10 - Regression - Explained - SPSS - Important For Basic Concept
23 pages
Short - Notes - Econometric Methods
No ratings yet
Short - Notes - Econometric Methods
22 pages
SMITH 2016 Essentials of Applied Econometrics Icindekiler
No ratings yet
SMITH 2016 Essentials of Applied Econometrics Icindekiler
3 pages
Requireme TS: Addis Ababa May, 1999
No ratings yet
Requireme TS: Addis Ababa May, 1999
157 pages
Harits 2024 IOP Conf. Ser. Earth Environ. Sci. 1359 012095
No ratings yet
Harits 2024 IOP Conf. Ser. Earth Environ. Sci. 1359 012095
18 pages
Unit - 1 1.introduction To ML
No ratings yet
Unit - 1 1.introduction To ML
74 pages
3.2. Asumsi Regresi
No ratings yet
3.2. Asumsi Regresi
14 pages
An Investment Strategy Based On Stochastic Unit Root Models
No ratings yet
An Investment Strategy Based On Stochastic Unit Root Models
8 pages
Astm D2555
No ratings yet
Astm D2555
16 pages
Network Intrusion Detection System
No ratings yet
Network Intrusion Detection System
4 pages
Mode Choice Modelling Toward Sultan Mahmud Badarudin Ii Palembang Airport With Multinomial Logistic Regression Analysis
No ratings yet
Mode Choice Modelling Toward Sultan Mahmud Badarudin Ii Palembang Airport With Multinomial Logistic Regression Analysis
1 page
Wickens, 2008 - Cognitive Failures As Predictors of Driving Errors, Lapsus
No ratings yet
Wickens, 2008 - Cognitive Failures As Predictors of Driving Errors, Lapsus
11 pages
BI and TI
No ratings yet
BI and TI
884 pages
When Can History Be Our Guide? The Pitfalls of Counterfactual Inference
No ratings yet
When Can History Be Our Guide? The Pitfalls of Counterfactual Inference
28 pages
Suitability Analysis of Pearl Oyster Farming in Lampung Bay, Pesawaran, Lampung Province, Indonesia
No ratings yet
Suitability Analysis of Pearl Oyster Farming in Lampung Bay, Pesawaran, Lampung Province, Indonesia
12 pages
Daba Abdissa, Tesfaye Adugna, Urge Gerema, and Diriba Dereje
No ratings yet
Daba Abdissa, Tesfaye Adugna, Urge Gerema, and Diriba Dereje
6 pages
Proc Logistic
No ratings yet
Proc Logistic
261 pages
Math357 Term
No ratings yet
Math357 Term
43 pages
Simple Linear Regression: Interpreting Minitab Output
No ratings yet
Simple Linear Regression: Interpreting Minitab Output
4 pages
Simple Linear Regression Analysis
No ratings yet
Simple Linear Regression Analysis
6 pages
Legese Fekede
No ratings yet
Legese Fekede
55 pages
An Overview of Artificial Intelligence Applications For Power Electronics
No ratings yet
An Overview of Artificial Intelligence Applications For Power Electronics
27 pages
Ribaj & Mexhuani (2021)
No ratings yet
Ribaj & Mexhuani (2021)
13 pages
ABE 413 merged. Olumech
No ratings yet
ABE 413 merged. Olumech
109 pages
Regresie: Tara Speranta de Viata La Nastere (Ani) y Indicele de Dezvoltare Umana (Valoare) x1
No ratings yet
Regresie: Tara Speranta de Viata La Nastere (Ani) y Indicele de Dezvoltare Umana (Valoare) x1
6 pages
Chapter 7: Heteroscedasticity
No ratings yet
Chapter 7: Heteroscedasticity
20 pages
Rethinking Sustainable Food Offering in Peru
No ratings yet
Rethinking Sustainable Food Offering in Peru
17 pages
Derivation of The Ordinary Least Squares Estimator Simple Linear Regression Case
No ratings yet
Derivation of The Ordinary Least Squares Estimator Simple Linear Regression Case
17 pages
Full Download (eBook PDF) Statistics in Context by Barbara Blatchley PDF DOCX
100% (6)
Full Download (eBook PDF) Statistics in Context by Barbara Blatchley PDF DOCX
43 pages
Glossary of Six Sigma Terms and Acronyms
No ratings yet
Glossary of Six Sigma Terms and Acronyms
13 pages

Lab - Lecture 1

Uploaded by

Lab - Lecture 1

Uploaded by

Hajee Mohammad Danesh Science & Technology University, Dinajpur

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

0 200 400 600 800

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

0 200 400 600 800

consm Fitted values

• Indicates weekly consumption and solid line indicates fitted values

Click <Graphics> <Regression Diagnostics> <Component-plus-Residual> and select the

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

hist resid, bin(9) draws a histogram of the residual

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

Nazrul Islam, LECTURER, DEPT. OF ECONOMICS, RABINDRA UNIVERSITY, SIRAJGANJ-6770

You might also like