Econometrics II CH 1
Econometrics II CH 1
University
College of Business and Economics
Econometrics II
Abebe Mucheye Kassie (M.Sc.)
Aug 2022
1
Chapter One
Regression on Dummy Variables
The nature of dummy variables
•In regression analysis the dependent variable is frequently
influenced not only by variables that can be readily quantified
on some well-defined scale
• (e.g., income, output, prices, costs, height, and temperature),
but also by variables that are essentially qualitative in nature
(e.g., sex, race, color, religion, nationality, wars, earthquakes,
strikes, political upheavals, and changes in government
economic policy).
2
3
Cont.…
• Variables that assume such 0 and 1 values are
called dummy variables.
• Alternative names are indicator variables, binary
variables, categorical variables, and dichotomous
variables.
• Dummy variables can be used in regression models just as
easily as quantitative variables.
• As a matter of fact, a regression model may contain
explanatory variables that are exclusively dummy, or
qualitative, in nature.
4
Regression on one quantitative variable and one
qualitative variable with two classes, or categories
5
Cont.…
• Model (5.03) contains one quantitative variable
(years of teaching experience) and one qualitative
variable (sex) that has two classes (or levels,
classifications, or categories), namely, male and
female.
• What is the meaning of this equation?
Assuming, as usual, that E(Ui ) = ,0 we see that
6
Cont….
7
Dummy variable trap
8
Dummy
• The group, category, or classification that is
assigned the value of 0 is often referred to as:
The base,
Benchmark,
Control,
Comparison,
Reference, or
Omitted category.
It is the base in the sense that comparisons are
made with that category.
9
Regression on one quantitative variable and
two qualitative variables
10
Cont…
11
Cont…
12
Cont.…
13
14
15
16
17
18
19
20
Binary Choice Model
21
Binary Choice
Model
22
Cont.
23
Cont.
24
Linear probability model (LPM)
25
Linear probability model (LPM)
26
Linear probability model (LPM)
27
Logistic regression
► In Econometrics one dealt with multiple regression
with a continuous dependent variable, extending
the methods of simple linear regression.
31
♣ The logit transformation, written as logit (p). Here p is the proportion of
individuals with the characteristic.
The logit can take any value from minus infinity to plus infinity.
♣ We can fit regression models to the logit which are very similar to the
ordinary multiple regression models found for data from a normal
distribution.
eZ
P
1 e Z
33
Significance tests
► The process by which coefficients are tested for significance for
inclusion or elimination from the model involves several different
techniques.
I) Z-test
The significance of each variable can be assessed by treating
b
Z= se(b)
35
Deviance
The deviance of a model is -2 times the log likelihood (-
2LL) associated with each model.
37
The likelihood ratio test (LRT), which
makes use of the deviance, is analogous to the
F-test from linear regression.
38
Assumptions
► Logistic regression is popular in part because it
enables the researcher to overcome many of the
restrictive assumptions of OLS regression:
41
8. Large samples: Unlike OLS regression, logistic
regression uses maximum likelihood estimation
(MLE) rather than ordinary least squares (OLS) to
derive parameters.
42
Hosmer and Lemeshow Test
♣ The Hosmer -Lemeshow goodness - of - fit statistic is used to assess
whether the necessary assumptions for the application of multiple
logistic regression are fulfilled.
43
Probit model
44
,,
45
Cont…
46
Cont,,,,
47
21/02/24 48