DiD Regression

NUS BT2101 DiD Regression

Uploaded by

datnt21413ca


BT2101 AY 23/24 Semester 2

Assignment 4

DiD Regression
Load the data from Wooldridge

Select Kentucky Data

Please use data only from Kentucky.
Question (a)

• Estimate the impact of the policy based on a difference-in-differences (DiD) regression without
including any other control variables.

• Guidelines:
  o Use durat as the dependent variable.
  o Your specification should include three terms: the control/treatment group dummy, the before/after
    intervention time dummy, and a term capturing the interaction between control/treatment and
    before/after the intervention.

• Please interpret all the coefficient estimates in your regression table.
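
The specification described in the guidelines can be written out as below. This is only a sketch of the standard two-group, two-period DiD form: the symbols \beta_0 to \beta_3 and u_i are notation introduced here and do not appear in the assignment, while afchnge (after the policy change) and highearn (high-earner group) are the dummy names used in the interpretation that follows.

```latex
\[
durat_i = \beta_0 + \beta_1\, afchnge_i + \beta_2\, highearn_i
        + \beta_3\, (afchnge_i \times highearn_i) + u_i
\]
\[
\beta_3 = \bigl(E[durat \mid high, after] - E[durat \mid high, before]\bigr)
        - \bigl(E[durat \mid low, after] - E[durat \mid low, before]\bigr)
\]
```

Under this form, \beta_3 is the coefficient that measures the policy impact; the other three terms describe the baseline level, the common time change, and the pre-existing group gap.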

Coefficient interpretation
• Intercept = 6.272 means the average duration is 6.272 weeks for low earners (highearn = 0) before the policy
change. This is statistically significant at the 5% level, since p-value < 2e-16.

• Coefficient of afchnge = 0.766 is the expected change in duration for low earners from before to after the policy
change, ceteris paribus. The coefficient is statistically insignificant at the 5% level since p-value = 0.314.
Hence, we cannot reject the null hypothesis of no change in duration for low earners after the policy change.
Failing to reject is not proof that the change is exactly zero.

• Coefficient of highearn = 4.905 means that, before the policy change, the duration for high earners is expected to
be 4.905 weeks longer than for low earners, ceteris paribus. This is statistically significant at the 5% level,
since p-value = 1.3e-09.

• Coefficient of afchnge*highearn = 0.951 means the change in duration after the policy change is 0.951 weeks larger
for the high-earn group than for the low-earn group. This is the DiD estimate of the policy impact. The coefficient
is statistically insignificant at the 5% level since p-value = 0.414. Hence, we cannot reject the null hypothesis
that the policy change affected the duration of high earners no differently from that of low earners.
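
As a check on the interpretation above, the four coefficients can be turned back into the four group means, and the interaction term recovered as a difference of differences. This is a minimal sketch that uses only the coefficient values reported above; the variable names are illustrative and not from the assignment.

```python
# Reported OLS coefficients from the Question (a) regression (durat, Kentucky only).
intercept = 6.272    # low earners, before the policy change
b_afchnge = 0.766    # after-minus-before change for low earners
b_highearn = 4.905   # high-minus-low gap before the policy change
b_interact = 0.951   # coefficient of afchnge*highearn

# With only these three dummy terms the model is saturated, so each fitted
# value equals the mean duration of one of the four groups.
mean_low_before = intercept
mean_low_after = intercept + b_afchnge
mean_high_before = intercept + b_highearn
mean_high_after = intercept + b_afchnge + b_highearn + b_interact

# Difference-in-differences: change for high earners minus change for low earners.
change_high = mean_high_after - mean_high_before
change_low = mean_low_after - mean_low_before
did = change_high - change_low

print(f"low earners:  {mean_low_before:.3f} -> {mean_low_after:.3f}")
print(f"high earners: {mean_high_before:.3f} -> {mean_high_after:.3f}")
print(f"DiD estimate: {did:.3f}")
```

The DiD value computed from the group means equals the interaction coefficient, which is why that coefficient alone carries the policy effect.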

Question (b)
• Change the dependent variable to ldurat, and repeat a DiD regression similar to the one in
question (a). Please interpret all the coefficient estimates in this regression table.

Log-transformation Recap

Question (b)

• Intercept = 1.126 means the average log duration is 1.126 for low earners (highearn = 0)
before the policy change. This is statistically significant at the 5% level, since p-value < 2e-16.

• Coefficient of afchnge = 0.0077 means the duration for low earners is expected to increase by
approximately 0.77% after the policy change, ceteris paribus. This is statistically
insignificant at the 5% level, since p-value = 0.864 is large.

• Coefficient of highearn = 0.2565 means that, before the policy change, the duration for high earners is
expected to be approximately 25.65% longer than for low earners, ceteris paribus. This is statistically
significant at the 5% level, since p-value = 6.72e-08.

• Coefficient of afchnge*highearn = 0.1906 means the policy change's effect on the duration is
approximately 19.06% larger for the high-earn group than for the low-earn group, ceteris paribus. This is
the DiD estimate, and it is statistically significant at the 5% level, since p-value = 0.00542.
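
Reading a log-scale coefficient as "100 x coefficient percent" is an approximation that works well for small coefficients. The sketch below compares it with the exact conversion 100 x (exp(b) - 1), using only the coefficients reported above; the function name exact_pct is illustrative.

```python
import math

# Coefficients reported for the ldurat regression in Question (b).
coefs = {
    "afchnge": 0.0077,
    "highearn": 0.2565,
    "afchnge:highearn": 0.1906,
}


def exact_pct(beta):
    """Exact percentage change in durat implied by a log-scale coefficient."""
    return 100 * (math.exp(beta) - 1)


# Approximate reading (100 * beta) versus exact reading for each coefficient.
approx = {name: 100 * b for name, b in coefs.items()}
exact = {name: exact_pct(b) for name, b in coefs.items()}

for name in coefs:
    print(f"{name:18s} approx {approx[name]:6.2f}%   exact {exact[name]:6.2f}%")
```

For the interaction term the exact figure is about 21%, somewhat above the 19.06% approximation, while for afchnge the two readings are practically identical.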

Question (c)

• Using ldurat as the dependent variable, and the independent variables already used in
the previous question, now add more control variables: male, married, and the full set of
industry and injury type dummy variables.

• How does the coefficient of interaction term change when these other factors are
controlled? Is the estimate still statistically significant? Please explain the changes, if any.

Question (c)

The coefficient of the interaction term increased from 0.190601 to 0.230877.

It is still statistically significant at the 5% level, since the p-value is < 0.05.

The coefficient increased slightly. A possible reason is
that the added control variables absorb part of the
otherwise unexplained variation (noise) in ldurat, so the
effect is estimated more precisely. This shows up as an
even lower p-value, although it was already very low
(a gain from the richer model).

Question (c) – Alternative way to include dummy variables

Alternatively, the dummy variables can be included through as.factor().

Question (d)
• Your colleague argues that we cannot draw a causal inference due to the small
magnitude of the R-squared and adjusted R-squared in question (c).

• How will you respond to this argument? Explain.

R-squared is 0.0412 and adjusted R-squared is 0.0387.

The small multiple R-squared indicates that the covariates in the regression model
explain only about 4% of the variance observed in the dependent variable ldurat.
However, this does not indicate that the estimate is useless or biased. Model fit says
nothing about the statistical significance of the impact we aim to measure. That is
determined by the p-value of the coefficient, which remains significant.
Moreover, a causal interpretation rests on the research design rather than on R-squared.
Difference-in-Differences (DiD) with controls helps address concerns about endogeneity,
so the causal inference can still hold, provided the DiD assumptions are satisfied.
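
The point that a low R-squared does not prevent a DiD estimate from being recovered can be illustrated with simulated data. The sketch below is not the Kentucky data: the cell means, the noise level, the sample size, and the true effect of 0.2 are all hypothetical values chosen for illustration.

```python
import random

random.seed(0)

# Hypothetical setup: keys are (highearn, afchnge); values are true mean log durations.
TRUE_DID = 0.2
cell_means = {
    (0, 0): 1.1,
    (0, 1): 1.1,
    (1, 0): 1.4,
    (1, 1): 1.4 + TRUE_DID,
}
NOISE_SD = 1.25      # large individual-level noise, which drives R-squared down
N_PER_CELL = 5000

data = {
    cell: [random.gauss(mu, NOISE_SD) for _ in range(N_PER_CELL)]
    for cell, mu in cell_means.items()
}


def mean(values):
    return sum(values) / len(values)


# In the saturated DiD regression the fitted value of each cell is its sample mean.
fitted = {cell: mean(ys) for cell, ys in data.items()}
did_hat = (fitted[(1, 1)] - fitted[(1, 0)]) - (fitted[(0, 1)] - fitted[(0, 0)])

# R-squared = 1 - (residual sum of squares) / (total sum of squares).
all_y = [y for ys in data.values() for y in ys]
grand_mean = mean(all_y)
sst = sum((y - grand_mean) ** 2 for y in all_y)
ssr = sum((y - fitted[cell]) ** 2 for cell, ys in data.items() for y in ys)
r_squared = 1 - ssr / sst

print(f"estimated DiD: {did_hat:.3f} (true value {TRUE_DID})")
print(f"R-squared:     {r_squared:.3f}")
```

In this setup most of the variation in the outcome is individual noise, so R-squared stays small even though the DiD estimate lands close to the true effect. Whether the estimate is causal still depends on the DiD assumptions, which the simulation satisfies by construction.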
Question (e)

• What is the most critical assumption of the difference-in-differences model? Even if you
cannot provide conclusive proof, can you use the data to offer some qualitative
support/opposition to the validity of this assumption in this dataset? Using your own
words, discuss what plots and/or statistics would help you support/oppose this
assumption. Construct/compute these plots/statistics and make a concluding statement
describing your support/opposition to the validity of this critical assumption in this dataset.

Question (e)

The most critical assumption is that the control (baseline) and treatment groups
follow parallel trends: without the policy change, ldurat would have changed by the
same amount in both groups.

To assess parallel trends, we would plot a line graph of ldurat over time for each
group. However, with the current dataset, we are unable to plot such a graph due
to the lack of information on how ldurat changes over time. We only have sufficient
data to distinguish ldurat before and after the implementation of the policy.

What will happen if I introduce fixed effects to the DiD analysis with the
treatment group indicator? (Extra insights)

The treatment group indicator shows whether an observation is from the treatment group or not.

What will happen to the treatment group indicator if we add fixed effects to the above model_b?

Recall: a fixed effect is the effect of fixed characteristics that vary by unit but not over
time. Here the unit is each person (or individual).

If the above model includes unit fixed effects, the treatment group indicator will
be omitted automatically from the coefficient estimation.
Including Fixed-effect VS DiD
In Difference-in-Differences (DiD) analyses, the treatment group indicator may sometimes be omitted.
This happens when fixed effects (FE) are introduced, which can cause perfect multicollinearity.
If we introduce fixed effects into the model (recall from the last tutorial: a fixed effect varies across
units but not over time), we give each unit a different intercept. In this example, the unit is
each person, i.e. each individual gets one unique intercept.
In DiD analysis, the coefficient of the treatment group indicator gives the treatment group and the
control group different intercepts. So, from the intercept perspective, unit fixed effects and DiD are just
ways to capture the intercept at different levels (unit fixed effects capture individual intercepts, whereas
DiD's treatment group indicator captures the intercept of the treatment group versus the control group).
So if I include unit fixed effects and let the model estimate individual intercepts, the model will not
estimate the coefficient of the treatment group indicator. Once an intercept is estimated for each
individual, the intercept shift from being in the treatment or control group is already included. In other
words, the treatment group indicator no longer provides any new information. Individual variation
already covers the variation from an individual being in the treatment group or not.
Including Fixed-effect VS DiD
Another way to see why introducing fixed effects makes the treatment group
indicator redundant, so that it is automatically omitted from model estimation, is this:
Introducing unit fixed effects nets out the impact of all fixed characteristics
that vary across units but not over time. The treatment group indicator, meaning
whether an observation is in the treatment group or the control group, is one such fixed
characteristic: it varies across units (different individuals may be allocated to either
the treatment group or the control group) but not over time (individuals stay in the same
group at every time point).
Therefore, by introducing unit fixed effects, the effect of the treatment group
indicator is netted out automatically. In this case, the model does not need to estimate the
effect of the treatment group indicator and has to omit it,
because adding redundant information would cause perfect multicollinearity.
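
The redundancy can be seen directly by applying the within (unit-demeaning) transformation that unit fixed effects imply. The sketch below uses a tiny hypothetical two-period panel, not the assignment data: the person ids, periods, and group assignments are made up for illustration.

```python
from collections import defaultdict

# Hypothetical panel rows: (person_id, after, treated_group).
panel = [
    (1, 0, 1), (1, 1, 1),
    (2, 0, 1), (2, 1, 1),
    (3, 0, 0), (3, 1, 0),
    (4, 0, 0), (4, 1, 0),
]


def demean_by_person(rows, values):
    """Subtract each person's own mean from that person's values."""
    grouped = defaultdict(list)
    for (pid, _, _), value in zip(rows, values):
        grouped[pid].append(value)
    person_mean = {pid: sum(v) / len(v) for pid, v in grouped.items()}
    return [value - person_mean[pid] for (pid, _, _), value in zip(rows, values)]


treated = [t for _, _, t in panel]
interaction = [after * t for _, after, t in panel]

# The treatment indicator is constant within each person, so it demeans to zero.
demeaned_treated = demean_by_person(panel, treated)
# The interaction term changes over time for treated people, so it survives.
demeaned_interaction = demean_by_person(panel, interaction)

print("treated after demeaning:    ", demeaned_treated)
print("interaction after demeaning:", demeaned_interaction)
```

Because the treatment indicator has no variation left after demeaning, its coefficient cannot be estimated, while the interaction term that carries the DiD effect still varies within treated individuals.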
Thank you!
