Causal Inference: 1.1 Two Types of Causal Questions
The difference between passively observing X = x and actively intervening and setting
X = x is significant and requires different techniques and, typically, much stronger assump-
tions. This is the area known as causal inference.
1 Preliminaries
Before we jump into the details, there are a few general concepts to discuss.
1.3 Two Languages for Causation
There are two different mathematical languages for studying causation. The first is based
on counterfactuals. The second is based on causal graphs. It will not seem obvious at first,
but the two are mathematically equivalent (apart from some small details). Actually, there
is a third language called structural equation models but this is very closely related to causal
graphs.
1.4 Example
Consider this story. A mother notices that tall kids have a higher reading level than short
kids. The mother puts her small child on a device and stretches the child until he is tall.
She is dismayed to find out that his reading level has not changed.
The mother is correct that height and reading skill are associated. Put another way, you
can use height to predict reading skill. But that does not imply that height causes reading
skill. This is what statisticians mean when they say:
correlation is not causation.
On the other hand, consider smoking and lung cancer. We know that smoking and lung
cancer are associated. But we also believe that smoking causes lung cancer. In this case,
we recognize that intervening and forcing someone to smoke does change his probability of
getting lung cancer.
Despite the fact that causation and association are different, people confuse them all the
time, even people trained in statistics and machine learning. On TV recently there was a
report that good health is associated with getting seven hours of sleep. So far so good. Then
the reporter goes on to say that, therefore, everyone should strive to sleep exactly seven
hours so they will be healthy. Wrong. That’s confusing causation and association. Another
TV report pointed out a correlation between people who brush their teeth regularly and low
rates of heart disease. An interesting correlation. Then the reporter (a doctor in this case)
went on to urge people to brush their teeth to save their hearts. Wrong!

Figure 1: Left: X and Y have positive association. Right: The lines are the counterfactuals,
i.e. what would happen to each person if I changed their X value. Despite the positive
association, the causal effect is negative. If we increase X, everyone's Y values will decrease.
To avoid this confusion we need a way to discuss causation mathematically. That is,
we need some way to make P(Y ∈ A | set X = x) formal. As I mentioned earlier, there are
two common ways to do this. One is to use counterfactuals. The other is to use causal
graphs. These are two different languages for saying the same thing.
Causal inference is tricky and should be used with great caution. The main messages
are:
1. Causal effects can be estimated consistently from randomized experiments.
2. It is difficult to estimate causal effects from observational (non-randomized) experi-
ments.
3. All causal conclusions from observational studies should be regarded as very tentative.
Causal inference is a vast topic. We will only touch on the main ideas here.
2 Counterfactuals
Consider two variables X and Y . We will call X the “exposure” or the “treatment.” We
call Y the “response” or the “outcome.” For a given subject we see (Xi , Yi ). What we don’t
see is what their value of Yi would have been if we changed their value of Xi . This is called
the counterfactual. The whole causal story is made clear in Figure 1 which shows data (left)
and the counterfactuals (right).
Suppose now that X is a binary variable that represents some exposure. So X = 1 means
the subject was exposed and X = 0 means the subject was not exposed. We can address the
problem of predicting Y from X by estimating E(Y |X = x). To address causal questions,
we introduce counterfactuals. Let Y1 denote the response if the subject is exposed. Let Y0
denote the response if the subject is not exposed. Then
Y = Y1 if X = 1, and Y = Y0 if X = 0.

More succinctly,

Y = X Y1 + (1 − X) Y0.    (1)
If we expose a subject, we observe Y1 but we do not observe Y0 . Indeed, Y0 is the value we
would have observed if the subject had not been exposed. The unobserved variable is called a
counterfactual. The variables (Y0 , Y1 ) are also called potential outcomes. We have enlarged
our set of variables from (X, Y ) to (X, Y, Y0 , Y1 ). A small dataset might look like this:
X Y Y0 Y1
1 1 * 1
1 1 * 1
1 0 * 0
1 1 * 1
0 1 1 *
0 0 0 *
0 1 1 *
0 1 1 *
The asterisks indicate unobserved variables. Causal questions involve the distribution
p(y0 , y1 ) of the potential outcomes. We can interpret p(y1 ) as p(y|set X = 1) and we can
interpret p(y0 ) as p(y|set X = 0). The mean treatment effect or mean causal effect is defined
by
θ = E(Y1 ) − E(Y0 ) = E(Y |set X = 1) − E(Y |set X = 0).
The parameter θ has the following interpretation: θ is the mean response if we exposed
everyone minus the mean response if we exposed no-one.
Lemma 1 In general,

θ = E(Y1 ) − E(Y0 ) ≠ α ≡ E(Y |X = 1) − E(Y |X = 0).

However, if X is randomly assigned, then X is independent of (Y0 , Y1 ) and θ = α.

Exercise: Prove this.
The same results hold when X is continuous. In this case there is a counterfactual Y (x)
for each value x of X. We again have that, in general,
E[Y (x)] ≠ E[Y |X = x].
See Figure 1. But if X is randomly assigned, then we do have E[Y (x)] = E[Y |X = x] and
so E[Y (x)] can be consistently estimated using standard regression methods. Indeed, if we
had randomly chosen the X values in Figure 1 then the plot on the left would have been
downward sloping. To see this, note that θ(x) = E[Y (x)] is defined to be the average of the
lines in the right plot. Under randomization, X is independent of Y (x). So
right plot = θ(x) = E[Y (x)] = E[Y (x)|X = x] = E[Y |X = x] = left plot.
Adjusting For Confounding. In some cases it is not feasible to do a randomized
experiment and we must use data from observational (non-randomized) studies. Smoking
and lung cancer is an example. Can we estimate causal parameters from observational
(non-randomized) studies? The answer is: sort of.
In an observational study, the treated and untreated groups will not be comparable.
Maybe the healthy people chose to take the treatment and the unhealthy people didn’t. In
other words, X is not independent of (Y0 , Y1 ). The treatment may have no effect but we
would still see a strong association between Y and X. In other words, α might be large even
though θ = 0.
Here is a simplified example. Suppose X denotes whether someone takes vitamins and
Y is some binary health outcome (with Y = 1 meaning "healthy").
X 1 1 1 1 0 0 0 0
Y0 1 1 1 1 0 0 0 0
Y1 1 1 1 1 0 0 0 0
In this example, there are only two types of people: healthy and unhealthy. The healthy
people have (Y0 , Y1 ) = (1, 1). These people are healthy whether or not they take vitamins.
The unhealthy people have (Y0 , Y1 ) = (0, 0). These people are unhealthy whether or not
they take vitamins. The observed data are:
X 1 1 1 1 0 0 0 0
Y 1 1 1 1 0 0 0 0.
In this example, θ = 0 but α = 1. The problem is that people who choose to take
vitamins are different than people who choose not to take vitamins. That’s just another way
of saying that X is not independent of (Y0 , Y1 ).
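The numbers in the vitamin example can be checked directly. The sketch below transcribes the table as given (nothing invented) and computes θ from the full potential outcomes and α from the observed data alone:

```python
# Potential outcomes for the eight subjects in the vitamin example.
X = [1, 1, 1, 1, 0, 0, 0, 0]
Y0 = [1, 1, 1, 1, 0, 0, 0, 0]
Y1 = [1, 1, 1, 1, 0, 0, 0, 0]

# Observed outcome: Y = X*Y1 + (1 - X)*Y0, equation (1).
Y = [x * y1 + (1 - x) * y0 for x, y0, y1 in zip(X, Y0, Y1)]

# Causal effect theta: difference of means over EVERYONE's potential outcomes.
theta = sum(Y1) / len(Y1) - sum(Y0) / len(Y0)

# Association alpha: difference of observed group means.
mean_treated = sum(y for x, y in zip(X, Y) if x == 1) / X.count(1)
mean_untreated = sum(y for x, y in zip(X, Y) if x == 0) / X.count(0)
alpha = mean_treated - mean_untreated

print(theta, alpha)   # prints 0.0 1.0
```

The computation confirms the point of the example: θ = 0 (vitamins do nothing) while α = 1 (vitamin takers look much healthier), purely because of who chose to take them.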
To account for the differences in the groups, we can measure confounding variables.
These are the variables that affect both X and Y . These variables explain why the two groups
of people are different. In other words, these variables account for the dependence between
X and (Y0 , Y1 ). By definition, there are no such variables in a randomized experiment. The
hope is that if we measure enough confounding variables Z = (Z1 , . . . , Zk ), then, perhaps the
treated and untreated groups will be comparable, conditional on Z. This means that X is
independent of (Y0 , Y1 ) conditional on Z. We say that there is no unmeasured confounding,
or that ignorability holds, if
X ⫫ (Y0 , Y1 ) | Z.
The only way to measure the important confounding variables is to use subject matter
knowledge. In other words, causal inference in observational studies is not possible
without subject matter knowledge.
Then

θ ≡ E(Y1 ) − E(Y0 ) = ∫ µ(1, z)p(z)dz − ∫ µ(0, z)p(z)dz    (2)

where

µ(x, z) = E(Y |X = x, Z = z).
A consistent estimator of θ is

θ̂ = (1/n) Σⁿᵢ₌₁ µ̂(1, Zi ) − (1/n) Σⁿᵢ₌₁ µ̂(0, Zi )

where µ̂(x, z) is an appropriate, consistent estimator of the regression function µ(x, z) =
E[Y |X = x, Z = z].
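As a concrete sketch of this plug-in estimator, the code below simulates a confounded observational study (the data-generating process is invented for illustration) with binary Z, and estimates µ(x, z) by the simplest possible method, a stratified mean. The true causal effect is zero by construction, but Z drives both treatment uptake and the outcome:

```python
import random

random.seed(1)

# Hypothetical confounded study: Z encourages treatment AND drives the
# outcome; the treatment X has no effect on Y at all, so theta = 0.
n = 100_000
data = []
for _ in range(n):
    z = int(random.random() < 0.5)
    x = int(random.random() < (0.8 if z else 0.2))  # Z encourages treatment
    y = int(random.random() < (0.9 if z else 0.1))  # Z drives Y; X does not
    data.append((x, y, z))

def mu_hat(x, z):
    """Stratified-mean estimate of mu(x, z) = E[Y | X = x, Z = z]."""
    ys = [yi for xi, yi, zi in data if xi == x and zi == z]
    return sum(ys) / len(ys)

mu = {(x, z): mu_hat(x, z) for x in (0, 1) for z in (0, 1)}

# Unadjusted association: difference of raw group means.
treated = [yi for xi, yi, _ in data if xi == 1]
untreated = [yi for xi, yi, _ in data if xi == 0]
alpha_hat = sum(treated) / len(treated) - sum(untreated) / len(untreated)

# Plug-in adjusted estimate: average mu_hat over the empirical
# distribution of Z, as in the formula for theta-hat above.
theta_hat = sum(mu[(1, zi)] - mu[(0, zi)] for _, _, zi in data) / n

print(f"alpha_hat = {alpha_hat:.3f}, theta_hat = {theta_hat:.3f}")
```

In this simulation alpha_hat comes out near 0.48 while theta_hat is near 0: adjusting for Z removes the spurious association. Of course, this only works because Z really is the confounder here, which is exactly the "no unmeasured confounding" assumption.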
Remark: Estimating the quantity in (2) well is difficult and involves an area of statistics
called semiparametric inference. In statistics, biostatistics, econometrics and epidemiology,
this is the focus of much research.
Proof. We have

θ = E(Y1 ) − E(Y0 )
  = ∫ E(Y1 |Z = z)p(z)dz − ∫ E(Y0 |Z = z)p(z)dz
  = ∫ E(Y1 |X = 1, Z = z)p(z)dz − ∫ E(Y0 |X = 0, Z = z)p(z)dz
  = ∫ E(Y |X = 1, Z = z)p(z)dz − ∫ E(Y |X = 0, Z = z)p(z)dz    (3)

where we used the fact that X is independent of (Y0 , Y1 ) conditional on Z in the third line
and the fact that Y = XY1 + (1 − X)Y0 in the fourth line.
The process of including confounding variables and using equation (2) is known as adjusting
for confounders and θ̂ is called the adjusted treatment effect. The choice of the estimator
µ̂(x, z) is delicate. If we use a nonparametric method then we have to choose the smoothing
parameter carefully. Unlike prediction, bias and variance are not equally important: the
usual bias-variance tradeoff does not apply. In fact bias is worse than variance and we
need to choose the smoothing parameter smaller than usual. As mentioned above, there is
a branch of statistics called semiparametric inference that deals with this problem in detail.
It is instructive to compare the causal effect

θ = ∫ µ(1, z)p(z)dz − ∫ µ(0, z)p(z)dz

with the association

α = E(Y |X = 1) − E(Y |X = 0) = ∫ µ(1, z)p(z|X = 1)dz − ∫ µ(0, z)p(z|X = 0)dz.
In a linear regression, the coefficient in front of x is the causal effect of x if (i) the model is
correct and (ii) all confounding variables are included in the regression.
To summarize: the coefficients in linear regression have a causal interpretation if (i) the
model is correct and (ii) every possible confounding factor is included in the model.
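This claim can be illustrated with ordinary least squares on simulated data (the structural model below is invented for illustration): when the confounder Z is included in the regression, the coefficient on X recovers the causal effect; when Z is omitted, the coefficient is biased.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical structural model: Y = 2*X + 3*Z + noise, so the causal
# effect of X on Y is 2. Z confounds because X also depends on Z.
n = 50_000
Z = rng.normal(size=n)
X = Z + rng.normal(size=n)          # treatment depends on the confounder
Y = 2 * X + 3 * Z + rng.normal(size=n)

ones = np.ones(n)

# Correct regression (confounder included): coefficient on X is near 2.
beta_adj, *_ = np.linalg.lstsq(np.column_stack([ones, X, Z]), Y, rcond=None)

# Misspecified regression (Z omitted): coefficient on X is biased,
# here toward 2 + 3*Cov(X, Z)/Var(X) = 3.5.
beta_raw, *_ = np.linalg.lstsq(np.column_stack([ones, X]), Y, rcond=None)

print(f"with Z: {beta_adj[1]:.2f}, without Z: {beta_raw[1]:.2f}")
```

The gap between the two coefficients is exactly the difference between θ and α in this linear setting: the omitted-variable bias is the part of the association that flows through the confounder.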