Fundamental Problem of Causal Inference
Definition
The problem that, for the same individual, the outcomes under treatment (W=1) and control (W=0) cannot be observed simultaneously
The core problem of causal inference, formalized by Holland (1986):
“The fundamental problem of causal inference is that we can observe at most one of the potential outcomes for each unit.”
In the equation above, only one of and can be observed.
Intuitive Understanding
Example: Drug Effect
When drug A is administered to patient Alice, only the outcome under that condition can be observed:
| Patient | Drug A outcome (observed) | No-drug outcome (counterfactual) |
|---|---|---|
| Alice | Y=recovered | ? (unobservable) |
Counterfactual outcome:
- “What would have happened if Alice had not received the drug?”
- The answer to this question cannot be observed directly
Missing Data Perspective
Causal inference can be viewed as a missing data problem:
| Unit | W | Y(1) | Y(0) | ITE |
|---|---|---|---|---|
| 1 | 1 | 5 | ? | ? |
| 2 | 0 | ? | 3 | ? |
| 3 | 1 | 7 | ? | ? |
| 4 | 0 | ? | 4 | ? |
- Treatment group: observed, missing
- Control group: observed, missing
Solution Strategies
1. Group-level Comparison (ATE estimation)
Estimate the average treatment effect (ATE) instead of the individual-level ITE:
Estimable from observational data under assumptions.
2. Leveraging Key Assumptions
Identification is possible when the three key assumptions hold:
- SUTVA
- Ignorability
- Positivity
3. Matching / Weighting
Infer counterfactual outcomes by comparing similar individuals:
- Propensity Score Matching: match individuals with similar propensities
- IPW: balance group distributions with weights
4. Model-based Estimation
Estimate potential outcomes with a predictive model:
- T-Learner: an outcome model for each treatment group
- S-Learner: estimate the treatment effect with a single unified model
RCT vs Observational Study
RCT (Randomized Controlled Trial)
Random assignment automatically satisfies Ignorability:
- The ATE can be estimated by comparing the means of the treatment/control groups
- “Gold standard” for causal inference
Observational Study
No random assignment → additional assumptions required:
- Conditional ignorability:
- Covariate adjustment required
Why Does It Matter?
-
Prediction ≠ Causation
- Prediction: “Given X, what is Y?”
- Causation: “If we change X, how does Y change?”
-
The effect of intervention
- Essential for decision-making in policy, treatment, marketing, etc.
-
Counterfactual thinking
- The ability to answer “what if we had done it differently?”
Related Concepts
- Potential Outcomes - Rubin Causal Model
- Causal Assumptions Overview - integrated overview of the assumptions
- Counterfactual Reasoning - counterfactual reasoning
- Selection Bias - bias due to differences between treatment/control groups
- ITE - Individual Treatment Effect
References
- Holland, P. W. (1986). Statistics and Causal Inference. JASA
- yaoSurveyCausalInference2021 - Section 2.5
- Potential Outcomes - Rubin (1974) framework